Meme Corporate Governance

Can retail investors revolutionize corporate governance and make public companies more responsive to social concerns? Beginning in 2021, there was a dramatic influx of retail investors into the shareholder base of “meme” stock companies such as GameStop, AMC, and Bed Bath & Beyond. Observing the unprecedented, coordinated trading among retail investors, scholars and practitioners predicted that the influx of retail investors would reduce the power of large institutional investors and democratize corporate governance. These predictions were driven by three factors: generational, with assumptions that millennial and Gen Z investors would challenge corporate management; societal, reflecting growing discontent with slow progress on issues such as sustainability and boardroom diversity; and technological, with the advent of easily accessible and user-friendly mobile apps allowing investors to directly intervene in corporate governance. While plausible, these predictions have so far not been tested. This Article empirically analyzes the impact of retail investors on corporate governance, particularly at meme stock companies. We provide new quantitative evidence regarding the origins of meme investing and conclude that—despite their coordinated trading behavior in the market—meme investors have not democratized corporate governance or advanced social issues. The Article presents three principal findings. First, we show how the “meme stock” frenzy was affected by the abolition of trading commissions by major brokerages in 2019. Meme stock companies experienced positive abnormal stock returns when commission-free trading was widely introduced and saw elevated trading volumes afterward. Second, we find that despite the promise of a more active retail shareholder base, meme stock companies experienced a significant decrease in shareholder voting. Shareholder proposals have also been very limited, with most meme stock companies seeing no proposals after the rapid rise in retail ownership. Third, we do not find any improvement in meme stock companies’ corporate governance, financial performance, and social responsibility, as represented by director independence, board gender diversity, ESG scores, and capital and R&D expenditures. Collectively, our findings suggest that the influx of retail shareholders has not translated into more “democratic” governance regimes or encouraged shareholder participation in corporate governance at companies most affected by the meme investor storm.

INTRODUCTION

Buoyed by pandemic checks and the advent of commission-free mobile apps such as Robinhood, retail investors took Wall Street by storm in early 2021.1Robinhood utilizes the payment for order flow (“PFOF”) business model, under which the company receives payment from market makers in return for delivering a large order flow. For more detailed information, see Siqi Wang, Consumers Beware: How Are Your Favorite “Free” Investment Apps Regulated?, 19 Duke L. & Tech. Rev. 43, 50, 52–53 (2021); Robert H. Battalio & Tim Loughran, Does Payment for Order Flow to Your Broker Help or Hurt You?, 80 J. Bus. Ethics 37, 38, 41 (2008); and Kate Rooney & Maggie Fitzgerald, Here’s How Robinhood Is Raking in Record Cash on Customer Trades—Despite Making It Free, CNBC (Aug. 14, 2020, 10:17 AM), https://www.cnbc.com/2020/08/13/how-robinhood-makes-money-on-customer-trades-despite-making-it-free.html [https://perma.cc/KGD2-AP47]. In our companion paper, we discuss the development and evolution of the PFOF system in more detail. See generally Dhruv Aggarwal, Albert H. Choi & Yoon-Ho Alex Lee, The Meme Stock Frenzy: Origins and Implications, 96 S. Cal. L. Rev. 1387 (2024). Coordinating through social media sites such as Reddit and using catchy “memes,”2A “meme” is “a . . . chunk of information . . . [that] self-replicates because we humans like to share and repeat stuff.” See Alexis Benveniste, The Meaning and History of Memes, N.Y. Times (Jan. 26, 2022), (quoting Professor Kirby Conrod), https://www.nytimes.com/2022/01/26/crosswords/what-is-a-meme.html [https://perma.cc/KA9U-JLWU]. retail investors engaged in an active “buy” campaign to dramatically push up the GameStop stock price from $4 a share to a stratospheric level of over $485 per share. GameStop, a gaming merchandise retailer, had been losing money and seemed headed toward bankruptcy.3See generally GameStop Corp., Registration Statement (Form S-3) (Dec. 8, 2020); GameStop Corp., Quarterly Report (Form 10-Q) (June 9, 2020); GameStop Corp., Quarterly Report (Form 10-Q) (Sept. 9, 2020); GameStop Corp., Quarterly Report (Form 10-Q) (Dec. 8, 2020); GameStop Corp., Quarterly Report (Form 10-Q) (June 9, 2021); GameStop Corp., Quarterly Report (Form 10-Q) (Sept. 8, 2021); GameStop Corp., Quarterly Report (Form 10-Q) (Dec. 8, 2021); GameStop Corp., Quarterly Report (Form 10-Q) (June 1, 2022); GameStop Corp., Quarterly Report (Form 10-Q) (Sept. 7, 2022); GameStop Corp., Quarterly Report (Form 10-Q) (Dec. 7, 2022); GameStop Corp., Annual Report (Form 10-K) (Mar. 27, 2020); GameStop Corp., Annual Report (Form 10-K) (Mar. 23, 2021); GameStop Corp., Annual Report (Form 10-K) (Mar. 17, 2022). A number of hedge funds had taken large short positions against the stock, betting that the price would drop even further.4See, e.g., Laurence Fletcher, Hedge Fund That Bet Against GameStop Shuts Down, Fin. Times (June 21, 2021), https://www.ft.com/content/397bdbe9-f257-4ca6-b600-1756804517b6 [https://perma.cc/65FN-HDPB]. Meme investors seem to have been motivated to “punish” the hedge funds by driving up the stock price and creating a “short squeeze” against them.5Tim Hasso, Daniel Müller, Matthias Pelster & Sonja Warkulat, Who Participated in the GameStop Frenzy? Evidence from Brokerage Accounts, Fin. Rsch. Letters, Mar. 2022, at 1, 4. Using a sample of all trades that took place on GameStop with a broker between December 1, 2020, and February 12, 2021, the authors were able to show that many retail investors closed their positions before the price peak and other retail investors even took a short position against GameStop. Id. The evidence that many retail investors had a strong interest in taking a bet against Wall Street suggests that their interests were not merely “financial,” and they were willing to pay a price that is higher than what the firm’s financials (or “fundamentals”) dictated. Given that many meme companies, including GameStop and Bed Bath & Beyond, are performing quite poorly and many retail investors are staying loyal to these companies long after the meme surge, these long-term retail investors are also likely to be motivated by non-financial interests, such as the company’s survival. The end result was a severe loss and a subsequent retreat for the hedge funds.6See, e.g., Toby Mathis, How Much Did Hedge Funds Lose on GameStop?, Infinity Investing (Sept. 27, 2021), https://infinityinvesting.com/gamestop-hedge-fund [https://perma.cc/PT7J-3EPE]. Eventually, Melvin Capital would shut down a little more than a year later. See Reuters, Melvin Capital to Shut After Heavy Losses on Meme Stocks, Market Slump, CNN (May 19, 2022, 12:48 PM), https://www.cnn.com/2022/05/19/investing/melvin-capital-hedge-fund-closes/index.html [https://perma.cc/BJ5H-EXRC]. For a detailed exposition of how the GameStop saga unfolded in January of 2021, see generally Jill E. Fisch, GameStop and the Reemergence of the Retail Investor, 102 B.U. L. Rev. 1799 (2022). Taking advantage of the elevated stock price, GameStop raised large amounts of capital through stock sales,7See, e.g., GameStop, Prospectus Supplement 2 (Apr. 5, 2021) (“We have previously sold an aggregate of 3,500,000 shares of our common stock for aggregate gross proceeds of approximately $556,691,221 pursuant to the Sales Agreement and the prospectus supplement filed by us on April 5, 2021.”). While it is reasonable to expect most meme stock companies to raise capital during moments of meme surges, our EDGAR search of SEC filings shows that only two companies—GameStop and AMC Entertainment—took advantage of meme surges and made offerings. Other meme stock companies may have chosen not to take advantage of meme surges out of the concern that they may be blamed for knowingly selling shares at an inflated price. See, e.g., Matt Levine, Money Stuff: Meme Stocks Will Come with a Warning, Bloomberg (Feb. 9, 2021, 9:03 AM), https://www.bloomberg.com/news/newsletters/2021-02-09/the-sec-wants-reddit-meme-stocks-to-admit-they-re-dangerous-kky96vuo [https://perma.cc/CM8Z-TJXP]. After the capital raising, AMC Entertainment attempted to increase the authorized number of common shares to engage in further equity issuance, but the amendment proposal was resisted by the stockholders and was later dropped. More recently, AMC Entertainment issued AMC Preferred Equity Units (“APEs”), with the same economic rights as common stock, using the board’s authority to issue preferred stock to get around the charter amendment issue. See, e.g., Matt Levine, AMC Has Some Clever APEs, Bloomberg (Feb. 1, 2023, 10:15 AM), https://www.bloomberg.com/opinion/articles/2023-02-01/amc-has-some-clever-apes [https://perma.cc/DP3K-MQUU]. alleviating its dire liquidity condition. Retail shareholders, who have long played second-fiddle to institutional asset managers and pension funds, thus seemed to have vanquished Wall Street hedge funds and resurrected an ailing company destined for bankruptcy.

Over the ensuing months, it became clear that the GameStop saga was but one instance of more widespread meme stock surges. A number of other companies (hereinafter “meme stock companies” or “meme companies”) would experience surges in their stock prices, which, like GameStop, could not be explained by their financial fundamentals. Investing has become a social phenomenon. Furthermore, meme investing has remained a persistent phenomenon: long after the GameStop saga, meme investors continue to target regional bank stocks,8See Gunjan Banerji, Are Regional Banks the New Meme Stocks?, Wall St. J. (May 5, 2023, 6:37 PM), https://www.wsj.com/livecoverage/stock-market-today-dow-jones-05-05-2023/card/are-regional-banks-the-new-meme-stocks–6I9cBRACnUKp9dZk5ivc [https://perma.cc/33QS-YKJH]. special purpose acquisition companies (“SPACs”),9See Chris Bryant, SPAC + Meme Stock = A Dangerous Combination, Bloomberg (Sept. 27, 2022, 2:30 AM), https://www.bloomberg.com/opinion/articles/2022-09-27/what-s-wrong-with-this-spac-picture-gety-stings-warrants [https://perma.cc/5AGY-GR3M]. and even firms that have filed for bankruptcy.10See Angelique Chen & Krystal Hu, Analysis: Meme Stock Investors Bet on Bankrupt Revlon Being the Next Hertz, Reuters (June 27, 2022, 3:06 AM), https://www.reuters.com/markets/us/meme-stock-investors-place-risky-bet-bankrupt-revlon-being-next-hertz-2022-06-27 [https://perma.cc/6MJ9-RRS7]. What this persistence implies is that GameStop’s meme surge was not a one-time event: meme investing and meme surges are here to stay. Furthermore, the meme surges tell us a broader and more generalizable story about the behavior of retail investors and the impact of the change in shareholder base. The potentially transformative effect of retail investors—driven by demographic, societal, and technological factors—has emerged as one of the central debates in corporate finance and corporate governance.

These experiences have motivated scholars, practitioners, and policymakers to ask several important questions. What impact did the influx of retail shareholders have on meme stock companies? More specifically, how did the new retail shareholders affect their governance and performance? Do the meme surges signify broader and more general implications for corporate and financial law and policy?

This Article’s key contribution is to focus on meme stock companies—the firms where retail investors are most likely to have become more powerful—and empirically test how corporate governance and performance has been affected by the dramatic influx of retail investors in their shareholder base. We start by analyzing the background of meme trading. The existing scholarship has almost exclusively associated meme stocks with the surge in social media interest (such as Reddit forums) in these companies starting in 2021.11See, e.g., Sue S. Guan, Meme Investors and Retail Risk, 63 B.C. L. Rev. 2051, 2054 (2022) (defining meme investors as a “subset of retail investors that were involved in stock rallies fueled by social media”). While social media surely played an important role in popularizing these stocks, we trace the origins of meme trading further back, to the pre-pandemic era. Using an event study methodology, we find that meme stocks exhibited abnormal returns (and an abnormal increase in trading volume) in October 2019, when major brokerages abolished commissions for trading.12See infra Figure 1. This suggests that meme stock companies were well positioned to benefit from the subsequent surge in retail investor interest: zero-commission trading laid the groundwork for the subsequent surge. The emergence and the significance of zero-commission trading for the meme stock phenomenon implies more fundamental changes that can happen at other public companies and across the financial markets.

After documenting the impact of zero-commission trading on meme stocks, we proceed to examine the consequences of meme trading for corporate governance and performance at meme stock companies. Specifically, we ask: Did the influx of retail investors create a more engaged shareholder base at meme stock companies and change corporate governance or environmental, social, and governance (“ESG”) activity at these companies? To answer these questions, we begin with corporate law’s paradigmatic framework for shareholder influence in public corporations: voting and shareholder proposals.13See generally Frank H. Easterbrook & Daniel R. Fischel, Voting in Corporate Law, 26 J.L. & Econ. 395 (1983); Marcel Kahan & Edward Rock, The Hanging Chads of Corporate Voting, 96 Geo. L.J. 1227 (2008). Somewhat surprisingly, we find that non-voting—that is, the share of votes that were not cast for or against, or marked as abstentions—significantly increased in the year of the meme surge for meme stock companies, as compared with other, non-meme stock companies.14See infra Figure 3. Even more surprisingly, we find that the fraction of non-votes began to increase in 2019 and continued to increase in 2022, long before and after the meme surge of 2021. By late 2021 and early 2022 the retail investors who remain loyal to the company would presumably care more about the company’s (long-term) performance and governance.15Although we do not have a direct measure on what fraction of the non-votes came from retail shareholders, since non-voting is usually associated with retail investors (as shown in the existing literature), the finding suggests that meme traders were apathetic in their role as stockholders and did not exercise their franchise. The fact that the level of shareholder engagement seems to be getting worse in 2019 and 2022 indicates that the increase in non-vote shares is not driven just by short-term speculators.

Turning to shareholder proposals, we find no evidence that shareholders at meme stock companies are more likely to participate in governance activities by submitting shareholder proposals, either before or after the meme surge.16See infra Table 5. Between 2015 and 2022, only one meme stock company—Bed Bath & Beyond—had any shareholder proposals included in the company’s definitive proxy statements at all (three proposals, all in 2016), but these proposals predate the introduction of zero-commission trading and the influx of retail investors. Within the sample companies, there was also no record of any shareholder proposal being excluded via the Securities and Exchange Commission’s (“SEC”) no-action letter process during the sample period—apart from GameStop, which successfully excluded three shareholder proposals submitted in 2022. The evidence is consistent with retail investors brought in by the meme phenomenon being either uninterested in voting or making proposals, or unable to do so effectively.

Third, we examine whether retail investors might have had an indirect effect on meme stock companies. One of the most visible ways contemporary insurgent shareholders can affect company policy is to alter its orientation toward ESG goals. For example, in 2021, a small hedge fund (named Engine No. 1) waged a stunningly successful campaign to install three of its directors on the Exxon Mobil board to pressure the energy company to reduce its carbon footprint.17See Matt Phillips, Exxon’s Board Defeat Signals the Rise of Social-Good Activists, N.Y. Times (June 9, 2021), https://www.nytimes.com/2021/06/09/business/exxon-mobil-engine-no1-activist.html [https://perma.cc/S3ZK-MC44]. Utilizing the data from the standard MSCI ESG Indexes, we find that meme stock companies actually deteriorated in terms of prosocial performance after the meme surge of 2021.18See infra Table 6. We also look at whether meme stock companies performed better in terms of director independence or board gender diversity—other salient issues in corporate governance—and find no evidence that meme stock companies performed better (or worse) on these metrics after the surge of retail investor interest.19See infra Tables 7, 8. Thus, meme retail investors do not seem to have made their companies’ policies more prosocial or improved the quality of corporate governance. If anything, the ESG result suggests that these firms’ orientation toward social causes may have worsened in recent years.

As a final measure of indirect impact, we look at how the affected companies changed their operations and performance after both the abolition of commissions in 2019 and the meme surge of early 2021. Meme stock companies’ average return on assets (“ROA”), an important metric for profitability, has substantially worsened over the period compared with non-meme stock companies. If meme investors were engaged and pushing management to make value-increasing investments, one might have expected a rise in expenditures on research and development (“R&D”) or capital investments. These expenses could potentially help meme stock companies adjust their business model and business operations so that they can improve their long-term profitability. We instead find that meme stock companies significantly reduced R&D expenses after the influx of retail investors.20See infra Part V. Although we will explain in more detail in Part V, we do want to caution, however, that many of the meme stock companies were suffering from a liquidity crisis, which likely did not help in giving them room to make long-term investments. This result suggests that retail shareholders may not be effective in (directly or indirectly) pressuring management to make productive investments. This contrasts with the findings that an increase in institutional investor ownership is correlated with more innovative activities at firms.21See Brian J. Bushee, The Influence of Institutional Investors on Myopic R&D Investment Behavior, 73 Acct. Rev. 305, 315, 322, 328 (1998) (showing less “myopic” research & development (“R&D”) spending when the share of institutional holdings increases); Philippe Aghion, John Van Reenen & Luigi Zingales, Innovation and Institutional Ownership, 103 Am. Econ. Rev. 277, 278 (2013) (showing how increase in institutional ownership increases more innovative activities at firms, including R&D expenditure); Ian R. Appel, Todd A. Gormley & Donald B. Keim, Passive Investors, Not Passive Owners, 121 J. Fin. Econ. 111, 115, 133 (2016) (making similar findings when institutional ownership increases due to changes in Russell 1000 and 2000 index compositions); see also Ming Dong, David Hirshleifer & Siew Hong Teoh, Misvaluation and Corporate Inventiveness, 56 J. Fin. & Quantitative Analysis 2605, 2628–69 (2021) (documenting an increase in R&D activity, among others, when firms are “overvalued” due to mutual fund inflows).

Viewing these results collectively, we find that there is, so far, little evidence to suggest that corporate governance is being “democratized” in the way that the investing public has been. The organized movement among retail investors seems to be limited to their trading behavior and has not otherwise affected retail shareholders’ engagement with corporations in a noticeable way.22In a sense, retail investors can be seen as mirror images of institutional investors, who are often passive as investors, while remaining active as shareholders. See Appel et al., supra note 21. If anything, the evidence points in the opposite direction. As a large block of retail investors remain passive, paradoxically, this can give institutional shareholders, who are active shareholders,23See id. even more influence. We do not take issue with the three trends identified as potentially presaging a larger role for retail investors: generational shifts in investor attitudes, societal concerns over social and environmental issues, and technological changes making it easier for retail investors to participate in financial markets. Nevertheless, our empirical findings show that corporate governance has not significantly been democratized or changed yet, even at the companies most dramatically affected by the influx of retail shareholders.24As we explain in Section V.B, the prospects for retail shareholder governance may diminish even further, at least in the near future, due to regulatory changes such as the SEC raising thresholds for submitting shareholder proposals under Rule 14a-8. See 17 C.F.R. § 240.14a-8 (2024).

The remainder of this Article is organized as follows: Part I surveys the demographic, societal, and technological changes that have led some commentators and scholars to express high hopes for the impact of meme and other retail investors on corporate governance. Part II explains our data sources and presents summary statistics. Part III examines the origins of meme trading and explains the importance of the 2019 abolition of commissions by online brokerages. Part IV shows that, despite the surge of retail investor interest in meme stock companies, shareholder nonvoting at meme stock companies increased in recent years, and retail investors failed to make much of an impact using the shareholder proposal process. Part V looks at potential indirect effects of shareholder engagements, such as firm ESG performance, board independence, gender diversity, R&D, and capital expenditures. We find that these companies have not become more prosocial recently, and meme firms’ ESG scores have decreased. We also find that these companies decreased both R&D as well as capital expenditures. In Part VI, we explore potential reasons as to why corporate governance has not been democratized notwithstanding the prevailing scholarly predictions. In doing so, we highlight important differences between the activities involved in meme investing versus those involved in meme shareholding. We then conclude and offer some possible directions for a future meme stock research agenda.

I.  RETAIL INVESTORS, RETAIL SHAREHOLDERS, AND MEME TRADERS

When GameStop was experiencing a dramatic meme surge in January 2021, it was easy to dismiss the phenomenon as a transient anomaly that could be explained away by pandemic boredom and stimulus checks.25See Joe Rennison & Stephen Gandel, Meme Stocks Are Back. Here’s Why Wild Trading May Be Here to Stay., N.Y. Times (Aug. 19, 2022), https://www.nytimes.com/2022/08/19/business/meme-stocks-bed-bath-beyond.html [https://perma.cc/7KUN-6ASL]. However, the pandemic has long ended and meme surges continue, albeit sporadically. Experts now believe meme trading is here to stay.26See id. If meme trading has become a fact of life, it begs the question of what we should expect from retail investors participating in meme trading or trading more generally. Given that (coordinated) retail investing is here to stay, what impact will the shifting of the shareholder and investor base away from institutional shareholders and toward retail shareholders have on financial markets and corporations?

There are three principal drivers scholars and commentators have proposed as to how retail investors have become poised to transform financial markets and corporate governance. The first relates to perceived generational shifts in investor preferences. In this story, millennials and Gen Z entering the market as retail investors will seek to create a footprint, based on their social, cultural, and distributional preferences, on corporate policies.27See Fisch, supra note 6, at 1841–42, 1846–47; Sergio Alberto Gramitto Ricci & Christina M. Sautter, Corporate Governance Gaming: The Collective Power of Retail Investors, 22 Nev. L.J. 51, 90–95 (2021) [hereinafter Gramitto Ricci & Sautter, Corporate Governance Gaming]; Sergio Alberto Gramitto Ricci & Christina M. Sautter, The Wireless Investors Movement, U. Chi. Bus. L. Rev.: Online Edition (Jan. 28, 2022), https://businesslawreview.uchicago.edu/online-archive/wireless-investors-movement [https://perma.cc/XZ5J-DMXC] (contending that retail trading “will naturally expand into corporate-governance-based initiatives”). Some analyses of these market participants have concluded that this generation cares deeply about issues beyond profit maximization, and are willing to forsake returns to pursue these interests through the corporation. For instance, Professors Michal Barzuza, Quinn Curtis, and David Webber argue that in order to attract investment from millennials, index funds should push more for various governance and social changes at companies—such as board diversity—which are issues that millennials care about.28See Michal Barzuza, Quinn Curtis & David H. Webber, Shareholder Value(s): Index Fund ESG Activism and the New Millennial Corporate Governance, 93 S. Cal. L. Rev. 1243, 1249–50, 1304, 1309 (2020). Similarly, Professors Sergio Alberto Gramitto Ricci and Christina Sautter observe that millennials have a generationally defined and distinct set of values, and are more likely to prioritize ESG goals over profit.29Gramitto Ricci & Sautter, Corporate Governance Gaming, supra note 27, at 77. In their telling, meme investors will seamlessly transform into engaged shareholders and usher in a new paradigm for corporate governance.30Id. at 78.

Secondly, meme investors could be highly motivated to affect corporate policies because of the societal time period in which the meme surge occurred. Some have argued that decades of profit-centric corporate governance have led to workers, residents of surrounding communities, and the environment all suffering from profit-centric corporate policies.31See Aneil Kovvali, Stark Choices for Corporate Reform, 123 Colum. L. Rev. 693, 693–96 (2023). The shareholders best positioned to change corporate policies—large asset managers such as BlackRock and Vanguard, who own more than a fifth of the average S&P 500 firm32See Matthew Backus, Christopher Conlon & Michael Sinkinson, Common Ownership in America: 1980–2017, 13 Am. Econ. J.: Microecon. 273, 285 (2021). —are constrained in their ability to pressure management to change policies because any such change would hurt the interests of at least some of the hundreds of investment funds they operate.33See John D. Morley, Too Big to Be Activist, 92 S. Cal. L. Rev. 1407, 1407–08, 1454 (2019). In this vein, Professor Jill Fisch has argued that retail investors are a useful antidote to the concentration of market power in large institutional investors, and can help enlist ordinary citizens in the larger project of national economic development.34See Fisch, supra note 6, at 1805. Free from the structural constraints faced by institutional investors, who owe a fiduciary duty to their clients and are likely to be obligated to pursue profit maximization,35See C. Scott Hemphill & Marcel Kahan, The Strategies of Anticompetitive Common Ownership, 129 Yale L.J. 1392, 1437 (2020). retail shareholders are theoretically able to demand that firms adopt prosocial policies even at the expense of profit. Citizen involvement via retail investing could also have the advantage of tempering corporate power, with retail investors able to sway management through their ability to influence close votes.36See Fisch, supra note 6, at 1840. An example of this from the meme surge came from meme investors who wanted to keep AMC theaters open despite the COVID-19 pandemic severely disrupting the firm’s business model.37See Sarah Whitten, AMC’s ‘Apes’ Gave It a Lifeline. Now, Its CEO Wants to Use the Meme Frenzy as a Springboard for Growth, CNBC (June 1, 2021, 3:04 PM), https://www.cnbc.com/2021/06/01/amcs-ceo-wants-to-use-the-meme-frenzy-as-a-springboard-for-growth.html [https://perma.cc/YY9A-V5XY]. By putting their money into the company (through additional capital raising) and keeping the movie theaters running, meme investors arguably offered a lifeline to the thousands of workers employed by the chain.

Finally, there is a technological element to the promise of meme and other retail investing. Since the mid-2010s, Robinhood has offered a game-like and easily accessible mobile app allowing retail investors to participate in the market. Financial economics literature has shown that retail investors overreact to market signals and allow overconfidence to distort portfolio allocation, reducing their financial returns.38See Brad M. Barber, Xing Huang, K. Jeremy Ko & Terrance Odean, Leveraging Overconfidence (Nov. 30, 2020) (unpublished manuscript), https://papers.ssrn.com/sol3/Papers.cfm?abstract_id=3445660 [https://perma.cc/FT5H-FWXS]; Mark Grinblatt & Matti Keloharju, The Investment Behavior and Performance of Various Investor Types: A Study of Finland’s Unique Data Set, 55 J. Fin. Econ. 43, 44, 66 (2000); see also James Fallow Tierney, Investment Games, 72 Duke L.J. 353, 357, 362–85 (2022) (highlighting the game-like nature of retail investing through mobile apps and expressing support for regulatory intervention). Given this research, it is unsurprising that many retail investors decided to engage in stock-picking after gaining uninterrupted access to a flashy mobile investing app. Beyond investing apps, the GameStop saga and the meme stock frenzy of 2021 demonstrated the power of social media technology to coalesce dispersed individuals who can unite to bring about an impact and put checks on the forces of institutional players. Today, social media platforms such as Facebook, Reddit, and X (formerly known as Twitter), provide a space where individuals form communities, share information, and engage in collective action. These platforms have also made it easier for people to spread information quickly, allowing them to mobilize and respond to events in real time. From these perspectives, the meme surges of early 2021 could be seen as foreshadowing a future in which technology can further enable and empower dispersed individuals to overcome the cost of collective action and promote a collectively cobbled together agenda.

Collectively, the demographic, societal, and technological trends could be seen as ushering in an amplified role for retail investors. Consistent with this intuition, a study by Professors Alon Brav, Matthew Cain, and Jonathon Zytnick empirically assesses the collective voting heft of retail investors using a large proprietary sample of shareholder ownership and voting records.39See generally Alon Brav, Matthew Cain & Jonathon Zytnick, Retail Shareholder Participation in the Proxy Process: Monitoring, Engagement, and Voting, 144 J. Fin. Econ. 492 (2022). They conclude that retail investor voting can have as much of an impact on corporate outcomes as the voting preferences of the three largest institutional investors.40Id. at 504. This suggests that, to the extent meme stock surges can motivate greater retail shareholder participation, there is a realistic possibility of significant changes in corporate governance. For these reasons, more than three years after the GameStop saga, this Article seeks to empirically examine what changes, if any, have taken place in the governance of the companies subject to meme surges.

II.  DATA AND SUMMARY STATISTICS

We use a variety of sources to collect information about both meme and non-meme stocks. Our first step is to identify which companies qualify as meme stock companies in the relevant period. We use Factiva41Factiva, owned by Dow Jones & Company, is a business research tool. It aggregates content from both free and licensed sources and provides access to over 32,000 newspapers, journals, magazines, and so forth. See Factiva, Dow Jones, https://www.dowjones.com/professional/factiva [https://perma.cc/FGU4-SC3F]. searches and Internet queries with appropriate keywords (“meme,” “retail investors,” and “Reddit” in conjunction with “stock” and so on), modeling our approach on the nascent financial economics literature studying the meme trading phenomenon.42See Michele Costola, Matteo Iacopini & Carlo R.M.A. Santagiustina, On the “Mementum” of Meme Stocks, Econ. Letters, Oct. 2021, at 1, 2. The authors show how certain “meme stocks,” GameStop, AMC, Koss, Moody’s, Pfizer, and Disney, exhibited dynamics of price, trading volume, and social media activity, as measured by the number of tweets. Id. We identify the following eight companies as meme stock companies: GameStop,43See Yun Li, The $300 Billion Meme Stock That Makes GameStop Look Like Child’s Play, CNBC (Aug. 3, 2022, 8:35 AM), https://www.cnbc.com/2022/08/03/the-300-billion-meme-stock-that-makes-gamestop-look-like-childs-play.html [https://perma.cc/F2BH-NZD9]. AMC Entertainment, Inc.,44See Paul R. La Monica, Meme Stock Mania May Finally Be Over, CNN (Dec. 6, 2022, 12:43 PM), https://www.cnn.com/2022/12/06/investing/meme-stocks-gamestop-amc/index.html [https://perma.cc/5UX8-CAC4]. Bed Bath & Beyond,45See id. Blackberry,46See Bernard Zambonin, BlackBerry (BB): Why Jim Cramer Is Warning Investors to Avoid This Stock, TheStreet (Oct. 12, 2022, 6:57 AM), https://www.thestreet.com/memestocks/other-memes/blackberry-bb-why-jim-cramer-is-warning-investors-to-avoid-this-stock [https://perma.cc/R8J9-GUAM]. Express, Inc.,47See WYCO Researcher, Express, Inc.: A Former Meme Stock Could Be Headed into Serious Trouble in a Recession, Seeking Alpha (July 19, 2022, 10:06 AM), https://seekingalpha.com/article/4524180-express-inc-a-former-meme-stock-could-be-headed-into-serious-trouble-in-a-recession [https://perma.cc/RE67-R8ET]. Koss,48See Samuel O’Brient, Why Is Meme Favorite KOSS Stock Soaring 40% Today?, Inv. Place (July 25, 2022, 2:04 PM), https://investorplace.com/2022/07/why-is-meme-favorite-koss-stock-soaring-40-today [https://perma.cc/SHF4-GDV4]. Robinhood,49See Maggie Fitzgerald, Robinhood Is Not a Meme Stock and Doesn’t Plan to Sell Shares to Raise Funds, CFO Says, CNBC (Aug. 19, 2021, 8:49 AM), https://www.cnbc.com/2021/08/19/robinhood-is-not-a-meme-stock-and-doesnt-plan-to-sell-shares-to-raise-funds-cfo-says.html [https://perma.cc/VK7L-KBHQ]. and Vinco.50See Clark Schultz, Vinco Ventures Skyrockets on Big Day for Meme Stocks, Seeking Alpha (Aug. 16, 2022, 1:47 PM), https://seekingalpha.com/news/3873788-vinco-ventures-skyrockets-on-big-day-for-meme-stocks [https://perma.cc/FR3A-HELJ]. For meme and non-meme stocks (i.e., other public companies), we collect an array of financial and non-financial information for the time period of 2015 to 2022. First, stock price information comes from the Center for Research in Stock Prices (“CRSP”). Firm financial data (size as proxied by assets in millions of dollars, performance (return on assets), debt ratio, cash ratio, closing stock price at the end of the fiscal year, and market value in millions), R&D, and capital expenditures are collected from Compustat. Finally, we get data on shareholder voting from Institutional Shareholder Services (“ISS”) (formerly known as Riskmetrics).

Table 1 presents the summary statistics for our dataset. Vote figures are organized at the shareholder proposal level and are matched to the firm-year level observations for financials from Compustat.51Although we have also collected institutional ownership data based on 13F filings from Thomson Reuters to indirectly back out the fraction of retail ownership, because the 13F reporting is done on a quarterly basis and there was a large turnover at the meme stock companies during the “meme surge,” the data turned out to be unreliable. For instance, when the institutional ownerships were aggregated, for some companies, the fractions exceeded one. Panel A displays the overall descriptive statistics (in millions for financial measures), while Panel B compares the relevant statistics between meme and non-meme companies, along with t-test results. The first statistic in Panel B, Percent Non-Votes, measures the extent of shareholder non-participation in direct governance. Following the accounting literature, we define shareholder non-participation as the percentage of outstanding shares that were not voted “for,” “against,” or “abstention” with respect to proposals at a meeting. Between 2015 and 2022, the average yearly non-participation rate for meme stocks was 28.75%. This is higher than the 25.04% average for non-meme firms and the difference is significant at the 1% level.

The next four statistics in Panel B, Return on Assets, Cash Ratio, Debt Ratio, and the natural logarithm of assets, present a picture of their respective financial status and performance. When we compare the respective returns on assets, we see that the mean return on assets for meme stocks over the entire period is –0.098, which is statistically significantly (at the 1% level) lower than –0.05 for the non-meme companies. In addition, while Cash Ratio and Ln(Assets) are not statistically significantly different, the meme companies have a statistically significantly higher debt ratio (at 32%) compared with non-meme companies (at 28%).

Table 1.  Summary Statistics

Panel A

 

N

Mean

SD

Percent Votes for Proposal

289422

70.53

19.16

Percent Votes Against Proposal

289422

4.34

8.6

Percent Non-Votes

289422

24.84

17.26

Ln(Assets)

282189

7.8

2.31

Cash Ratio

276587

0.12

0.18

Debt Ratio

224586

0.28

0.24

Return on Assets

282061

 –0.04

0.25

R&D Expense

180296

294.56

2123.57

Capital Expenditures

228832

449.6

2117.09

Closing Price

288369

56.14

141.04

Market Value

259924

15314.18

74007.27

 

Panel B

 

 

 

 

Non-Meme

Firms

(1)

Meme

Firms

(2)

t-statistic

(1)–(2)

Percent Non-Votes

25.04

28.75

–3.94***

Return on Assets

–0.05

–0.098

3.17***

Cash Ratio

0.14

0.13

1.3

Debt Ratio

0.28

0.32

–3.05***

Ln(Assets)

7.56

7.37

1.54

Closing Price

56.18

28.54

3.72***

Market Value

15331.22

3023.8

3.15**

Note: Panel A presents information on the shareholder voting results and financial variables for the meme stocks identified in Part II, for the period of 2015–22. All financial variables are winsorized at the 1% level. Panel B presents t-tests for some of these variables between meme and non-meme stocks. The ***, **, and * denote significance at the 1%, 5%, and 10% levels.

Finally, the last two statistics in Panel B, Closing Price and Market Value (in millions), show some of the characteristics of the respective stock. Perhaps not surprisingly, meme companies, on average, had lower stock prices and lower market capitalization than non-meme companies: the average stock price of meme companies is about half of non-meme companies and the average market capitalization of meme companies (a little over $3 billion), one-fifth of that of non-meme companies. In sum, the descriptive statistics indicate that meme companies are on average less profitable (or unprofitable), more heavily leveraged, and have lower stock prices and market capitalizations, consistent with media reports.52See James Mackintosh, AMC’s Meme-Stock Traders Mess with Corporate Theory, Wall St. J. (Jun. 8, 2021, 8:00 AM), https://www.wsj.com/articles/amcs-meme-stock-traders-mess-with-corporate-theory-11623107259 [https://perma.cc/99YP-6JRQ].

III.  THE TWIN SHOCKS TO MEME STOCKS

In the popular imagination, social media usage during the coronavirus pandemic has been singled out as the main driver of the emergence of meme stocks. The New York Times has characterized meme stock investments as being “propelled by a social media frenzy and a bit of boredom” during the pandemic.53See Erin Griffith, No End to Whiplash in Meme Stocks, Crypto and More, N.Y. Times (June 23, 2021), https://www.nytimes.com/2021/06/23/technology/no-end-to-whiplash-in-meme-stocks-crypto-and-more.html [https://web.archive.org/web/20210623090425/https://www.nytimes.com/2021/06/23/technology/no-end-to-whiplash-in-meme-stocks-crypto-and-more.html]. The Wikipedia entry for “meme stock” defines it as “a stock that gains popularity among retail investors through social media.”54See Meme Stock, Wikipedia, https://en.wikipedia.org/wiki/Meme_stock [https://perma.cc/YSD4-EWCA]. However, for our set of meme stocks, we identify an association with retail investors that (1) predates the pandemic and (2) does not relate to social media platforms, such as Reddit or X (formerly known as Twitter). More specifically, we look at the meme stocks’ response to the abolition of commissions by major brokerage platforms in late 2019.

On October 1, 2019, the major online brokerages Charles Schwab and TD Ameritrade eliminated commissions for all their customers. These platforms, which had dominated the online brokerage business, were responding to stiff competition from a new rival, Robinhood, which had heavily utilized the zero-commission trading model.55E-Trade, the other major online brokerage, abolished commissions the next day. See Paul R. La Monica, E-Trade Cuts Commissions to Zero Along with Rest of Brokerage Industry, CNN (Oct. 3, 2019, 6:26 AM), https://www.cnn.com/2019/10/02/investing/etrade-zero-commissions/index.html [https://perma.cc/V6P8-BPMM]. Experts termed this move “inevitable” after Charles Schwab and TD Ameritrade’s decision on October 1. See id.; see also Past CFO Commentary, Charles Schwab (Oct. 1, 2019), http://www.aboutschwab.com/cfo-commentary/oct-2019 [https://perma.cc/R46S-X7LP]. Share prices of Charles Schwab, TD Ameritrade, and E-Trade experienced a significant loss in response to Charles Schwab’s zero commission announcement. See Lisa Beilfuss & Alexander Osipovich, The Race to Zero Commissions, Wall St. J. (Oct. 5, 2019, 5:30 AM), http://www.wsj.com/articles/the-race-to-zero-commissions-11570267802 [https://perma.cc/YX65-UE9Z]. The advent of zero-commission trading has been widely acknowledged as a root cause of the explosion in retail investing activity. One of the leading explanations for why individuals do not participate in the stock market is that there is a fixed cost of investing that proves potentially insurmountable for the less wealthy.56See generally Joseph S. Briggs, David Cesarini, Erik Lindqvist & Robert Östling, Windfall Gains and Stock Market Participation, 139 J. Fin. Econ. 57 (2021) (showing that winning a $150,000 lottery increases stock market participation among recipients who previously did not own stocks); Annette Vissing-Jorgensen, Towards an Explanation of Household Portfolio Choice Heterogeneity: Nonfinancial Income and Participation Cost Structures (Nat’l Bureau of Econ. Rsch., Working Paper No. 8884, 2002), https://papers.ssrn.com/sol3/papers.cfm?abstract_id=307121 [https://perma.cc/45VK-YHB3] (estimating that stock market non-participation for half of those not owning stocks can be explained by a small participation cost). It is unsurprising that, by reducing the entry cost of trading (i.e., commissions), the 2019 decision by major brokerages increased retail investor activity.57See Maggie Fitzgerald, Retail Investors Continue to Jump into the Stock Market After GameStop Mania, CNBC (Mar. 10, 2021, 1:59 PM), https://www.cnbc.com/2021/03/10/retail-investor-ranks-in-the-stock-market-continue-to-surge.html [https://perma.cc/QPG6-3FKF] (“[r]etail trading has been accelerating since the industrywide decision to drop commissions in the fall of 2019”).

How did the abolition of trading commissions affect meme stocks? The relatively unexpected and sudden decision of the major brokerage platforms to introduce commission-free trading allows us to use the event study methodology to assess its impact. Given that this was prior to the meme stock surge of 2021, to the extent that the market was informationally “efficient,” the stock prices around October 1, 2019, would reasonably reflect the impact of the abolition of commissions on meme stocks.58For a review of event study methodology, see generally Sanjai Bhagat & Roberta Romano, Event Studies and the Law: Part I: Technique and Corporate Litigation, 4 Am. L. & Econ. Rev. 141 (2002); Sanjai Bhagat & Roberta Romano, Event Studies and the Law: Part II: Empirical Studies of Corporate Law, 4 Am. L. & Econ. Rev. 380 (2002); A. Craig MacKinlay, Event Studies in Economics and Finance, 35 J. Econ. Literature 13 (1997). First, we identify what the expected return for each stock would have been during the event period if the event had not occurred (that is, if the commissions had not been dropped). Using the standard Fama-French three-factor model,59See S.P. Kothari & Jerold B. Warner, Econometrics of Event Studies, in 1 Handbook of Corporate Finance 4, 25 (B. Espen Eckbo ed., 2008). this may be written as:

Here, Rit is the return on stock i on date t minus the risk free rate;  Rmt is the market return on date t minus the risk free rate; RSMB is the return on a portfolio of small companies; and RHML is the book to market factor that is the portfolio of firms with a high book value to market value ratio. The abnormal return that can be traced to the event (that is, the associated stock price movement) is the actual return minus the expected return:

We calculate the abnormal returns on October 1, 2019, for all companies in the Compustat database, and regress them against an indicator for whether the company is one of our eight meme stocks. Table 2 presents the results from the event study. Column (1) shows that meme stocks had abnormal returns that were 2.25 percentage points higher than the market, and the coefficient on the indicator variable is highly statistically significant.60In follow-up research, we show that the introduction of zero-commission trading was associated with positive and statistically significant abnormal returns for a broader array of stocks popular with retail investors, beyond meme companies. See generally Dhruv Aggarwal, Albert H. Choi & Yoon-Ho Alex Lee, Retail Investors and Corporate Governance: Evidence from Zero-Commission Trading (Northwestern L. & Econ. Rsch. Paper, No. 24-01, Aug. 29, 2024), https://papers.ssrn.com/sol3/papers.cfm?abstract_id=4708496 [https://perma.cc/WJ4F-3Z5E]. Column (2) reruns the regression in model (1) adding controls for firm financials (size as proxied by the natural logarithm of assets, cash ratio, debt ratio, and return on assets) and the results remain largely unchanged. One concern with our results could be that meme stocks are categorically different from non-meme companies. As a final robustness check, we control for the possibility that the financials of meme and non-meme companies may be different. Using the entropy-balancing technique invented in the social science literature,61See generally Jens Hainmueller, Entropy Balancing for Causal Effects: A Multivariate Reweighting Method to Produce Balanced Samples in Observational Studies, 20 Pol. Analysis 25 (2012). we balance the means of the covariates for meme and non-meme companies. As shown in column (3), our result for meme stocks remains robust to the entropy-balancing method.

Table 2.  Event Study Results

 

(1)

(2)

(3)

 

Baseline

With Financials

Entropy Balanced

Meme Stock

2.252***

2.238***

2.230***

 

(0.645)

(0.655)

(0.614)

Constant

–0.127***

–0.269

0.300

 

(0.0269)

(0.246)

(0.768)

Observations

7,110

3,531

3,531

R-squared

0.001

0.010

0.208

Firm Financials

No

Yes

Yes

Note: This table presents results from an event study using the Fama-French three-factor model. The dependent variable in this linear regression model is the abnormal stock return on October 1, 2019. Columns (1) and (2) use ordinary least squares regression and column (3) balances covariates for meme and non-meme companies using the entropy-balancing technique. Columns (2) and (3) add controls for firm size (proxied by the natural logarithm of assets), cash ratio, debt ratio, and return on assets. All financial variables are winsorized at the 1% level. The ***, **, and * denote significance at the 1%, 5%, and 10% levels.

Figure 1 graphs the mean cumulative abnormal returns of meme stocks. The mean abnormal returns are represented by the solid line in the middle, while the dotted lines enclose the 95% confidence interval. Day “0” in this figure refers to October 1, 2019. The figure shows an economically and statistically significant gain for meme stocks around the time the major brokerages dropped trading commissions. In unreported results, we find that meme stocks had significant abnormal returns on October 1, 2019, when we use alternative asset pricing models such as the capital asset pricing model (“CAPM”) or Carhart four-factor model.62These asset pricing models are described in detail in MacKinlay, supra note 58, at 19, and Kothari & Warner, supra note 59, at 25–26.

Figure 1.  Cumulative Abnormal Returns for Meme Stocks

Note: This figure graphs the mean cumulative abnormal returns for meme stocks around October 1, 2019, when major brokerages abolished trading commissions (denoted as day 0). The dotted lines represent the 95% confidence interval for cumulative abnormal returns.

In addition to the abnormal returns, we also examine the magnitude of share turnover. Figure 2 presents data on the share turnover for meme stocks and other companies between 2015 and 2022. We define turnover as the daily average of the number of stocks of the firm traded as a percentage of total outstanding common stock, using data from CRSP. Since meme stocks are, on average, smaller firms, we subdivide non-meme companies into those belonging to the smallest quartile in terms of market capitalization and other bigger firms. Meme stocks saw both an increase in trading volume after the abolition of commissions on October 1, 2019, and a further increase in 2021–22 after the explosion of social media interest in these firms. There was a significant increase compared to both smallest-quartile and larger non-meme firms. Three points are notable. First, meme stocks had a higher turnover compared with non-meme stocks even in the first period, that is, before the abolition of commissions. Second, both meme and non-meme stocks saw an increase in trading volumes after the abolition of commissions, although the increase was markedly greater for meme companies. Third, during the time of meme surge, while the meme stock trading volume exploded, there seems to be no noticeable increase in trading volume for non-meme stocks.

Figure 2.  Average Turnover for Meme Stocks and Other Firms

Note: This figure graphs the mean share turnover (shares traded each day as a percentage of total outstanding common stock) according to CRSP data. The data is presented separately for meme and non-meme stocks. Non-meme stocks are further subdivided into those that belong to the smallest quartile by market capitalization and larger firms. “Pre-Zero Commission” refers to the period from 2015 to September 2019, “Post-Zero Commission” to October 1, 2019, to December 31, 2020, and “Post-Meme Surge” to 2021–22.

We estimate a regression model in which we analyze the factors affecting the average daily turnover for CRSP companies in all three periods. We include as explanatory variables an indicator for meme stock, two dummies for Post-Zero Commission and Post-Meme Surge, and the interaction of the meme indicator with each time dummy. We include firm fixed effects to make sure the results are not driven by idiosyncratic factors unique to any given company. Both interaction terms are positive and statistically significant. The results are presented in Table 3. Note that, even controlling for firm fixed effects and time trends, meme companies seem to have especially gained with respect to this measure of liquidity in the latter time periods. The results remain qualitatively unchanged when we additionally control for firm market value. The event study results presented in this Section show that meme stock companies gained value around the time major brokerages abolished commissions. The influx of retail investors precipitated by zero commissions could therefore have been particularly impactful for the meme stocks. Moreover, the results on turnover indicate that meme firms saw greater trading volumes after the major brokerages eliminated commissions.

Table 3.  Meme Stocks and Trading Volume

Post-Zero Commission

0.664***

 

(0.0714)

Post-Zero Commission x Meme

3.252*

 

(1.874)

Post-Meme Surge

0.517***

 

(0.0822)

Post-Meme Surge x Meme

12.23***

 

(3.634)

Constant

1.080***

 

(0.0402)

 

 

Observations

20,764

R-squared

0.875

Firm Fixed Effects

Yes

Note: This table presents the results of a linear regression model in which the dependent variable is the daily percentage of outstanding shares that are traded. “Post-Zero Commission” refers to the time period of October 1, 2019, through December 31, 2020, and “Post-Meme Surge” to 2021–22. The regression model includes firm fixed effects, and all standard errors are clustered at the firm level. The ***, **, and * denote significance at the 1%, 5%, and 10% levels.

IV.  DIRECT SHAREHOLDER ENGAGEMENT AT MEME STOCK COMPANIES

In this Part, we explore the effect of meme stock investing on the direct mechanisms for shareholder engagement: voting and submitting shareholder proposals. This can help us empirically assess claims that the influx of retail investors would affect corporate governance and possibly empower shareholders to engage with management more actively. To briefly summarize the findings, our empirical results show that predictions of retail investor-driven changes in corporate governance may be overstated. First, the level of shareholder voting at meme companies decreased after the abolition of commissions by online brokerages and decreased still further in the aftermath of 2021 meme surge. Second, we find no evidence of active shareholder engagement by way of submitting shareholder proposals at the companies in our sample, except in limited circumstances unrelated to corporate governance.

A.  Non-Voting at Meme Stock Companies

An important claim in the literature is that the retail shareholders brought in after the meme phenomenon may be more likely to be assertive and more vigorously engage with management.63See supra Part I. This is a plausible claim: if retail investors could coordinate their trades to attack institutional investors—a feat previously unimaginable—so, too, can they coordinate votes to have their voices heard. Accordingly, one could expect more retail shareholders to vote on governance proposals, including director elections and other consequential decisions (such as mergers and acquisitions and charter amendments), at these firms after 2021. Ideally, if we can observe each shareholder’s characteristics (for example, institutional versus retail), how many shares are owned by each shareholder, and how many of those shares are voted on, we will be able to tell exactly what the rate of participation among retail shareholders is. Nevertheless, due largely to the limitations on data, we do not have access to any information on whether certain votes came from a retail versus an institutional shareholder.64Some scholars have been successful in accessing data owned by proxy service firms, such as Broadridge, and have been able to estimate retail shareholder participation much more accurately. See generally, e.g., Brav et al., supra note 39.

Instead, we rely on an indirect measure in estimating shareholder participation that is commonly used in the accounting literature. One way of such an indirect estimation is by measuring aggregate non-votes at shareholder meetings. The accounting scholarship attributes non-votes in shareholder meetings (that is, votes that were not cast for or against a proposal and were not abstentions) to retail investors.65See Kobi Kastiel & Yaron Nili, In Search of the “Absent” Shareholders: A New Solution to Retail Investors’ Apathy, 41 Del. J. Corp. L. 55, 62–64 (2016). Corporate insiders and institutional investors, on the other hand, are much more diligent in registering their votes. Under this standard assumption, if retail investors became more engaged after 2021, we could expect the overall share of non-votes to fall.66See Rachel Geoffroy, Electronic Proxy Statement Dissemination and Shareholder Monitoring 12 (Nov. 30, 2018) (unpublished manuscript), https://papers.ssrn.com/sol3/papers.cfm?abstract_id=3264846 [https://perma.cc/83KF-FSCY]. The author examines the changes from postal mail to electronic distribution of proxies and shows how electronic distribution of proxies actually reduced shareholder participation by about 1% to 2%. Id. at 4. With the assumption that the non-participation comes from retail investors, this implies that retail investor participation decreased by about 7% to 17%. Id.

In analyzing the rate of non-votes, it is also important to account for the type of proposal. Shareholder proposals at U.S. public companies are generally of two types: routine and non-routine. Routine proposals are those that pertain to the company’s day-to-day operations but are not expected to significantly affect the company’s overall operation and performance. Examples of this type are proposals for the ratification of auditors or approving stock splits. By contrast, non-routine proposals typically relate to the company’s long-term strategy or are expected to have a significant impact on the company’s financial performance. Examples include the issuance of new stock, election of directors, a merger with another company, divesting a business unit, or any other proposal stockholders could have concerns with and would affect their ownership. For our purposes, there is an important distinction between these two types: brokers can vote shares on behalf of the beneficial owners for routine matters, but not for non-routine matters. Therefore, only shareholders can vote their own shares for non-routine proposals.67Id. at 4.

Figure 3 graphically presents the yearly average of non-vote rates on proposals at both meme and non-meme companies between 2015 and 2022. We hand-coded each proposal listed in the Institutional Shareholder Services (“ISS”) data as either “routine” or “non-routine” based on Rule 452 of the New York Stock Exchange (“NYSE”).68See N.Y. Stock Exch., Rule 452 (2003), https://nyseguide.srorules.com/rules/negg0109013e2c855b2572 [https://perma.cc/EYQ6-NSWH]. As expected, we find that non-routine proposals (in which brokers cannot vote on behalf of shareholders) have consistently higher levels of non-participation for both meme and non-meme firms.

More importantly, we find an increase in shareholder non-voting rate after 2018, concentrated in meme companies (for both routine and non-routine proposals). In fact, before 2019, meme companies had lower non-vote rates compared with non-meme companies, but by 2022, non-vote rates are at above 50% and 30% on non-routine and routine matters, respectively, at meme companies. At the same time, at non-meme companies, as Figure 3 shows, there seems to have been only marginal changes in non-vote shares over the same period. This is the opposite trend from what one would expect if the retail shareholders were more engaged with respect to corporate governance at meme firms, such as AMC and GameStop. Instead of seeing a burst of shareholder engagement, meme companies have seen increasing retail shareholder apathy in recent years.

Figure 3.  Average Share of Non-Votes for Meme and Non-Meme Stocks over Time, by Proposal Type

Note: This figure presents information on the yearly average percentage of votes that were not voted in shareholder meetings. We define the number of non-votes as Total Outstanding Shares minus (Votes For + Votes Against + Abstentions). We split the data by meme/non-meme stock as well as proposal type (that is, whether it qualifies as “routine” as defined in NYSE Rule 452).

If we were to expect that retail shareholders are less likely to participate in direct governance, this finding, on the one hand, may not be too surprising. Recall, however, that many of these retail investors were the drivers of coordinated meme surges in early 2021, collectively taking a stance against institutional investors. There is also reason to believe that many of them have remained loyal to the firm.69See, e.g., Caitlin McCabe, GameStop’s Most Loyal Shareholders Are in It for the Long Haul, Not the Memes, Wall St. J. (June 6, 2021, 5:30 AM), https://www.wsj.com/articles/gamestops-most-loyal-shareholders-are-in-it-for-the-long-haul-not-the-memes-11622971801 [https://perma.cc/AAV4-U97N]; see also Caitlin McCabe, Karen Langley, Gunjan Banerji, Hardika Singh & Gregory Zuckerman, Where Six Meme Stock Investors Are Now, Wall St. J. (Jan. 28, 2022, 5:30 AM), https://www.wsj.com/articles/where-six-meme-stock-investors-are-now-11643365810 [https://perma.cc/VMA6-589V]. If the fraction of retail investors at meme stock companies remains relatively high through 2021 and 2022, and many of them care more about the companies’ survival and performance, one would expect them to be more active in firm governance. From this perspective, the fact that the share of no-votes keeps increasing through 2021 and 2022, long after the initial “meme surge” was over, is surprising.

Table 4 presents a more formal regression analysis (using linear regression models), in which the dependent variable is the percentage of non-votes at a shareholder proposal level. We collected this data for all companies from the ISS database (from 2015 through 2022) to make sure we captured any secular time trends in shareholder voting across the market. Column (1) presents the baseline model, while column (2) adds financial variables as controls. We included firm fixed effects to account for any idiosyncratic factors unique to each company. Note, foremost, that the coefficient estimates (except for the estimate on the variable “Meme x 2019–20”), along with their statistical significance, are fairly consistent across the two models, indicating that the specifications are fairly robust. In terms of the results, at the top of the table, the dummy for non-routine proposals is positive (with the point estimates of 14.04 and 13.98, respectively) and highly statistically significant (at the 1% level), indicating that these types of matters generally have greater non-participation than routine proposals (per stock exchange regulations): non-vote shares on non-routine matters are about 14 percentage points higher compared with those on routine matters.

The coefficient estimates on 2019–20 and 2021–22 indicator variables are also positive and statistically significant, indicating that there is a general trend toward non-votes across all companies. When we interact both time period dummies with the Meme indicator, the coefficient for these terms is positive and highly statistically significant, at least in the baseline model, indicating that there seems to be more non-voting at meme companies after both the abolition of commissions and the surge in social media interest in these companies.70Controlling for firm financials in column (2), the interaction between Meme and 2019–20 is no longer significant. The rise in non-voting for meme stocks seems concentrated in non-routine proposals, as one would expect since brokers cannot vote on behalf of the shareholders on these issues. Most tellingly, the triple interaction of Meme, each time period dummy, and Non-Routine is also positive (with coefficient estimates ranging from about 7.5 to 8.1) and highly statistically significant (at the 1% level) in both the baseline model and with financial controls.

Table 4.  Meme Stocks and Non-Voting

 

(1)

(2)

 

Baseline

With Financials

Non-Routine

14.04***

13.98***

 

(0.198)

(0.223)

Meme x Non-Routine

–5.607***

–5.585***

 

(1.495)

(1.534)

2019–20

0.681***

0.830***

 

(0.140)

(0.170)

Meme x 2019–20

5.204***

2.650

 

(1.780)

(1.760)

Non-Routine x 2019–20

–0.630***

–0.688***

 

(0.149)

(0.162)

Meme x Non-Routine x 2019–20

7.910***

8.125***

 

(1.406)

(1.425)

2021–22

3.899***

4.622***

 

(0.195)

(0.232)

Meme x 2021–22

13.91***

11.59**

 

(4.841)

(4.672)

Non-Routine x 2021–22

–3.297***

–3.531***

 

(0.178)

(0.195)

Meme x Non-Routine x 2021–22

7.503***

7.901***

 

(1.725)

(1.942)

Constant

12.29***

23.68***

 

(0.179)

(2.087)

 

 

 

Observations

238,506

194,929

R-squared

0.699

0.735

Firm Fixed Effects

Yes

Yes

Firm Financials

No

Yes

Note: This table presents the results of a linear regression model in which the dependent variable is the percentage of shares that were not voted for a proposal at shareholder meetings. We define the number of non-votes as Total Outstanding Shares – (Votes For + Votes Against + Abstentions). 2019–20 equals 1 for years 2019 and 2020, while 2021–22 equals 1 for 2021 and 2022. We split the data by proposal type (that is, whether or not it qualifies as “routine” as defined in NYSE Rule 452). Column (2) adds controls for firm assets, cash ratio, debt ratio, and return on assets. Columns (1) and (2) include year and firm fixed effects, and all standard errors are clustered at the firm level. The ***, **, and * denote significance at the 1%, 5%, and 10% levels.

The estimates tell us that, compared with routine matters at meme companies at these two time periods, the share of non-votes on non-routine matters are about 7.5 to 8 percentage points higher. The results indicate that, for 2019–22, meme companies saw a greater rise in non-voting among shareholders as compared with non-meme companies, and this effect was especially pronounced for non-routine proposals for which brokers could not vote on behalf of shareholders.

B.  Shareholder Proposals at Meme Stock Companies

As another measure of shareholder engagement, we looked at the number (and the content) of shareholder proposals that were submitted by retail shareholders at meme stock companies. For example, it is possible that even if the level of retail shareholder voting at meme companies has remained low (or decreased), the meme surge may have emboldened a minority of retail shareholders to take more active steps in submitting shareholder proposals to affect corporate governance and corporate policies. While there are other channels of influencing corporate governance—such as running a proxy contest or nominating a director candidate through proxy access (if the company allows it)—these other channels require significant economic resources (in the case of proxy contests) or more substantial ownership thresholds and holding periods (in the case of accessing proxy ballots directly). As such, these are less salient means for meme traders. For this reason, the more promising route for meme traders is likely through submission of a shareholder proposal.

First, we discuss some institutional background and a potential complication for our empirical analysis. The eligibility requirement for a shareholder to submit a shareholder proposal is governed by Rule 14a-8,71See SEC Shareholder Proposals Rule, 17 C.F.R. § 240.14a-8 (2024). which imposes an ownership threshold and a holding period requirement. Once a proposal is submitted by an eligible shareholder, the SEC rule requires the company to add the proposal to the agenda for voting at the next annual shareholders’ meeting, unless the SEC provides special permission to exclude it from consideration.72See id. Since 1998, Rule 14a-8 has maintained a relatively low share ownership threshold: it required only that a shareholder had held at least $2,000 or 1% of a company’s securities for at least one year.73See 17 C.F.R. § 240.14a-8 (1998); 17 C.F.R. § 240.14a-8 (2007); 17 C.F.R. § 240.14a-8 (2011). The SEC, however, in 2020 replaced the $2,000 threshold with three alternative thresholds and adjusted the corresponding holding periods. Specifically, (1) if a shareholder owns more than or equal to $25,000, then he may submit a proposal if he has held the shares for at least one year; (2) if a shareholder owns less than $25,000 but more than or equal to $15,000, he must have owned company shares for at least two years; and (3) if a shareholder owns less than $15,000 but more than or equal to $2,000, he must have been a stockholder for at least three years.74See 17 C.F.R. § 240.14a-8 (2020). The rule was proposed on November 5, 2019,75Procedural Requirements and Resubmission Thresholds Under Exchange Act Rule 14a-8, Exchange Act Release No. 34-87458, 84 Fed. Reg. 66458 (Dec. 4, 2019). adopted on September 23, 2020, and went into effect on January 4, 2021.76Procedural Requirements and Resubmission Thresholds Under Exchange Act Rule 14a-8, Exchange Act Release No. 34-89964, 85 Fed. Reg. 70240 (Nov. 4, 2020). However, the SEC noted that the changed thresholds would only affect proposals submitted for annual meetings that take place after January 1, 2022.77Press Release, Secs. & Exch. Comm’n, SEC Adopts Amendments to Modernize Shareholder Proposal Rule (Sept. 23, 2020), https://www.sec.gov/news/press-release/2020-220 [https://perma.cc/VMA6-589V] (“[T]he final amendments will apply to any proposal submitted for an annual or special meeting to be held on or after January 1, 2022.”).

The SEC’s revised thresholds are more difficult to meet, and this was indeed the Agency’s intention. The previous requirement of $2,000 and a one-year holding period is arguably a more achievable threshold for meme traders. The revised thresholds and the corresponding holding periods are much less likely to be met by meme traders—especially the segment of retail investors that began participating in the stock market only after the introduction of commission-free trading platforms. For this reason, we can reasonably expect little activity from meme traders by way of shareholder proposals for annual meetings taking place after January 1, 2022.

As a threshold inquiry, we first examined whether investors reacted to the SEC’s decision to change the Rule 14a-8 thresholds. There were no changes in the thresholds between the SEC’s rule proposal (November 5, 2019) and rule adoption (September 23, 2020). We examined both event dates—the rule proposal date as setting the market’s expectation and the rule adoption date as finalizing the proposal through adoption. If meme traders were particularly committed to influencing corporate governance, these events may correlate with negative stock market reactions. In unreported results, we found no significant market reactions for meme stock companies for either event. We interpreted this finding to be consistent with the idea that meme traders were never particularly interested in participating in corporate governance.

We followed through by reviewing the meme stock companies’ definitive proxy statements filed with the SEC’s EDGAR system from 2015 through 2022 to see whether they included shareholder proposals. These proxy statements typically indicate whether a particular proposal is submitted by a shareholder. Even in the absence of any such specification, the proxy statements will invariably indicate whether the board approves each proposal, which is a good indication that the proposal is internally proposed. Note, however, that the lack of shareholder proposals in definitive proxy statements does not necessarily indicate that no shareholder submitted a proposal to be included in the proxy. First, under Rule 14a-8, management is permitted to exclude a shareholder proposal under a few specific circumstances.78See 17 C.F.R. § 240.14a-8 (2024). However, exclusion is permitted only after management submits its reasons to the SEC. For this reason, we also searched through the SEC’s no-action letter archives to see whether any of these companies sought to exclude shareholder proposals, and if so, on what grounds. Second, it is also possible for management to persuade a shareholder to withdraw a proposal through negotiation.79See, e.g., Kobi Kastiel & Yaron Nili, The Giant Shadow of Corporate Gadflies, 94 S. Cal. L. Rev. 569, 580 (2021) (“After a shareholder submits a proposal, . . . the proponent may withdraw the proposal after negotiations with the company.”). There is also reason to believe that companies may be less likely to seek to exclude proposals through SEC no-action letters, as the result of the SEC’s 2021 policy change with respect to issuing no-action letters. See SEC Staff Legal Bulletin No. 14L (Nov. 3, 2021). These are done through private agreements, and we are unaware of any public data set that would capture withdrawn proposals.80We are also unaware of any study that has examined such proposals. For example, in their extensive empirical study on shareholder proposals, Nili and Kastiel acknowledge that their data set “does not include proposals that were withdrawn due to a negotiated agreement or otherwise.” Kastiel & Nili, supra note 79, at 581. For this reason, for data analysis purposes, we will assume that all properly submitted shareholder proposals are reflected under our search. Nevertheless, given the possibility of negotiations that may occur as a result of submitted-but-withdrawn shareholder proposals, we will look to other measures of shareholder engagement in Part V.

For all meme companies in the sample, with respect to observable shareholder proposals, we verified our numbers and analysis for this Section using the SharkWatch dataset, which is a standard resource for studying shareholder proposals.81See, e.g., Kastiel & Nili, supra note 65, at 61 n.19. Table 5 describes, for each meme stock company, the number of shareholder proposals (1) included in the company’s proxy statements, (2) approved each year, and (3) properly excluded via the SEC’s no-action letter. Some benchmark figures may be helpful to set proper expectations. In terms of raw numbers of shareholder proposals among the S&P 1500 companies, Professors Kobi Kastiel and Yaron Nili document “a relatively steady and significant number of shareholder proposals submitted to the S&P 1500 [between 2005 and 2018] (an average of 517 proposals per year).”82Kastiel & Nili, supra note 79, at 581. The pattern, however, is not uniform across all 1500 companies. In 2015, for example, “over 450 proposals were submitted to companies in the S&P 500, which is comprised of large-cap companies,” while “fewer than 150 shareholder proposals [combined] were submitted to the small- and mid-cap companies that comprise the S&P Mid-Cap 400 (S&P 400) and S&P 600, respectively.”83Kobi Kastiel & Yaron Nili, The Corporate Governance Gap, 131 Yale L.J. 782, 807 (2022). Given that meme stock companies are small-cap to mid-cap companies, there would be no expectation that any of these companies would be inundated with shareholder proposals.

Nevertheless, the results shown in Table 5 are revealing. For AMC Entertainment, Inc., Blackberry, Express, Inc., Koss, and Vinco Ventures, there were no shareholder proposals submitted between 2015 and 2022.84Kastiel and Nili explain, however, that “in many cases, shareholder proposals do not reach the voting stage” because “some companies prefer to work with the proposing shareholder to bring about a change rather than have the proposal go to a shareholder vote.” Kastiel & Nili, supra note 79, at 582. The same is true with Robinhood, but the company went public recently, and thus has had only one definitive proxy statement issued (in 2022). Thus, for these companies, no proposal was ever included in any definitive proxy statement (which the board did not recommend), and none of these companies have had to request no-action letters from the SEC (to exclude a shareholder proposal) during the time frame.

Table 5.  Shareholder Proposals at Meme Stock Companies, 2015–2022

Company\Year

Shareholder Proposals Included/Approved/Excluded

2015

2016

2017

2018

2019

2020

2021

2022

AMC Entertainment, Inc.

0/0/0

0/0/0

0/0/0

0/0/0

0/0/0

0/0/0

0/0/0

0/0/0

Bed Bath & Beyond

0/0/0

3/2/0

0/0/0

0/0/0

0/0/0

0/0/0

0/0/0

0/0/0

Blackberry

0/0/0

0/0/0

0/0/0

0/0/0

0/0/0

0/0/0

0/0/0

0/0/0

Express, Inc.

0/0/0

0/0/0

0/0/0

0/0/0

0/0/0

0/0/0

0/0/0

0/0/0

GameStop Corp.

0/0/0

0/0/0

0/0/0

0/0/0

0/0/0

0/0/0

0/0/0

0/0/3

Koss

0/0/0

0/0/0

0/0/0

0/0/0

0/0/0

0/0/0

0/0/0

0/0/0

Robinhood

N/A

N/A

N/A

N/A

N/A

N/A

N/A

0/0/0

Vinco Ventures

0/0/0

0/0/0

0/0/0

0/0/0

0/0/0

0/0/0

0/0/0

0/0/0

Note: This table presents the number of shareholder proposals at meme companies between 2015 and 2022. For each company-year observation, we provide the total number of shareholder proposals included in the proxy statements, approved by shareholder vote, and excluded via SEC no-action letters.

Bed Bath & Beyond received three shareholder proposals in 2016, all of which the board recommended against. These included (i) a proposal for the board to implement proxy access, (ii) a proposal to have shareholders approve future severance packages, and (iii) a proposal for equity-based compensation for senior executives. Of these three, only the last one failed to pass.85See Bed Bath & Beyond, Inc., Current Report (Form 8-K) (July 1, 2016), https://www.sec.gov/Archives/edgar/data/886158/000117184316010938/f8k_070116.htm [https://perma.cc/ZN76-DFFD]. Note also that these proposals significantly predate the meme surge, and as such, cannot be attributed to the influx of retail investors.

Finally, GameStop sought and received three no-action letters from the SEC for excluding shareholder proposals, all dating to April 22, 2022.86See GameStop Corp., SEC Staff No-Action Letter (Apr. 21, 2022) [hereinafter Chaney GameStop Proposal], https://www.sec.gov/divisions/corpfin/cf-noaction/14a-8/2022/chaneygamestop042122-14a8.pdf [https://perma.cc/B7HT-TUWV]; GameStop Corp., SEC Staff No-Action Letter (Apr. 21, 2022) [hereinafter Crandall GameStop Proposal], https://www.sec.gov/divisions/corpfin/cf-noaction/14a-8/2022/crandallgamestop042122-14a8.pdf [https://perma.cc/T3J8-4MHS]; GameStop Corp., SEC Staff No-Action Letter (Apr. 21, 2022) [hereinafter Sapienza GameStop Proposal], https://www.sec.gov/divisions/corpfin/cf-noaction/14a-8/2022/sapienzagamestop042122-14a.pdf [https://perma.cc/E2XE-A9W8]. These involved proposals by three different shareholders, and management could permissibly exclude all of them for failing to meet the deadline for submission. Note, however, that given that these proposals were for the 2022 annual meeting, which requires the raised thresholds, these shareholders are unlikely to be meme traders. Meanwhile, the content of these proposals is also worth examining.

In one proposal, a self-described registered GameStop shareholder (who didn’t specify how many shares he held) proposed that the company offer a non-fungible token (“NFT”) dividend to its stockholders.87See Crandall GameStop Proposal, supra note 86. In another proposal, a shareholder who claims to own 191 Class A shares proposed that the board “immediately engage the services of the Company’s Transfer Agent, Computershare Limited (“Computershare”) to enable both investment and Direct Registration of Class A shares in both Roth and Traditional Individual Requirement Account (“IRA”) Shareholder Investment Programs at Computershare.”88Chaney GameStop Proposal, supra note 86. Finally, a third shareholder who beneficially owns 540 Class A shares of GME submitted an identical proposal as the second shareholder.89See Sapienza GameStop Proposal, supra note 86.

What can we learn from the shareholder proposals we examined? The 2016 proposals by Bed Bath & Beyond shareholders do reflect a genuine attempt at participating in corporate governance matters, but as mentioned already, these efforts predate the influx of retail investors. The GameStop shareholder proposals, however, tell a different story. On the one hand, they do indicate retail investor participation: it is possible that they were encouraged by the meme surge of 2021 to organize some activist effort. On the other hand, these proposals also do not relate to corporate governance matters: one is a dividend payment suggestion, while the other is a proposal to help certain retail shareholders obtain tax advantages. As of yet, there is no indication that GameStop investors—meme traders or not—are particularly likely to bring about governance reforms through shareholder proposals.

V.  BEYOND VOTING: ESG, DIRECTOR INDEPENDENCE, BOARD GENDER DIVERSITY, AND R&D

Voting and shareholder proposals are not the only ways shareholders can influence corporate governance at public companies. Boards, institutional investors, and policymakers are increasingly paying attention to a firm’s prosocial performance as captured by ESG metrics.90See generally Max M. Schanzenbach & Robert H. Sitkoff, Reconciling Fiduciary Duty and Social Conscience: The Law and Economics of ESG Investing by a Trustee, 72 Stan. L. Rev. 381 (2020) (explaining the growing market pressures to account for environmental and social factors in investing). Even if retail investors do not directly participate in governance (through voting or submitting proposals), it is possible that their presence and preferences could indirectly influence how firms are governed. For example, the meme surge could have left a considerable imprint on public companies if retail investors managed to make their firms’ preferences more prosocial. For one thing, because management may be able to raise extra cash through at-the-market offerings at inflated stock prices,91An at-the-market offering allows an issuer to sell more of its stocks at the prevailing market price. According to our EDGAR search, GameStop and AMC Entertainment took advantage of meme surges and made at-the-market offerings. See Felix Gillette & Eliza Ronalds-Hannon, AMC’s CEO Turned His $9 Billion Company into a Meme Machine, Bloomberg (Aug. 17, 2022, 2:00 AM), https://www.bloomberg.com/news/features/2022-08-17/amc-amc-stock-became-a-meme-thanks-to-adam-aron-s-antics [https://perma.cc/SFG9-LE4H] (describing how AMC’s CEO “transformed himself into a Twitter-obsessed, gold mine-buying, populist folk hero for retail investors”). managers have reason to cater to the preferences of meme traders—that is, to make sure that meme surges persist (especially in times of trouble) and that these investors do not go away. In addition, it is also possible that a meme trader may have submitted a shareholder proposal but decided to withdraw it in return for some concession from the board or management, such as instituting some prosocial changes. Another possible indirect effect of meme surges could have been an increase in R&D spending. For example, those companies that engaged in at-the-market offerings could have invested the new funds in transformative innovative activity. Below, we also explain other mechanisms that were highlighted in the finance literature.92See infra Section VI.C. To estimate these possible indirect influences, in this Part, we first assess the impact of the meme phenomenon on firm ESG scores, board independence, and gender diversity. We then investigate whether meme companies spent more on R&D and capital expenditures after the influx of retail investors.

A.  ESG Scores at Meme Stock Companies

An important claim in the legal scholarship on retail investors is that these new entrants to the financial markets have different goals and expectations from management (as compared with more established institutional players or to retail investors from the previous generation). Gramitto Ricci and Sauter, for example, envisage meme trading as “a social movement able to bring business corporations to serve their original partly-private-partly-public purpose.”93Gramitto Ricci & Sautter, Corporate Governance Gaming, supra note 27, at 51. In other words, scholars envisioned meme companies as potentially deviating from the shareholder wealth-maximization norm and instead advancing social and environmental causes with pressure from retail investors. There is a demographic aspect to this argument. As Fisch observes, many of the retail investors who invested in meme stocks were younger people. Since some argue that the millennial generation has different preferences and is in favor of socially responsible investing even at the cost of wealth-maximization, Fisch expected the young cohort of retail investors to potentially pressure management to improve ESG metrics.94See Fisch, supra note 6, at 1850–51. However, Fisch also notes that “the extent to which citizens will pursue stakeholder or societal goals in their role as investors remains unclear.”95Id. at 1851.

To shed some light on the extent to which meme traders affected socially responsible investing and management, we obtained data on ESG scores for each firm in the Compustat dataset between 2015 and 2021. The ESG scores are taken from the MSCI ESG Score Indexes. This index measures ESG in several different ways, but we chose the most comprehensive measure—industry-adjusted total ESG score—as our outcome of interest.96There is some debate as to what the ESG rating really captures. The rating is intended to measure risk, but ESG scholars also employ this metric as a performance indicator—e.g., firms’ efforts to manage ESG risks. For studies using this index as a performance indicator, see, e.g., Ľuboš Pástor, Robert F. Stambaugh & Lucian A. Taylor, Dissecting Green Returns, 146 J. Fin. Econ. 403, 417 (2022); Mozaffar Khan, George Serafeim & Aaron Yoon, Corporate Sustainability: First Evidence on Materiality, 91 Acct. Rev. 1697, 1704 (2016). For more on the debate, see generally George Serafeim & Aaron Yoon, Stock Price Reactions to ESG News: The Role of ESG Ratings and Disagreement, 28 Rev. Acct. Stud. 1500 (2022). MSCI measures the ESG score for each firm at different points in the year. Therefore, we counted an ESG score to “belong” to a given year if it was assessed after June 30 of the previous calendar year or before June 30 of that year. For example, any ESG score assessed between June 30, 2015, and June 30, 2016, is counted as that firm’s 2016 ESG score. We estimated a difference-in-difference regression model assessing whether ESG scores changed differently for meme stocks after the abolition of commissions and the meme surge of 2021. Table 6 presents the results of this regression.

Table 6.  Meme Stocks and ESG Scores

 

(1)

(2)

 

Baseline

With Financials

2019–20

0.326***

0.295***

 

(0.0305)

(0.0348)

2019–20 x Meme

–0.0817

–0.0115

 

(0.127)

(0.125)

2021

0.644***

0.604***

 

(0.0434)

(0.0493)

2021 x Meme

–1.818**

–1.727*

 

(0.918)

(0.909)

Constant

4.219***

4.244***

 

(0.0515)

(0.0792)

 

 

 

Observations

13,739

12,039

R-squared

0.804

0.805

Firm Fixed Effects

Yes

Yes

Firm Financials

No

Yes

Note: This table presents the results of a linear regression model in which the dependent variable is the yearly industry adjusted ESG score reported for each firm by MSCI ESG Indexes. 2019–20 equals 1 for years 2019 and 2020, while 2021 equals 1 for 2021. Column (2) adds controls for firm assets, cash ratio, debt ratio, and return on assets. Columns (1) and (2) include firm fixed effects, and all standard errors are clustered at the firm level. Continuous variables are winsorized at the 1% level. The ***, **, and * denote significance at the 1%, 5%, and 10% levels.

As shown in Table 6, there is no observable positive effect of meme trading on our treated companies with respect to ESG scores, either after the abolition of commissions in 2019 or the rise in social media interest in 2021. In fact, not only are all the coefficient estimates on (2019–20 x Meme) and (2021 x Meme) variables negative, but the coefficient estimates on (2021 x Meme) variable are also statistically significantly negative, with or without financial controls, at 10% and 5% levels, respectively. While not conclusive, these results are consistent with the earlier results on shareholder voting and proposals, and perhaps not too surprising. If we expect that the new retail investors are more passive, it would not be surprising to expect that the companies would face less pressure from the retail investors and be less inclined to improve upon ESG issues, even if retail investors may care more about these topics in their personal lives. Furthermore, as discussed briefly in Parts II and III, meme firms had higher debt than other firms, and many of them had faltering business models. With an influx of new passive shareholders, management at these firms may have been tempted to reduce expenditure in compliance or ESG initiatives, especially when they know that they will face little pressure from their retail shareholder base.

B.  Board Independence and Diversity at Meme Stock Companies

Next, we looked at the relationship between meme trading and board characteristics. We used data on director independence and board gender diversity from BoardEx, with the dependent variable equaling the percentage of a company’s board that is independent or female, depending on the empirical test.97We assign a year to board independence and diversity data based on the reporting date in BoardEx in the same way we handle the dating of ESG information. See supra Section V.A. Ethnic diversity is another variable we could examine. Nevertheless, BoardEx datasets do not include ethnicity data in a readily usable format. Table 7 presents regression analyses in which the percentage of independent directors is the dependent variable. Leading academic commentators, regulators, and institutional investors usually take a higher share of independent directors to be a sign of better corporate governance,98See Dorothy S. Lund & Elizabeth Pollman, The Corporate Governance Machine, 121 Colum. L. Rev. 2563, 2630 (2021). even though the empirical evidence on the correlation between board independence and firm performance is mixed.99See generally Sanjai Bhagat & Bernard Black, The Non-Correlation Between Board Independence and Long-Term Firm Performance, 27 J. Corp. L. 231 (2002) (showing that director independence is not associated with several measures of firm performance). Regardless of whether director independence boosts firm value, the results in Table 7 indicate that meme firms did not experience a significant increase in the share of independent directors either after the abolition of commissions or during the meme surges on social media. Nevertheless, unlike the ESG results in Table 6, we do not see meme companies performing “worse” than other companies. Our results simply suggest that there is no significant relationship between the meme phenomenon and director independence.

Table 7.  Meme Stocks and Board Independence

 

(1)

(2)

 

Baseline

With Financials

2019–20

1.489***

1.437***

 

(0.143)

(0.165)

2019–20 x Meme

4.573

4.753

 

(2.920)

(2.932)

2021–22

2.729***

2.545***

 

(0.184)

(0.219)

2021–22 x Meme

5.251

5.596

 

(3.771)

(3.782)

Constant

75.99***

71.91***

 

(0.0641)

(1.404)

 

 

 

Observations

21,581

19,538

R-squared

0.805

0.805

Firm Fixed Effects

Yes

Yes

Firm Financials

No

Yes

Note: This table presents the results of a linear regression model in which the dependent variable is the percentage of directors that are independent, per BoardEx. 2019–20 equals 1 for years 2019 and 2020, while 2021–22 equals 1 for 2021 and 2022. Column (2) adds controls for firm assets, cash ratio, debt ratio, and return on assets. Columns (1) and (2) include firm fixed effects, and all standard errors are clustered at the firm level. Continuous variables are winsorized at the 1% level. The ***, **, and * denote significance at the 1%, 5%, and 10% levels.

Board gender diversity is another area in which major corporations have focused in recent years, seeking to improve the representation of women. For example, California recently passed legislation mandating that firms headquartered in the state ensure that they had at least a minimum number of women directors on the board.100See Darren Rosenblum, California Dreaming?, 99 B.U. L. Rev. 1435, 1439 (2019) (citing Cal. Corp. Code § 301.3 (West 2019)). As with director independence, the empirical evidence for board gender diversity improving firm performance is mixed.101See generally Deborah L. Rhode & Amanda K. Packel, Diversity on Corporate Boards: How Much Difference Does Difference Make?, 39 Del. J. Corp. L. 377 (2014). Given, however, the concerted recent efforts to improve board gender diversity, we examine whether meme firms saw any changes with respect to this corporate governance measure. The regression analyses presented in Table 8 do not show meme companies granting women greater representation on boards after either the abolition of commissions or the advent of the social media-driven meme surges. Therefore, like director independence, we do not observe any significant recent changes for meme firms when analyzing board gender diversity.

Table 8.  Meme Stocks and Board Gender Diversity

 

(1)

(2)

 

Baseline

With Financials

2019–20

6.012***

5.689***

 

(0.167)

(0.191)

2019–20 x Meme

3.033

3.314

 

(3.842)

(3.850)

2021–22

10.50***

9.986***

 

(0.226)

(0.262)

2021–22 x Meme

–3.707

–2.981

 

(6.055)

(6.128)

Constant

14.09***

2.532

 

(0.0767)

(1.617)

 

 

 

Observations

21,581

19,538

R-squared

0.763

0.759

Firm Fixed Effects

Yes

Yes

Firm Financials

No

Yes

Note: This table presents the results of a linear regression model in which the dependent variable is the percentage of directors that are female, per BoardEx. 2019–20 equals 1 for years 2019 and 2020, while 2021–22 equals 1 for 2021 and 2022. Column (2) adds controls for firm assets, cash ratio, debt ratio, and return on assets. Columns (1) and (2) include firm fixed effects, and all standard errors are clustered at the firm level. Continuous variables are winsorized at the 1% level. The ***, **, and * denote significance at the 1%, 5%, and 10% levels.

We note that the results from this Part are not necessarily inconsistent with the observations made in the literature about retail investors primarily belonging to the millennial generation102See generally Barzuza, Curtis & Webber, supra note 28 (discussing corporate governance preferences of millennials). or this age cohort of investors having pro-ESG preferences. Instead, taken together with the earlier results about low levels of voting by retail investors, they suggest that retail investor apathy renders them unlikely to be able to change management policies toward the environment or social causes. Therefore, while the earlier scholarship on retail investors understandably thought millennial and Gen Z retail investors would move firms away from the wealth-maximization norm once they got a seat at the table, they underestimated the possibility that these investors would neglect to actually take their seat by voting or otherwise engaging with management.

C.  Profitability, R&D, and Capital Expenditure

As the final set of empirical exercises, we explore whether the influx of retail investors at the meme stock companies may have (indirectly) affected the financial performance or operations at the companies. To set the stage, Figure 4 presents three graphs on the average ROA, average R&D expenses, and average capital expenditures (“CapEx”), for meme and non-meme companies between 2015 and 2021. With respect to the ROA (a profitability measure) shown in the upper panel, there is a clear downward trend at meme companies while the non-meme companies’ average profitability seems much more stable over the same period. At the beginning of the sample period, in fact, meme stock companies were on average more profitable than other, non-meme companies, but the meme companies have experienced a sustained slide in their ROA, and by the end of the sample period, meme companies are performing significantly worse than non-meme companies. Despite some meme companies raising large sums of money by conducting at-the-market offerings at elevated prices,103See sources cited supra note 7 and accompanying text. notably Game Stop and AMC, these firms saw a decrease in profitability.

The substantial decrease in profitability of meme companies could be because their business models may have been fundamentally untenable in a changing world. It is also possible that the agency costs within these firms have gotten worse due, for instance, to less monitoring and less activism by their new retail shareholders.104See Appel et al., supra note 21, at 131. The authors empirically examine how the influx of institutional shareholders, due to an exogenous change in Russell 1000 and 2000 indices, affect corporate governance and show that the firm performance generally improves when there are more institutional investors. Id. at 134. In some sense, our paper is looking almost at the opposite question, but basing on the meme surge. While Part V examines firm performance, Part IV documents investors’ governance participation. The average R&D expenses and capital expenditure trends (as shown in the middle panel and lower panel, respectively) seem to suggest that, at the operational level, meme companies are spending less on innovation and structural improvements. Before we proceed, we should note that the small number of meme stocks is bound to introduce more variability in numerical averages. As such, there may generally be more noise in the time-trends in Figure 4 for meme (as compared with non-meme) stocks. However, the consistent downward trend for meme companies does indicate a real trend in profitability and innovative expenditures for these firms.105Because we have used ROA as an independent variable in our regression models throughout the paper, we do not report the regression results that examine the effect of meme surge and zero-commission trading on ROA. However, consistent with the figure, the coefficient estimates when we analyze the ROA as a dependent variable are significantly negative for meme companies, with respect to both post zero-commission trading and post meme surge.

Figure 4.  Average Return on Assets, R&D Expenses, and Capital Expenditures for Meme and Non-Meme Stocks


Note: This figure presents three separate graphs: mean return on assets, R&D expenses, and capital expenditures, between 2015 and 2021, separately for meme and non-meme companies.

To get a better understanding of meme companies’ performance, we took a closer look at the two operational measures: R&D and capital expenditures. Two theories of R&D spending are potentially relevant to meme companies. First, financial economists have explored the link between an increase in firms’ market valuation and innovation activity. Specifically, a study by a group of financial economists, using mutual fund flows as a measure of market’s optimistic valuation, found a “very strong and robust association” between firm overvaluation and R&D spending.106See Dong et al., supra note 21, at 2609 (showing how “overvaluation” of firm stock, driven by an exogenous increase in mutual fund outflows, increase a firm’s innovation activity, including R&D spending, through both financing (equity and debt financing) and non-equity (managerial confidence and insulation from a possible takeover) channels). At the same time, however, because the authors rely on mutual fund inflows to proxy for an increase in firm valuation (and outflows for decreases in valuation), it is more difficult to establish whether any increase in innovation activity is due to increases in valuation or to increases in institutional ownership. If the effect is strong with respect to the latter, that would be consistent with the alternate hypothesis, and our finding will also be consistent with the institutional ownership hypothesis. They found that R&D spending is more sensitive to firm overvaluation than to growth in company sales and cash flow.107See id. The underlying reasoning seems to be that corporate managers respond to higher market valuations by becoming more optimistic and engaging in more creative (and higher-risk) forms of innovation. If this theory applies to meme companies, one might expect that meme surges would have emboldened executives at these companies to increase R&D spending.

On the other hand, others have documented increase in R&D spending (and other innovative activities) associated with a larger fraction of institutional ownership.108See Bushee, supra note 21, at 305; Aghion et al., supra note 21, at 277; Appel et al., supra note 21, at 115. As institutional ownership increases, one can argue that this will increase shareholder monitoring (and lower agency costs) and induce the firm managers to engage in more innovative activities (and not shirk), which, in turn, should correlate with various measures of innovation and long-term investment, such as R&D spending and capital expenditures. Moreover, institutional investors have a long-term orientation, which allows managers to make risky expenditures on innovative projects without fear of being fired for failing to deliver short-term profits.109See Aghion et al., supra note 21, at 302–03. To the extent that this theory applies to meme stock companies, one might expect that the influx of retail investors and the resulting transformation of the stockholder base at these companies110See generally supra note 27 and accompanying text. would lead to these companies to decrease their R&D spending, since these firms saw ownership change hands from institutional investors to retail participants. Accordingly, examining how R&D spending has changed at meme stock companies could shed light on which of the two theories is more likely at play. We explored this question by conducting the standard difference-in-difference regression as in the rest of this Article and using R&D and capital expenditures as alternate dependent variables.111Following the financial economics and accounting literature, we replace missing values for R&D spending with zeroes. See Dong et al., supra note 21, at 2620; Ping-Sheng Koh & David M. Reeb, Missing R&D, 60 J. Acct. & Econ. 73, 74 (2015) (showing that about 10.5% of the firms who do not accurately report R&D expenditures in their financial statements file and receive patents which is 14 times larger than those firms that report zero R&D).

Table 9.  Meme Stocks and Expenditures on Innovation

 

(1)

(2)

 

R&D Expenses

Capital Expenditures

2019–20

14.38***

2.485

 

(1.574)

(3.943)

2019–20 x Meme

–28.59**

–62.43***

 

(12.85)

(20.67)

2021–22

30.10***

18.58***

 

(2.801)

(5.872)

2021–22 x Meme

–46.20***

–91.18*

 

(15.10)

(48.18)

Constant

95.81***

273.9***

 

(0.896)

(1.967)

 

 

 

Observations

28,247

34,519

R-squared

0.969

0.957

Firm Fixed Effects

Yes

Yes

Note: This table presents the results of a linear regression model in which the dependent variable is annual corporate R&D spending in column (1) and capital expenditures in column (2). 2019–20 equals 1 for years 2019 and 2020, while 2021–22 equals 1 for 2021 and 2022. Columns (1) and (2) include firm fixed effects, and all standard errors are clustered at the firm level. The ***, **, and * denote significance at the 1%, 5%, and 10% levels.

Table 9 presents the results of these regressions. As seen by the negative coefficient estimates on two interaction dummy variables, “2019–20 x Meme” and “2021–22 x Meme,” the regression results show that there was a significant decrease in both R&D and capital expenditures at the meme companies both after the 2019 abolition of commissions and the meme surge of 2021.112In these regressions, we do not control for firm financials in these models since, for example, R&D spending and capital expenditure could be codetermined with other financial measures. However, our results do not change when we additionally control for firm financials, including return on assets, cash and debt ratios, and ln(assets). Except for the coefficient estimate on the “2021–22 x Meme” in capital expenditures regression, which is significantly negative at the 10% level, the other estimates are significant at the 5% level. These results indicate that, unlike the case for more traditional measures of cash inflows, such as mutual fund flows, meme surges are not associated with increases in either R&D spending or capital expenditure. Rather, the results support the existing findings in the literature that an increase in institutional ownership is correlated with more R&D and capital expenditure spending. Therefore, when meme companies swapped institutional investors for meme traders in recent years, managerial incentives to spend on innovation may have decreased.

VI.  DISENTANGLING MEME INVESTING AND MEME SHAREHOLDING

The central inquiry of this Article has been how a dramatic change in shareholder base can affect corporate governance and whether retail investors can bring about meaningful changes as retail shareholders. While the time trends surveyed in Part I are consistent with more meme investing, meme investing is not, in turn, synonymous with meme shareholding. Indeed, the results discussed in Part II to Part V suggest that these new market entrants have not shaken up the way corporations are governed on an ongoing basis. The main question is why. In this Part, we explore several reasons as to why meme investing has not translated into meme shareholding.

A starting point is recognizing that despite the natural connection between retail investors and retail shareholders, their actual activities are quite different. As an investor, an individual is concerned about profitable short or long run transactions. Her activities include studying market information and diversifying portfolios. As a shareholder, an individual is concerned about capital gains, dividend payments, and her control rights (what may collectively be called “corporate governance”). She has the right to participate in shareholder voting, nominate director candidates, submit proposals, or even run proxy contests.113Another important right given to the shareholder, of course, is to bring lawsuits, based on either federal securities laws or corporate law (such as claims for breach of fiduciary duty), but here we focus on investing and governance. An individual faces different challenges depending on whether she is acting as an investor or as a shareholder. A trader is vulnerable at the moment she is transacting because she may be purchasing or selling stocks at an unfair price due to undisclosed information.114See James J. Park, Reassessing the Distinction Between Corporate and Securities Law, 64 UCLA L. Rev. 116, 116 (2017). By contrast, a shareholder is vulnerable as long as she owns the stock because corporate misconduct and breaches of fiduciary duties can reduce the share price.115See id.

More to the point, there is a significant difference between the payoffs of these activities. Retail traders participating in a meme surge will trade with a certain expectation and the payoff from their trades may be realized (relatively) quickly. There is a sense of instant gratification as well as an immediate opportunity to participate in a social activity. By contrast, retail shareholders may cast their votes only to find out that their votes have made no difference to the outcomes—for example, because the median voter is not among them—or that the proposals approved do not bring about any immediate changes in the way their corporations are run. In addition, if meme stock traders are driven by quick payoffs, they may not even be shareholders as of their company’s record date—in other words, they may not be eligible to vote by the record date for the annual meetings.

Another factor that may be driving the result is that technological developments may have reduced participation costs for shareholder engagement but not information costs. Participation costs refer to the resources the ordinary individual would need to learn about a firm and invest in it, while information costs are the costs she must incur to conduct research into the firm’s business operations and corporate governance. The digital innovations that sparked meme trading, such as the abolition of trading commissions, may not have reduced the information costs of shareholder participation in the same manner that they reduced participation costs of trading. Because meme trading is not an information-intensive activity, mobile apps like Robinhood and the abolition of trading commissions paved the way for retail traders. By contrast, shareholder voting is an inherently information-intensive activity, and thus, even with technologies that are designed to reduce participation costs, information costs that come with voting cannot be fully eradicated. This may account for why a sudden burst of enthusiasm for one type of activity may not instantly translate to a groundswell for another form of market participation.116A point worth highlighting is that it would not be accurate to group all retail investors together. The sudden influx is limited to a new generation of particular types of retail investors, that are now known by different names, such as “meme investors,” “meme traders,” “wireless investors,” or “ultra-retail investors.” See generally Abraham J.B. Cable, Regulating Democratized Investing, 83 Ohio St. L.J. 671 (2022). In our view, the term “meme traders” most aptly captures the observed pattern of transactions. In executing transactions motivated by Reddit discussion threads and triggering “short squeeze” attacks, these individuals cannot be said to be investing in any traditional sense. Although these traders only represent a subset of retail investors, they exist in sufficient numbers to affect price movements in the market for meme stocks.

It is also not insignificant that meme surges to date have been limited to a small set of companies that are not randomly selected. In discussing some common denominators among meme stock companies, one analysis catalogues factors such as low stock prices and enduring cultural relevance.117Naaman Zhou, What Is GameStop, Where Do the Memes Come In, and Who Is Winning or Losing?, The Guardian (Jan. 28, 2021, 1:46 AM), https://www.theguardian.com/culture/2021/jan/28/what-is-gamestop-where-do-the-memes-come-in-and-who-is-winning-or-losing [https://perma.cc/8D6Q-G9DN]. Indeed, our analysis of meme stocks reveals that these firms are mid- to small-cap companies, valued under $10 billion in market capitalization (some, in fact, have a much smaller market capitalization), with low stock prices.118The market capitalizations of meme stock companies we examine range from about $56.2 million to $9.2 billion. Their respective market capitalizations, as of January 2023, are as follows: $9.2 billion for Robinhood, $7 billion for GameStop, $2.8 billion for AMC, $2.5 billion for BlackBerry, $300 million for Bed Bath & Beyond, $150 million for Vinco, $77 million for Express, and $56 million for Koss. By comparison, the smallest company in the S&P 500 index has a market capitalization of $14.6 billion. See also supra Part II (presenting summary statistics for firm financials at meme and non-meme companies). Their modest sizes imply that even trades by a subset of retail investors can affect their stock prices—as such, they can be targets of short squeezes. At the same time, small companies are also more likely to suffer from a lack of significant corporate governance activities. As Kastiel and Nili documented, firms with smaller market capitalizations are less likely to adopt “best practices” in corporate governance and are less organized in doing so.119Kastiel & Nili, supra note 83, at 794. This could in part be because they are less likely to be targets of engagements by institutional shareholders120Id. or attract shareholder proposals related to governance.121Kastiel & Nili, supra note 65, at 807. Since meme companies have thus far been smaller firms, this suggests that they are unlikely to become overnight corporate governance exemplars.

Finally, one explanation consistent with our findings is that the segment of retail investors that entered the market as the result of the abolition of commission fees is not representative of the previously existing retail investor base.122Another point worth highlighting is that meme investors, too, may be a very particular subgroup of retail investors. There is little reason to believe that those who participated in meme frenzies are representative of the entire base of new generation of retail investors. According to Hasso et al., supra note 5, for example, 88% of the investors who participated in the GameStop frenzy were male and the average age was about 34. See id. at 2. The average years of trading experience of these investors was also less than 1 year. See id. Similarly, the 2020 WallStreetBets Census also shows similar statistics. Over 90% of the blog participants are male, and over 72% of them were aged twenty-nine and younger. See Presentation on 2020 WallStreetBets Census (Feb. 20, 2020), https://docs.google.com/presentation/d/1ozj-S3eIwSa6ZERs0kTdE1LiMYcN1kwBUGIDVcVlzLg/edit [https://perma.cc/QRA2-ZK5T]. Rather, they represent a particular subset of investors—those that were highly sensitive to then-existing low transaction fees. While they may have welcomed the commission-free trading platforms and have actively participated in such activities, those investors may also be presumptively unlikely to bear other types of transaction costs, such as submitting shareholder proposals or voting at annual meetings.

We believe the disjuncture between investing and share ownership may explain why meme surges and their impacts have been confined to the trading markets and presently remain divorced from meaningful shareholder activities in corporate America. These differences, of course, are not set in stone. As companies innovate and technological innovations lower the cost of shareholder engagement, it remains possible that future retail shareholders can make a meaningful difference at companies. This can also shift the balance of power away from institutional shareholders and toward retail shareholders. The jury is out on how such changes will come about in the future.

CONCLUSION

This Article has examined the impact of the dramatic influx of retail shareholders (from the “meme surge” of 2021) on various corporate governance and financial metrics at meme stock companies. Our analysis suggests that retail shareholders could be the leopards that failed to change their spots. For one, during the period when retail investor ownership of meme stocks has increased, the rates of non-voting have significantly risen at meme companies. This is in contrast to the rate of non-voting at non-meme companies, which has remained fairly stable over the same period. Although we cannot directly measure that the non-votes were coming more from retail investors—given that non-participation is generally attributed to retail investors and other non-meme companies were not experiencing anything similar in shareholder participation123See Brav et al., supra note 39, at 494; Geoffroy, supra note 66, at 1.—the result supports the hypothesis that the surge of retail investor interest in these companies is linked to the rise in non-voting.

Importantly, we observe that the increase in non-votes began in 2019, the same year that major brokerages abolished trading commissions, and the rate of non-votes went even higher in 2022, long after the meme surge was over. This indicates that the non-votes were not being driven by short-term speculators who may have participated in the meme surge of 2021 purely for financial gain and without any interest in democratic participation. The result is also consistent with the event-study evidence that the 2019 advent of zero-commission trading could have stirred retail investor interest in meme stocks (along with others). Retail investors have also failed to affect corporate governance at meme companies through the shareholder proposal process.

The Article also explored possible indirect avenues through which retail investors could have influenced meme companies. We find that the retail investors have been unable to translate their preferences into concrete improvements in the firms’ ESG scores or board independence and gender diversity. Furthermore, meme surges have not had an indirect effect on corporate innovation through R&D spending and capital expenditure. This finding contrasts with the earlier scholarship that showed positive correlation between increased valuation (for example, from mutual fund flows) and R&D spending, but it is consistent with the findings that showed a positive correlation between an increase in institutional ownership and innovative activities. In sum, all evidence to date suggests that meme trading may be a social phenomenon that remains largely orthogonal to retail shareholders’ aspirations to transform corporate governance. The demographic, societal, and technological changes surveyed in Part I could surely presage an increased role for retail investors at some time in the future. However, our empirical analysis suggests that this notion of democratized governance is yet to arrive at the firms targeted by the recent meme surges.

The Article’s primary focus has been on a handful of meme stock companies, such as GameStop, AMC, and Bed Bath & Beyond, with the principal finding that the new retail shareholders at these companies do not seem to be active in engaging with the management or in influencing the companies’ governance outcomes. Given that the Article’s focus is on a small number of companies that went through an unusual experience of facing a sudden surge of retail investors’ interest, one needs to be cautious about generalizing the results to other companies or making overarching conclusions. At the same time, these companies were chosen precisely because they were the primary targets of meme trading. Thus, to the extent we should have observed a new paradigm of corporate governance associated with meme surges, these companies would have been the most promising ones. Furthermore, particularly with respect to retail investors who remained loyal to the meme companies long after the “meme surge” was over, one would have expected them to be much more active in corporate governance and improving firm performance.

In sum, we believe that the Article’s findings are informative in getting a better understanding of retail shareholders’ engagement and potential democratizing benefits of allowing more retail investor participation. To get a better understanding of the importance of retail investor base on corporate governance, a future research project may take a closer look at how technological changes, including the introduction of zero-commission trading, may have had a broader effect on the capital markets and the more general impact of retail investors on corporate governance across a larger segment of the market. As the meme phenomenon spreads to banking stocks, SPACs, and bankrupt firms, research into meme investing and shareholder activity will become more important124See generally supra notes 8–10 and accompanying text. and shed new light on the issue of shareholder base and corporate governance.

97 S. Cal. L. Rev. 1419

Download

* Assistant Professor of Law, Northwestern Pritzker School of Law.

† Paul G. Kauper Professor of Law, University of Michigan Law School; Research Member, European Corporate Governance Institute (“ECGI”).

‡ Professor of Law, Northwestern Pritzker School of Law; Director, Northwestern University Center on Law, Business, and Economics. The authors thank Quinn Curtis, Jill Fisch, Sue Guan, Dorothy Shapiro Lund, Frank Partnoy, Alex Platt, Roberta Romano, Kathy Spier, George Triantis, and Jonathon Zytnick; conference participants at the 2023 Corporate and Securities Litigation Workshop, the 2023 Korean Commercial Law Association Annual Meeting, the 2023 American Law and Economics Association Annual Meeting, the 2023 Winter Deals Conference, the 2023 Conference on Empirical Legal Studies, and the Law and Technology Conference at the University of Southern California; and workshop participants at Vanderbilt University Law School, Northwestern Pritzker School of Law, University of Michigan Law School, Bocconi University, Corporate Law Academic Webinar Series (CLAWS), Council of Institutional Investors (CII) Webinar Series and 2023 NYU Corporate Roundtable for many helpful comments and suggestions. The authors thank Irving A. Birkner (Kellogg School of Management at Northwestern University), Shay Elbaum (University of Michigan Law Library), and Clare Gaynor Willis (Northwestern Pritzker School of Law Library) for help with data collection, and Danny Damitio, Andrea Lofquist, Michael Palmer, and Nanzhu Wang for their excellent research assistance. Comments are welcome to dhruv.aggarwal@law.northwestern.edu, alchoi@umich.edu, and alex.lee@law.northwestern.edu.             

The Failed Promise of Treasuries in Financial Regulation

U.S. government Treasury bonds (“Treasuries”) anchor financial stability. Public regulation mandates that financial firms maintain deep buffers of Treasuries that can be sold for cash in a crisis. In private lending between financial firms—running into trillions of dollars daily—Treasuries are the preferred form of collateral, designed to make debt fully resistant to default.

But this unquestioned reliance on Treasuries in public and private self-regulation has created a financial system that rests on fragile foundations. The first fundamental problem—thus far unnoticed in existing literature—lies in the system-wide tension that is present when both public and private self-regulation depend on the same scarce Treasuries/cash for survival.

This tension plays out in the common system of intermediation that supports both public and private self-regulation. Crucially, financial regulation places its trust in the competencies of twenty-four large financial firms—primary dealers—that uphold both the buying and selling of Treasuries as well as the supply of Treasuries to lending markets for use as collateral. This system of intermediation, however, is far from perfect. As we show, primary dealers confront incurable information gaps when allocating cash and Treasuries between private lending and public trading markets. Further, facing scarcity, primary dealers must choose whether to devote resources to one space over the other. Finally, as for-profit actors, primary dealers have no reason to continue intermediating if the cost-benefit trade-off turns sour. As it stands, for financial regulation to remain resilient, its mechanisms for intermediating Treasuries must also be lucrative.

The second problem lies in the fragmented system of supervision that governs an interconnected public trading and private lending market for Treasuries. Multiple regulators are in charge, but they lack coordination mechanisms, complementary regulatory approaches, and institutional mandates to facilitate cooperation. It follows that regulators have failed to spot shared risks to Treasuries intermediation and to develop mechanisms to correct them.

This Article sets out a three-part solution to better realize the promise of Treasuries for financial regulation: (1) enhancing transparency across trading and lending markets, (2) developing consolidated oversight, and (3) mandating that primary dealers maintain intermediation during crises. With Treasuries anchoring public regulation and trillions in private contracting, their fragility represents a danger that policymakers can ill afford to ignore.

INTRODUCTION

When COVID-19 shocked the financial system in March 2020, the (then) $17 trillion market for Treasuries became one of its most unexpected casualties.1Karen Brettell, U.S. Treasury Market Faces Structural Issues Even as Liquidity Improves, Reuters (Apr. 22, 2020, 11:26 AM), https://www.reuters.com/article/us-health-coronavirus-treasuryliquidity-idUSKCN224311 [https://perma.cc/YF52-JN3D]; Jeffrey Cheng, David Wessel & Joshua Younger, How Did COVID-19 Disrupt the Market for U.S. Treasury Debt?, Brookings: Up Front (May 1, 2020), https://www.brookings.edu/blog/up-front/2020/05/01/how-did-covid-19-disrupt-the-market-for-u-s-treasury-debt [https://perma.cc/WRY4-RWCM]; U.S. Treasury Monthly Statement of the Public Debt of the United States (MSPD), U.S. Dep’t of the Treasury, https://fiscaldata.treasury.gov/datasets/monthly-statement-public-debt/summary-of-treasury-securities-outstanding [https://perma.cc/QVS7-M5YT] (showing outstanding marketable (that is, tradable) debt of $27.3 trillion for the end of July 2024). In March 2020, the U.S. Treasury owed marketable debt equaling $17.1 trillion. Id. As equity and corporate bond markets reeled, investors rushed to sell Treasuries and raise cash to remain solvent.2Andreas Shrimpf, Hyun Song Shin & Vladyslav Sushko, Leverage and Margin Spirals in Fixed Income Markets During the Covid-19 Crisis, Bank for Int’l Settlements Bull., Apr. 2, 2020, at 1–2; Darrell Duffie, Still the World’s Safe Haven?: Redesigning the U.S. Treasury Market After the COVID-19 Crisis 2–8 (Hutchins Ctr. on Fiscal & Monetary Pol’y at Brookings, Working Paper No. 62, 2020), https://www.brookings.edu/wp-content/uploads/2020/05/WP62_Duffie_v2.pdf [https://perma.cc/97SY-CT64] (detailing the events of March 2020 and the response by authorities to shore up the market). Their reaction was exactly as expected. Viewed as failure-proof, Treasuries provide the world with its most dependable safe haven. When other markets run into distress, Treasuries are supposed to buffer the fall by ensuring a constant supply of default-free assets and cash for those that sell them.3Antoine Bouveret, Peter Breuer, Yingyuan Chen, David Jones & Tsuyoshi Sasaki, Fragilities in the U.S. Treasury Market: Lessons from the “Flash Rally” of October 15, 2014 5–6 (Int’l Monetary Fund, Working Paper No. WP/15/222, 2015) (noting the importance of Treasuries as the “bedrock of the financial system”); Michael Fleming & Francisco Ruela, Treasury Market Liquidity During the COVID-19 Crisis, Fed. Rsrv. Bank of N.Y.: Liberty St. Econ. (Apr. 17, 2020), https://libertystreeteconomics.newyorkfed.org/2020/04/treasury-market-liquidity-during-the-covid-19-crisis.html [https://perma.cc/7N56-SDJ3]. Recognizing this fortress-like quality, public regulation and private industry rely systematically on Treasuries as the shield to protect financial markets against panic, collapse, and uncertainty.4Cheng et al., supra note 1.

Public financial regulation mandates that Treasuries constitute a sizable part of the rainy day safety buffers of any number of regulated financial firms.5See discussion and sources infra Section I.C. The assumption here is that Treasuries can, by dint of quick sales, release cash in a crisis, allowing a firm to pay off its creditors and, in turn, prevent creditors from also going bust themselves.6See discussion and sources infra Section I.C; see also, e.g., Marco Macchiavelli & Luke Pettit, Liquidity Regulation and Financial Intermediaries 15–17 (Fed. Rsrv. Bd., Wash., D.C., Working Paper No. 2018-084, 2018) (describing the impact of the liquidity coverage ratio on the incentive of financial firms to build reserves of Treasuries). Using similar logic, the private market for lending between financial firms—running at trillions of dollars daily—also depends on Treasuries as the preferred form of collateral.7See, e.g., What Types of Asset Are Used as Collateral in the Repo Market?, Int’l Cap. Mkt. Ass’n., https://www.icmagroup.org/Regulatory-Policy-and-Market-Practice/repo-and-collateral-markets/icma-ercc-publications/frequently-asked-questions-on-repo/6-what-types-of-asset-are-used-as-collateral-in-the-repo-market [https://perma.cc/F7MF-D7UN] (highlighting the significance of government debt as collateral and the high reliance on U.S. Treasuries for funding). By securing debt using Treasuries, lenders can be sure that they will be repaid, either by the borrower as promised, or by selling the Treasuries collateral.8See discussion and sources infra Section II.A. This unquestioned confidence in Treasuries as collateral means that parties do not even need to conduct due diligence on one another, so long as sufficient Treasuries can secure the debt.9See generally Bengt Holmstrom, Understanding the Role of Debt in the Financial System (Bank for Int’l Settlements, Working Paper No. 479, 2015). Indeed, it is taken for granted that the price of Treasuries will not fall when that of assets, like corporate bonds or equities, crashes. In other words, investors rush to safety during crises by putting capital into Treasuries and maintaining (or increasing) their price.10Zhiguo He, Stefan Nagel & Zhaogang Song, Treasury Inconvenience Yields During the COVID-19 Crisis, 143 J. Fin. Econ. 57, 57 (2022) (observing that during crises, the price of Treasuries enjoys a price premium owing to the safety and liquidity provided).

March 2020, however, upended these assumptions. Rather than Treasuries providing reliable trading (or liquidity)—allowing sellers to cash out without distorting prices—investors found themselves unable to transact on reasonable terms.11Adam Samson, Robin Wigglesworth, Colby Smith & Joe Rennison, Strains in US Government Bond Market Rattle Investors, Fin. Times (Mar. 12, 2020), https://www.ft.com/content/1a305358-6450-11ea-a6cd-df28cc3c6a68 [https://perma.cc/54R6-696V]. Execution costs increased by 50%–500%, and market depth—or the quantity of offers (quotes) available to trade—plunged to 10%–38% of earlier values.12Fleming & Ruela, supra note 3. Testifying before the Senate Banking Committee in February 2021, the Chair of the Federal Reserve (“the Fed”), Jerome Powell, remarked that the Treasury market did not have “the capacity to handle” the pressure.13The Semiannual Monetary Policy Report to the Congress: Hearing Before the U.S. Comm. on Banking, Hous. & Urb. Affs., 117th Cong., at 02:13:49 (Feb. 23, 2021), https://www.banking.senate.gov/hearings/02/12/2021/the-semiannual-monetary-policy-report-to-the-congress [https://perma.cc/SR9P-NX8G]. Treasuries’ prices became chaotic and fell out of sync with those in related markets.14Cheng et al., supra note 1. As detailed by Annette Vissing-Jorgensen, this price instability had nothing to do with changes to the country’s economic fundamentals (for example, inflation).15Annette Vissing-Jorgensen, The Treasury Market in Spring 2020 and the Response of the Federal Reserve, 124 J. Monetary Econ. 19, 21 (2021). Instead, its cause was the rapid deterioration of trading conditions in the Treasury market with large investors rushing in to sell.16Id. (noting abnormally large sales by mutual funds, hedge funds, and foreign governments); see also U.S. Dep’t of the Treasury, Bd. of Governors of the Fed. Rsrv. Sys., Fed. Rsrv. Bank of N.Y., U.S. SEC & U.S. Commodity Futures Trading Comm’n, Recent Disruptions and Potential Reforms in the U.S. Treasury Market: A Staff Progress Report 7–15 (2021). As such, with equity markets plunging almost 3,000 points daily, the price of Treasuries also dropped precipitously, instead of increasing or staying stable as should have been the case for the world’s premier safe haven.17He et al., supra note 10, at 57–58. On the legal construction of safe assets, see generally Anna Gelpern & Erik F. Gerding, Inside Safe Assets, 33 Yale J. on Regul. 363 (2016).

Worryingly, the disruptions in March 2020 were not a one-off event. Rather, as shown by Matthias Fleckenstein and Francis A. Longstaff, market confidence in the capacity of Treasuries to steadfastly provide a safe haven has diminished significantly in recent years. Fleckenstein and Longstaff observe that Treasuries have traded much more cheaply to their fair value at key moments in modern financial history, with sizable price discounting observed during the 1997 Asian Financial Crisis, the 2000s, and frequently between 2015–2020.18Matthias Fleckenstein & Francis A. Longstaff, Treasury Richness 2, 5 (Nat’l Bureau of Econ. Rsch., Working Paper No. 29081, 2021); see also Yesha Yadav, A Blueprint for Reforming Treasury Markets 4–7 (Vand. Univ. L. Sch.,         Legal Stud. Rsch. Paper Series, Working Paper No. 20-58, 2020), https://papers.ssrn.com/sol3/papers.cfm?abstract_id=3739971 [https://perma.cc/8P5V-A4U7] (discussing recent disruptions). Taken together, these repeated performance failures call into question the core assumption made by public and private financial regulation in relying so fundamentally on Treasuries as safe assets: that their default-free nature means that Treasuries are also always perfectly tradable at fair prices.19Samson et al., supra note 11. For a detailed discussion on the ineffective regulatory structure for Treasury markets, focusing on the secondary market for Treasuries trading, see generally Yesha Yadav, The Failed Regulation of U.S. Treasury Markets, 121 Colum. L. Rev. 1173 (2021). We close this gap in the literature to show that this assumption is simply wrong. Rather, while Treasuries can be regarded as risk-free, the market that trades them is not, diminishing their capacity to act as an anchor for public as well as private industry self-regulation. In this Article, we make two claims to detail: (1) the fragile system of intermediation that underpins Treasuries’ distribution, and (2) the deeply flawed model of market supervision that is ill-matched to contend with the risks created by faulty intermediation.

In our first contribution, we show that there is a fundamental, internal tension within a system in which both public and private financial regulation rely on scarce Treasuries to support economic survival. This tension and interconnection crystallize in a shared system of intermediation that must, at once, manage the buying and selling of Treasuries with the public as well as ensure the constant supply of Treasuries collateral to the private lending market.

For a start, this system of intermediation is remarkably fragile. Opacity, conflict, and complexity are pervasive. Crucially, regulation places trust in the capacity of (currently) twenty-four large banks and investment firms—known as primary dealers—to intermediate Treasuries. Primary dealers are uniquely authorized to purchase Treasuries from the government at auction and then to distribute them widely.20Jeffrey Cheng & David Wessel, What Is the Repo Market, and Why Does It Matter?, Brookings: Up Front (Jan. 28, 2020), https://www.brookings.edu/blog/up-front/2020/01/28/what-is-the-repo-market-and-why-does-it-matter [https://perma.cc/GUG6-3KXA]; Primary Dealers: List of Primary Dealers, Fed. Rsrv. Bank of N.Y., https://www.newyorkfed.org/markets/primarydealers [https://perma.cc/TQ8F-NAQD]. This role puts primary dealers center stage in the secondary market for buying and selling Treasuries with investors, in which they sell to those that want to buy and buy from those that want to sell. In this way, primary dealers help operationalize the assumption made in public financial regulation that Treasuries can always be liquidated by those needing cash or bought by firms wanting a reliable safe asset—all at fair and stable prices.

Primary dealers also act as critical intermediaries for the approximately five trillion dollars in exposure in the private market for short-term lending21US Repo Statistics, Sec. Indus. & Fin. Mkts. Assoc. (Aug. 26, 2024), https://www.sifma.org/resources/research/us-repo-statistics [https://perma.cc/A55S-V8DE] (noting that the size of the primary dealer repo segment is over five trillion dollars).—known as the repurchase or repo market—in which Treasuries constitute the preferred form of collateral.22Legally, short-term credit transactions are structured as a sale and repurchase agreement, meaning that the securities are “sold” in return for cash and then bought back when the agreement terminates. By structuring this as a sale and repurchase, the Lender legally owns the securities, and it can sell them in an event of default. For discussion and sources, see infra Section II.A. We do not discuss purchases and sales by the Fed in its monetary policy operations in this Article. For analysis, see generally Carolyn Sissoko, The Collateral Supply Effect on Central Bank Policy (Aug. 21, 2020) (unpublished manuscript), https://papers.ssrn.com/sol3/papers.cfm?abstract_id=3545546 [https://perma.cc/3KN9-WAR5]. The repo market allows financial firms with cash to lend it to others that need it.23This describes the classic repo market, in which cash is borrowed. In “reverse repo” markets, firms seek to borrow Treasuries against cash collateral. To eliminate default risk, lending is short-term and secured (mostly using Treasuries).24Cheng & Wessel, supra note 20. By ensuring firms can borrow cash whenever they need, the repo market provides a lifeline to financial firms to address everyday funding demands.25Id. Within the repo market, primary dealers match borrowers with lenders.26SIFMA Rsch., The US Repo Markets: A Chart Book 4–5 (2022), https://www.sifma.org/wp-content/uploads/2022/02/SIFMA-Research-US-Repo-Markets-Chart-Book-2022.pdf [https://perma.cc/A4S2-L4GV]. On tri-party repo, see Tri-Party/GCF Repo, Fed. Rsrv. Bank of N.Y., https://www.newyorkfed.org/data-and-statistics/data-visualization/tri-party-repo/index.html#interactive/volume/collateral_value [https://perma.cc/9DAC-S9ZK] (stating that around four trillion dollars of the six-trillion-dollar repo market is supported by Treasuries collateral). They also act as lenders by using their own cash to serve those looking to borrow.27See generally Fixed Income Outstanding, Sec. Indus. & Fin. Mkts. Ass’n., https://www.sifma.org/resources/research/fixed-income-chart [https://perma.cc/9AYU-FSA6]. Finally, dealers borrow for themselves in the repo market as a way of funding their firm’s everyday operations.28Cheng & Wessel, supra note 20. In intermediating the supply of Treasuries to the repo market, primary dealers help insulate financial firms against default and systemic fallout.

Primary dealers confront steep and pervasive costs when intermediating across both the secondary market for Treasuries as well as the repo market. Information gaps are endemic. This opacity is structurally unavoidable in the repo market. Because Treasuries represent the preferred form of collateral and lending is short-term, due diligence is deemed unnecessary.29See generally Holmstrom, supra note 9. On opacity in the repo market that has not been addressed by post-2008 reform, see generally Paolo Saguato, The Liquidity Dilemma and the Repo Market: A Two-Step Policy Option to Address the Regulatory Void, 22 Stan. J.L. Bus. & Fin. 85 (2017). By design, primary dealers lack the tools and incentives to carefully monitor the default risk posed by parties with whom they contract.30In segments of the repo market that are cleared by a third-party, there is more transparency, data collection, and publication. This data collection has been increasing since October 2019. See generally R. Jay Kahn & Luke M. Olson, Off. of Fin. Rsch., Who Participates in Cleared Repo? (2021), https://www.financialresearch.gov/briefs/files/OFRBr_21-01_Repo.pdf [https://perma.cc/3YQ2-4BZ7] (detailing data collected by regulators from cleared repo markets). They are also unable to fully gauge, on a market-wide basis, how this risk is building—for example, whether certain counterparties might be growing more indebted, less likely to repay, and whether to continue to lend to them, and on what terms.31See discussion and sources infra Section III.A.

To be sure, using Treasuries as collateral should mean that primary dealers and the financial market have nothing to fear from default. But this view glosses over the damaging effect of opacity on intermediation. A system-wide absence of real-time information means that primary dealers are justified in being overly cautious when the prospect of default does arise and in quickly cutting off credit to counterparties across the board on account of not knowing exactly where the problem lies and how widespread it may be. Dealers might demand more Treasuries collateral to match unknown but higher levels of risk—even from borrowers that appear to be safe. In the absence of detailed information, withdrawing intermediation is rational, even advisable, to ensure that primary dealers do not keep lending to any number of defunct firms. After all, there is no rule forcing primary dealers to keep trading.32Alexandra Scaggs, Please Let’s Stop Saying US Primary Dealers Are Required to Make Markets (Updated), Fin. Times (June 17, 2016), https://www.ft.com/content/b6c87a0f-6d50-3f46-b27a-5ecc83d12dc5 [https://perma.cc/8Q9G-QPF7]. From the standpoint of the market and its regulation, however, this kind of preventative action is harmful, chaotic, and liable to amplify distress. Financial firms can end up suddenly unable to meet their daily funding needs, or to roll over past debt, having to quickly find the cash to repay if a dealer calls in a repo loan or makes an existing one more expensive.33On the 2008 Financial Crisis and the effects of the repo runs on the real economy, see, e.g., Gary Gorton & Andrew Metrick, Securitized Banking and the Run on the Repo, 104 J. Fin. Econ. 425, 435–36 (2012); Caitlin Long, The Real Story of the Repo Market Meltdown, and What It Means for Bitcoin, Forbes (Sept. 25, 2019, 2:55 PM), https://www.forbes.com/sites/caitlinlong/2019/09/25/the-real-story-of-the-repo-market-meltdown-and-what-it-means-for-bitcoin [https://perma.cc/23X8-QX4F].

Opacity also raises doubts about whether Treasuries collateral is even capable of being enforced, that is, traced and sold by a primary dealer to recover the amount owed after default. Because the market lacks real-time reporting and due diligence, a borrower may not actually own the Treasuries collateral it offers up to secure a debt. Rather, collateral can belong to another party that has agreed to let the borrower use it for a time.34See discussion and sources infra Sections II.A & IV.A. Collateral reuse is commonplace in Treasury-backed repo markets. Complex collateral chains, in which the same Treasury circulates to collateralize multiple loans, has become a feature.35Long, supra note 33. For example, a Lender takes Treasuries from a Borrower as collateral. The Lender can then use these same Treasuries as collateral to borrow cash for itself. According to Manmohan Singh of the International Monetary Fund, each Treasury security collateralizes around three repo loans.36Id. Reuse affords gains in efficiency. In good times, prized Treasuries can help release credit for numerous parties. But during crisis and with opacity endemic, doubts are raised about whether the collateral is traceable and capable of being sold.37Bilateral Repo Data Collection Pilot Project, Off. of Fin. Rsch., https://www.financialresearch.gov/data/repo-data [https://perma.cc/VYY6-FH2C] (describing available data on the bilateral repo market as “scant”). Stated bluntly, even though a particular Treasury can be reused multiple times to release credit, it can be sold only once to cover a loss. Those believing they have a right to its proceeds may find that the Treasury no longer exists precisely when they need it the most. Opacity means that dealers and others cannot know in advance how complex their collateral chain will be, and whether their collateral is as protective as regulation readily assumes.38See discussion and sources infra Section III.A.

Primary dealers also confront opacity in the secondary market for buying and selling Treasuries with investors.39See U.S. Dep’t of the Treasury, Bd. of Governors of the Fed. Rsrv. Sys., Fed. Rsrv. Bank of N.Y., U.S. SEC & U.S. Commodity Futures Trading Comm’n, Joint Staff Report: The U.S. Treasury Market on October 15, 2014 15–19 (2015); James Collin Harkrader & Michael Puglia, Principal Trading Firm Activity in Treasury Cash Markets, Bd. of Governors of the Fed. Rsrv. Sys. (Aug. 4, 2020), https://www.federalreserve.gov/econres/notes/feds-notes/principal-trading-firm-activity-in-treasury-cash-markets-20200804.html [https://perma.cc/9WNL-3TUC]; e.g., Robert Mackenzie Smith, Client List Reveals HFT Dominance on BrokerTec, Risk.net (Sept. 23, 2015), https://www.risk.net/derivatives/interest-rate-derivatives/2426923/client-list-reveals-hft-dominance-on-brokertec [https://perma.cc/6428-PWMB] (showing that the top eight traders on the main interdealer Treasuries trading platform (BrokerTec) were high speed traders); Portia Crowe, High Frequency Traders Are Dominating Another Huge Market, Bus. Insider (Sept. 23, 2015, 10:57 AM), https://www.businessinsider.com/high-frequency-traders-dominate-the-treasuries-market-2015-9 [https://perma.cc/S25Y-QDGP]. Home to over $600 billion in average daily turnover in both 2020 and 2021, this market lacks real transparency.40US Treasury Securities: Issuance, Trading Volume, Outstanding, Holders, Yield Curve Rates, SIFMA Rsch., https://www.sifma.org/resources/research/us-treasury-securities-statistics/us-treasury-securities-statistics-sifma [https://perma.cc/BB7Y-MUJF]. See generally Harkrader & Puglia, supra note 39. Trades are not reported publicly in real time.41Now Available – Weekly Aggregated Reports and Statistics for U.S. Treasury Securities, FINRA (Mar. 10, 2020), https://www.finra.org/filing-reporting/trace/now-available-weekly-aggregated-reports-and-statistics-us-treasury [https://perma.cc/8WZA-W5E9]. The secondary market did not have a comprehensive trade reporting regime until 2017, capable of delivering insights on a trade-by-trade level.42Harkrader & Puglia, supra note 39. The regime that is currently in place mandates reporting to regulators only (rather than wider dissemination). Until February 2023, trading statistics were published weekly and in aggregate, after which regulators permitted once-daily reporting to the public (also in aggregate terms). The reporting regime has also had major gaps historically (for example, it has not required hedge funds to report trades).43Id.; see Treasury Daily Aggregate Statistics – Files, FINRA, https://www.finra.org/finra-data/browse-catalog/about-treasury/daily-file [https://perma.cc/7S67-6BZ9] (providing daily reporting on trading volume); Treasury Weekly Aggregate Statistics, FINRA, https://www.finra.org/finra-data/browse-catalog/about-treasury/weekly-data [https://perma.cc/VBL3-JCHB] (providing weekly reporting of U.S. treasuries trades, discontinued after February 2023). On February 6, 2024, the SEC approved rules that requires those engaging as a government securities dealer and providing significant liquidity to the market “as a part of a regular business” to register with the SEC, become a part of a self-regulatory organization, and comply with various securities laws. Whereas the earlier trade reporting regime applied to broker-dealers only, thereby excluding hedge funds typically, the new regime can capture liquidity-providing hedge funds and require these funds to register as broker-dealers. These new rules have proved controversial and are being challenged in court by hedge fund industry participants at the time of writing. U.S. SEC, Final Rules: Changes to Definition of Dealer and Government Securities Dealer 1 (2024), https://www.sec.gov/files/34-99477-fact-sheet.pdf [https://web.archive.org/web/20240708062607/https://www.sec.gov/files/34-99477-fact-sheet.pdf]. On the challenge of the new rules in court, see, e.g., Kate Duguid, Treasury Market Reforms Draw Flak from Funds and High-Speed Traders, Fin. Times (June 30, 2022), https://www.ft.com/content/4cc84b80-caca-4ed7-998c-2fb1956ec930 [https://perma.cc/4C62-9GTN]; Davide Barbuscia, Hedge Fund Industry Groups Sue US SEC over Treasury Market Dealer Rule, Reuters (March 18, 2022, 1:27 PM), https://www.reuters.com/markets/us/hedge-fund-industry-groups-sue-us-sec-over-treasury-market-dealer-rule-2024-03-18 [https://perma.cc/RN7B-SCDG]. Limited, comprehensive real-time disclosure adds to the monitoring costs faced by primary dealers, forcing them to buy and organize trading data privately. This can add delays and inaccuracies to data processing, making it harder to determine how risks are building in real time (for example, predicting large orders, predatory traders, or price dislocations).

This opacity feeds tension within a system of intermediation that must meet the needs of both public and private financial regulation at the same time. That is, actions taken by primary dealers to protect repo market operations for private firms can come at a cost to maintaining trading in the secondary market for the wider public.

Reliance on Treasuries as collateral in repo funding markets means that the availability of these securities for trading in secondary markets can become restricted. The repo market requires trillions of dollars in Treasuries (and cash) to be set apart daily to support private lending and borrowing.44See generally SIFMA Rsch., supra note 26. The free-float of Treasuries—or the amount of Treasuries that are circulating freely at a given point in time—is thus reduced by what must be earmarked to support trillions in daily repo operations.45See David Lam, Bing-Xuan Lin & David Michayluk, Demand and Supply and Their Relationship to Liquidity: Evidence from the S&P 500 Change to Free Float, 67 Fin. Analysts J. 55, 55–57 (2011); Xiaoya (Sara) Ding, Yang Ni & Ligang Zhong, Free Float and Market Liquidity Around the World, 38 J. Empirical Fin. 236, 237 (2016). To take a stylized example, if the face value of a single Treasury bond is $1,000, and a particular Treasury bond issue has five million such bonds, then the total face value issued is $5 billion. That is the total supply. Suppose two million of these bonds have been bought by the Fed and are not readily available for being bought and sold. Suppose further that another two million of these bonds are passively held long-term in private accounts, and are, again, not readily available for buying and selling. Thus, at any point of time, only $1 billion is the available “free float.” In a crisis, primary dealers must rapidly shore up Treasuries collateral in repo operations to protect financial stability and ensure that sufficient collateral exists to support trillions in exposure between private firms. In cases when the repo market gets securely ring-fenced, secondary markets can become strained as primary dealers have a smaller supply of assets with which to respond to investors wanting to buy and sell Treasuries in a panic.46The short-term financing rate in repo trades also links the prices of Treasuries with those of Treasury bond futures contracts. “Basis” or “relative-value” trades ensure that these three remain economically aligned. The high volatility of the repo rate during March 2020 led to large short-term losses for hedge funds doing relative-value trades. An Office of Financial Research study suggests that leveraged hedge funds cashing out of these “basis trades” are unlikely to have amplified the illiquidity in treasury securities during the March panic. Daniel Barth & Jay Kahn, Off. of Fin. Rsch., Basis Trades and Treasury Market Illiquidity 11–13 (2020), https://www.financialresearch.gov/briefs/files/OFRBr_2020_01_Basis-Trades.pdf [https://perma.cc/7UMW-CZ6A]. But see Jeanna Smialek & Deborah B. Solomon, A Hedge Fund Bailout Highlights How Regulators Ignored Big Risks, N.Y. Times (Jul. 23, 2020), https://www.nytimes.com/2020/07/23/business/economy/hedge-fund-bailout-dodd-frank.html [https://perma.cc/5C2V-UPCU].

Opacity contributes to the challenge primary dealers face in ensuring steady intermediation to both repo and secondary markets. A lack of full and real-time information means that primary dealers face constant difficulties in attempting to predict the needs of the repo market—like how much cash and Treasuries are needed on any given day. These demands are hard to predict in any event. As a market for funding the daily life of financial firms, pressure on the repo market can vary wildly depending on any number of factors like seasonality (for example, making payroll), time of day (for example, reduced demand during lunchtimes), and the nature of the firm’s business (for example, banks requiring large amounts of cash).47See generally Cheng & Wessel, supra note 20. In March 2020, for example, weekly collateral needs varied by more than $350 billion for positions held by primary dealers.48See discussion infra Section IV.A. While the secondary market for Treasuries tends to be more stable, crises can trigger an unexpected spike. For example, total aggregate weekly trading in the turbulent week of March 6, 2020, was around $5.7 trillion. By late July, however, activity volumes had normalized, and the secondary market saw around $3 trillion in weekly aggregate trading volume.49See discussion infra Section IV.A.

This tension between protecting repo markets and maintaining resilience in secondary trading creates the danger that intermediaries stop performing when the costs of doing so become too high. Regulation does not require dealers to remain trading.50Scaggs, supra note 32. If the cost-benefit trade-off of intermediation becomes overly expensive, intermediaries withdraw. Or, they choose to protect one market over the other, depending on profitability, important client relationships, and keeping a reputational halo.51See generally SIFMA Rsch., supra note 26 (on the dominance of primary dealers in repo markets). Stated differently, for public and private financial regulation to currently remain credible, Treasuries intermediation must be lucrative business for primary dealers.

When primary dealers face such a choice, they have powerful incentives to resolve the tension in favor of the repo market. The repo market is much larger than the secondary market. For example, to take a more typical week in 2020, for example, the week of July 29, 2020, the average daily trading in the secondary market by primary dealers was $518 billion and their average daily risk exposure was $271 billion.52See sources and discussion infra Sections IV.A–B. By comparison, in the repo market, primary dealers had lent out around $1.58 trillion and borrowed $1.81 trillion of Treasuries.53See sources and discussion infra Section IV.B. Taken together, their repo activity measured about six times their average daily secondary market trading volume of Treasuries, and about twelve times their daily average exposure in the Treasuries secondary market.54See sources and discussion infra Section IV.B. With its size and repeat client relationships, dealers can make profitable gains by focusing resources in the repo market ahead of the secondary market.55Adam Copeland, Isaac Davis, Eric LeSueur & Antoine Martin, Lifting the Veil on the U.S. Bilateral Repo Market, Fed. Rsrv. Bank of N.Y.: Liberty St. Econ. (July 9, 2014), https://libertystreeteconomics.newyorkfed.org/2014/07/lifting-the-veil-on-the-us-bilateral-repo-market.html [https://perma.cc/X4BZ-4YHA]. Copeland et al. estimate that primary dealers are involved in almost 80% of repos in the bilateral repo market—the largest segment in which parties connect and lend to one another directly. Id. But this private preference comes with a collective price, in which the secondary market can become disrupted and fails to function as a safe haven for investors at large. Taken as a whole, serious pressures on dealer balance sheets can damage Treasury market function. As shown by Darrell Duffie et al., the quality of U.S. Treasury market operations deteriorates markedly when dealers are forced to delve deep into their balance sheet to intermediate trading.56Darrell Duffie, Michael Fleming, Frank Keane, Claire Nelson, Or Shachar & Peter Van Tassel, Dealer Capacity and US Treasury Market Functionality 2 (Bank for Int’l Settlements, Monetary & Econ. Dep’t, Working Paper No. 1138, 2023).

 In our second contribution, we show that regulators are poorly placed to recognize the tension between a system of public and private financial regulation that is so deeply reliant on Treasuries to function.

That regulators have failed to account for the structural interlinkages between repo and secondary markets is not surprising. From the institutional standpoint, secondary trading and repo markets are subject to a patchwork system of fragmented oversight, governed by a rule book that has failed to adapt to changing market design.57See generally Yadav, supra note 19 (discussing and analyzing the regulation of Treasury market structure). The market does not have a lead regulator; oversight of the secondary market is shared by five or more agencies.58Id., at 1193–99, 1219–27. The repo market, by contrast, looks largely to the Federal Reserve (“the Fed”) and the Federal Reserve Bank of New York (“NY Fed”) for supervision.59Id. This confusing division of authority breeds gaps and blind spots. Owing to fragmentation and an absence of coordination, regulators lack a coherent picture of the risks that run between repo and secondary markets. Rulemaking is costly given the need to overcome bureaucratic walls and divergences in institutional mandates and approaches between different regulators.60Id.

Importantly, each market is regulated in accordance with distinctive methodological approaches. The secondary market for Treasuries trading (broadly speaking) hews to a more capital markets–based approach that focuses on generating smooth trading, price efficiency, and trade reporting to regulators.61Doug Brain, Michiel De Pooter, Dobrislav Dobrev, Michael J. Fleming, Peter Johansson, Collin Jones, Frank M. Keane, Michael Puglia, Liza Reiderman, Anthony P. Rodrigues & Or Shachar, Unlocking the Treasury Market Through TRACE, Fed. Rsrv. Bank N.Y.: Liberty St. Econ. (Sept. 28, 2018), https://libertystreeteconomics.newyorkfed.org/2018/09/unlocking-the-treasury-market-through-trace [https://perma.cc/23EG-VS9Q] (describing liquidity in Treasuries trading and emphasizing greater reporting to regulators as a way to create understanding of the market). By contrast, repo markets fall under a more “prudential” framing that protects the systemic soundness of firms and the market. Disclosure and pricing carry far less emphasis than ensuring that firms avoid default and do not sicken one another if one of them collapses.62Viktoria Baklanova, Adam Copeland & Rebecca McCaughrin, Fed. Rsrv. Bank of N.Y., Reference Guide to U.S. Repo and Securities Lending Markets 34–37 (2015) (highlighting efforts to prevent contagion in repo markets). See generally Cheng & Wessel, supra note 20; SIFMA Rsch., supra note 26; 17. Who Regulates the Repo Market?, Int’l Cap. Mkt. Ass’n., https://www.icmagroup.org/Regulatory-Policy-and-Market-Practice/repo-and-collateral-markets/icma-ercc-publications/frequently-asked-questions-on-repo/17-who-regulates-the-repo-market [https://perma.cc/JT8H-JD84]. A prudential model prioritizes collateralization and deep capital buffers, and can come with the (implied) promise of federal protection in case firm failure sets off systemic contagion.63See discussion and sources infra Part III. These differences in regulatory approach complicate rulemaking, monitoring, and coordination challenges already pervasive to the task of overseeing Treasury repo and secondary markets. Regulators cannot fill information gaps because Treasuries collateralization reduces the need to gather and disclose data in real time. Data gathering in the secondary market also remains patchy. Without full information, policymakers cannot know what tools might work best to prevent sudden loss of liquidity and price distortions. Because ex post interventions to stabilize the market are available, regulators may prefer to rely on them rather than to engage in complex, ex ante, administratively costly rulemaking. When the Treasury market failed in March 2020, the Fed stepped in immediately, making around $1.5 trillion in cash and Treasuries available to primary dealers in a bid to revive intermediation.64Nick Timiraos & Julia-Ambra Verlaine, Fed to Inject $1.5 Trillion in Bid to Prevent ‘Unusual Disruptions’ in Markets, Wall St. J. (March 12, 2020, 5:08 PM), https://www.wsj.com/articles/fed-to-inject-1-5-trillion-in-bid-to-prevent-unusual-disruptions-in-markets-11584033537 [https://perma.cc/RE9Q-TDZF]. This funding was just one measure out of many that was implemented by the Federal Reserve (“the Fed”) and the Federal Reserve Bank of New York (“NY Fed”) to strengthen the liquidity of Treasuries and other securities markets. For discussion of the Fed’s larger response to COVID-19, see also Michael Fleming, Asani Sarkar & Peter Van Tassel, The COVID-19 Pandemic and the Fed’s Response, Fed. Rsrv. Bank of N.Y.: Liberty St. Econ. (Apr. 15, 2020), https://libertystreeteconomics.newyorkfed.org/2020/04/the-covid-19-pandemic-and-the-feds-response.html [https://perma.cc/UV5S-H8GU].

A final observation on the economic significance of the Treasury market. From the standpoint of political economy, weakness in Treasury market structure is profoundly problematic for the status of U.S. debt as the global risk-free asset that is a lynchpin for financial stability. As Anna Gelpern and Erik Gerding write, the notion of a risk-free asset is one that is legally constructed rather than being intrinsically real.65See generally Gelpern & Gerding, supra note 17. Default arises as a matter of contractual design.66See generally id. It is conventionally believed that the United States will pay its debts. However, in theory, it may default.67See generally id. Our thanks also to Mitu Gulati for underscoring this point. On the contractual basis for default in U.S. government debt and analysis of historical instances in which the United Sttes has failed to pay (most recently in 1979), see generally D. Andrew Austin, Cong. Rsch. Serv., R44704, Has the U.S. Government Ever “Defaulted”? (2016). The most tangible manifestation of Treasuries, their power and prestige, comes from the workings of the market—by investors buying and selling Treasuries, or by using Treasuries as collateral to release economic value. Public oversight and private industry self-regulation reinforce this real-world compact. This collective practice makes failures in Treasury market structure particularly dangerous for the long-term dominance of the United States. With the Treasury’s risk-free status ultimately ephemeral, a disrupted market undermines the most fundamental article of faith about the power of the U.S. economy and its financial system.68Our thanks to Anna Gelpern for this framework of thinking about risk-free assets.

In conclusion, to repair the broken promise of Treasuries in financial regulation, this Article proposes a three-part solution for reform. As a starting point, it advocates for systematically greater transparency and reporting, particularly in more opaque repo markets. A richer understanding of how this market works can help regulators and dealers manage their risks, address conflicts, and unravel complexities between the secondary and repo markets. Secondly, the Article seeks to require dealers to maintain intermediation, rather than exit the market at will. Even with information, dealers can still stop intermediating both repo and secondary trading whenever this task becomes too difficult or expensive. As noted earlier, there are no rules keeping key dealers in the market in crisis periods, and so such intermediation can disappear at any moment. To counter this risk, we propose that regulators expressly require key dealers to affirmatively maintain trading and price stability in Treasuries, even in crisis.69Thanks to conversations with Kumar Venkataraman and policymakers for thinking around this idea. We consider this to be necessary in light of the fundamental reliance that financial regulation places on the steadfastness of the intermediation system for Treasuries. Importantly, such a mandate is familiar. For example, trading on the New York Stock Exchange (“NYSE”) was long maintained by dealers that contracted to support trading and price stability during crises.70See discussion and sources infra Section III.B. In addition to being well-worn and familiar, this mandate offers realistic assurance that the Treasury market will always provide liquidity, and, in particular, do so when such liquidity is most needed and when the chances of market failure are greatest. Finally, we support thoroughgoing reform of the regulatory structure for the repo and secondary market to harness the potential for coordination offered by the Financial Stability Oversight Council (“FSOC”).71This proposal supports and refines the proposal set out in Yadav, supra note 19. Created in the wake of the 2008 Financial Crisis, we believe that it is well placed to coordinate a more streamlined approach to rulemaking and supervision and to holistically view the repo and secondary market for Treasuries as interconnected.72Id. at 1236–44 (introducing the importance of coordination under the Financial Stability Oversight Council (“FSOC”)).

This Article proceeds as follows. Part I provides an overview of why Treasuries are risk-free and the reliance placed on their risk-free status in public regulation. It details the centrality of primary dealers to intermediation and market function. Part II analyzes the workings of the multitrillion-dollar repo market and the anchoring role of Treasuries in private contracting. Part III develops a novel account of the interconnected risks of intermediation in both repo and secondary markets to show that it is undermined by opacity, conflict, and complexity. Part IV sets out a solution to remedy fragility in Treasury market design. Part V concludes.

I.  TREASURIES AND THE FINANCIAL SYSTEM

Beyond funding the affairs of state, the Treasury market represents a foundational pillar of global financial stability. A Treasury bond is perceived to be a default-free security that is capable of being traded easily at fair prices, offering investors an asset that can serve as a safe, cash-like store of value.73Michael Fleming, How Has Treasury Market Liquidity Evolved in 2023?, Fed. Rsrv. Bank of N.Y.: Liberty St. Econ. (Oct. 17, 2023), https://libertystreeteconomics.newyorkfed.org/2023/10/how-has-treasury-market-liquidity-evolved-in-2023 [https://perma.cc/2UTR-D4LX]. These attributes ensure that Treasuries occupy a central place in regulation as an asset capable of being used by financial institutions to protect themselves and the market from sudden insolvency and systemic collapse. In public regulation, financial firms must keep Treasuries as part of their firm’s rainy day reserves. By owning Treasuries, firms hold a security that has predictable cash flows and is assumed to be rapidly tradable to generate cash when it is in trouble. Similarly, Treasuries are critical to private self-regulation in anchoring everyday lending between financial firms. They constitute the preferred form of collateral in the five-trillion-dollar repurchase market, allowing firms to borrow and lend to one other safely without undertaking prior due diligence.74Peter Hördahl & Michael R. King, Developments in Repo Markets During the Financial Turmoil, BIS Q. Rev., Dec. 2008, at 37, 39 (detailing the flight to Treasuries in repo markets during the 2008 Financial Crisis); James Clark & Tom Katzenbach, Examining Changes in the Treasury Repo Market After the Financial Crisis, U.S. Dep’t of the Treasury: Treasury Notes (Oct. 12, 2016), https:/www.treasury.gov/connect/blog/Pages/Examining-Changes-in-the-Treasury-Repo-Market-after-the-Financial-Crisis.aspx [https://web.archive.org/web/20161222095332/https:/www.treasury.gov/connect/blog/Pages/Examining-Changes-in-the-Treasury-Repo-Market-after-the-Financial-Crisis.aspx] (noting the high use of Treasury collateral in repo markets). See generally John Mullin, The Repo Market Is Changing (and What Is a Repo, Anyway?), Fed. Rsrv. Bank of Richmond (2020), https://www.richmondfed.org/publications/research/econ_focus/2020/q1/federal_reserve [https://perma.cc/994U-8LNF].

This Part explains why Treasuries have acquired this stature as the foremost safe asset and haven for global financial stability.75Bouveret et al., supra note 3, at 5–6. It highlights that the usefulness of Treasuries is comprised of two main attributes: (1) they are, for all intents and purposes, default-free, meaning that the United States will pay its debts, and (2) they are supposed to be highly tradable (or liquid), capable of being bought and sold in their secondary market with ease, at a fair price, and without trades causing prices to become distorted.76Fleming, supra note 73. These two attributes, while linked, are distinct from one another. This Part describes the key features of this secondary market and its regulation. Like any other active market, Treasuries trade in an environment that is operationally complex and risky. These risks are amplified by a unique framework of public oversight that is fragmented and lacking in leadership, making rulemaking and supervision subject to coordination costs and delays.77See generally Yadav, supra note 19 (discussing the regulatory system for U.S. Treasuries).

A.  Why Treasuries Are Risk-Free

The Treasury market is critical to economic life in the United States and the health of financial markets globally.78This descriptive account of the U.S. Treasury market and its significance is based on and extends the analysis set out in Yadav, supra note 19, at 1187–90. Public debt has proven to be a transformative force for the country.79Marcin Kacperczyk, Christophe Pérignon & Guillaume Vuillemey, The Private Production of Safe Assets, 76 J. Fin. 495, 496–98 (2021); Dominique Dupont & Brian Sack, The Treasury Securities Market: Overview and Recent Developments, Fed. Rsrv. Bull., Dec. 1999, at 785, 786–87, https://www.federalreserve.gov/pubs/bulletin/1999/1299lead.pdf [https://perma.cc/SAF9-G6HY]. See generally Gelpern & Gerding, supra note 17. It has allowed Congress to implement major policy initiatives such as public works projects, wars, and efforts to counteract economic misfortunes.80Matt Phillips, The Long Story of U.S. Debt, from 1790 to 2011, in 1 Little Chart, Atlantic (Nov. 13, 2012), https://www.theatlantic.com/business/archive/2012/11/the-long-story-of-us-debt-from-1790-to-2011-in-1-little-chart/265185 [https://perma.cc/WNU3-Z829]; Peter M. Garber, Alexander Hamilton’s Market Based Debt Reduction Plan 14–16 (Nat’l Bureau of Econ. Rsch., Working Paper No. 3597, 1991). The Treasury market provides policymakers with power to pursue far-reaching goals.81See Phillips, supra note 80. Policies do not have to be constrained by present-day taxpayer contributions. Rather, the Treasury can tap into global capital markets to raise money.82See generally Justin Lahart, The Treasury Market Is Having a Senior Moment, Wall St. J. (June 6, 2018, 1:53 PM), https://www.wsj.com/articles/the-treasury-market-is-having-a-senior-moment-1528307631 [https://perma.cc/6BEU-VAAY]; Rafael A. Bayley, The National Loans of the United States, from July 4, 1776, to June 30, 1880 (2d ed. 1882), https://catalog.hathitrust.org/Record/009011064 [https://perma.cc/P526-BGDN]. For historical context, see Dupont & Sack, supra note 79, at 786–87. Crucially, the economic and political heft of the United States enables investors to have confidence that whatever they lend to the Treasury will be repaid exactly as promised.83See generally Dupont & Sack, supra note 79. This credibility means that the United States can borrow to fund itself much more cheaply than other countries with weaker economies and political institutions.84Neil H. Buchanan & Michael C. Dorf, How to Choose the Least Unconstitutional Option: Lessons for the President (and Others) from the Debt Ceiling Standoff, 112 Colum. L. Rev. 1175, 1177–81 (2011) (detailing the constitutional basis for government borrowing). See generally Garrett Epps, Our National Debt ‘Shall Not Be Questioned,’ the Constitution Says, Atlantic (May 4, 2011), https://www.theatlantic.com/politics/archive/2011/05/our-national-debt-shall-not-be-questioned-the-constitution-says/238269 [https://perma.cc/H8L8-SDCU].

Holding a default-free asset can be uniquely advantageous. Investors can be sure that the money they lend to the U.S. government is safe. Importantly, Treasuries provide a counterpoint to a portfolio containing a mix of investments with riskier options like corporate debt or equity. Whereas other assets involve varying cash flows, uncertainties in valuation, periods where they become hard to sell, or lose their value (in case of bankruptcy), Treasuries are not supposed to face any such danger.85U.S. Treasury Securities, FINRA, https://www.finra.org/investors/learn-to-invest/types-investments/bonds/types-of-bonds/us-treasury-securities [https://perma.cc/2SJL-4274]. Instead, investors believe Treasuries will perform in accordance with their terms, retain value, price, and currency stability. It follows that Treasuries have long been viewed as the safe haven for domestic as well as foreign investors, including other sovereigns looking to invest their reserves.86Yadav, supra note 19, at 1186–90. See generally Gelpern & Gerding, supra note 17.

The key attributes of a Treasury bond—default-free, denominated in the U.S. dollar, designed to be paid out in specific maturities and simple to value—are bolstered by its tradability (for example, its ability to be bought and sold quickly and cheaply without significantly impacting prices).87James Clark & Gabriel Mann, A Deeper Look at Liquidity Conditions in the Treasury Market, U.S. Dep’t of the Treasury: Treasury Notes (May 6, 2016), https:/www.treasury.gov/connect/blog/Pages/A-Deeper-Look-at-Liquidity-Conditions-in-the-Treasury-Market.aspx [https://web.archive.org/web/20160808194502/https:/www.treasury.gov/connect/blog/Pages/A-Deeper-Look-at-Liquidity-Conditions-in-the-Treasury-Market.aspx] (discussing liquidity metrics for the Treasury market and noting transitional elements in market structure). Official government reports into the Treasury market commonly begin by observing that it constitutes the “deepest and most liquid government securities market in the world.”88Id.; Statement from Luis A. Aguilar, Comm’r, U.S. Sec. & Exch. Comm’n, Ere Misery Made Me Wise – The Need to Revisit the Regulatory Framework of the U.S. Treasury Market, (Jul. 14, 2015), https://www.sec.gov/news/statement/need-revisit-regulatory-framework-us-treasury-market [https://perma.cc/QXC8-LKHY].

This liquidity represents a hallmark without which Treasuries could not attract investors as easily.89Cheng et al., supra note 1; Samson et al., supra note 11. Those holding Treasuries would not be able to turn them into cash, while those wishing to add Treasuries to their portfolios would struggle to purchase them. If investors lack liquidity, they will charge the U.S. government more to reflect the cost of keeping a less tradable investment on their books.90On the characteristics of safe assets, see generally Gelpern & Gerding, supra note 17.

B.  Making Treasuries Tradable

Through much of its history, regulators have relied on a cohort of international banks and investment banks—designated as primary dealers—to support intermediation in Treasury markets.91Dupont & Sack, supra note 79, at 787. After the initial issue by the U.S. Treasury (in what is called the “primary” market), the secondary market for day-to-day trading of Treasury securities is divided into two parts: (1) the segment where primary dealers and other dealers transact externally with customers (like foreign governments, or mutual funds) to buy and sell Treasuries, and (2) the interdealer segment where dealers internally transact with one another in order to even out or otherwise manage the inventories they hold of Treasuries. If some dealers want Treasuries but do not have them, while others wish to sell, the interdealer trading market offers a space for Treasury dealers to be able to transact with one another.92See Kevin McPartland, Greenwich Assocs., Sizing and Segmenting in the U.S. Treasury Market 2 (2017), https://www.greenwich.com/fixed-income-fx-cmds/sizing-and-segmenting-trading-us-treasury-market-0 [https://perma.cc/TLS5-KFQT]; Ken Monahan, TRACE “Unlocks” the Treasury Market for the Official Sector. Everyone Else Gets a Peek Through the Keyhole, Coal. Greenwich (Oct. 3, 2018), https://www.greenwich.com/blog/frbny-trace-unlocks-treasury-market-everyone-else-gets-peek-through-keyhole [https://perma.cc/53LM-PK6S] (noting some complexities to this core design, for example, to highlight bilateral trading between dealers as a portion of the secondary market). Historically, primary dealers have played a critical role in each of these three parts of the market.93For a fuller discussion and sources, see Yadav, supra note 19, at 1199–203.

Treasuries at Auction: Primary dealers are expected to use their deep pockets to bid for new Treasury securities when they are issued at government auctions. These bids must be at competitive prices, meaning that primary dealers cannot comply with their obligations by simply making unrealistic and unrealizable bids.94Primary Dealers: Specific Expectations & Eligibility Requirements, Fed. Rsrv. Bank of N.Y. [hereinafter Specific Expectations], https://www.newyorkfed.org/markets/primarydealers [https://perma.cc/TQ8F-NAQD]. See generally Federal Reserve Bank of New York Policy on Counterparties for Market Operations, Fed. Rsrv. Bank of N.Y. (Nov. 9, 2016), https://www.newyorkfed.org/markets/counterparties/policy-on-counterparties-for-market-operations [https://perma.cc/7DX3-HDJR]. The government places great trust in primary dealers by relying on these firms to act as counterparties at every issue of public debt. In their 2007 study, Michael Fleming et al. show that primary dealers purchased an average of 71% of new issues.95Michael Fleming, Giang Nguyen & Joshua Rosenberg, Fed. Rsrv. Bank of N.Y., How Do Treasury Dealers Manage Their Positions? 7 (2007); Michael J. Fleming, Who Buys Treasury Securities at Auction?, 13 Current Issues Econ. & Fin. 1, 3 (2007), https://www.newyorkfed.org/medialibrary/media/research/current_issues/ci13-1.pdf [https://perma.cc/F3TQ-PUWS].

Given regular and heavy demands on their balance sheets, primary dealers are chosen from among those that can demonstrate the financial capacity to purchase, hold, and trade these securities. Though the NY Fed slightly relaxed eligibility conditions in 2016, the firms that perform primary dealer functions come from the ranks of well-regulated financial institutions—mostly banks.96Specific Expectations, supra note 94; FAQs About the New York Fed’s Counterparty Framework for Market Operations, Fed. Rsrv. Bank of N.Y. (Nov. 9, 2016), https://www.newyorkfed.org/markets/counterparties/faq-counterparty-framework-for-market-operations [https://perma.cc/A4RN-YXGN]. To qualify, firms must be regulated as a well-capitalized bank or as a broker-dealer under the jurisdiction of the SEC and the Financial Industry Regulation Authority. As part of their agreement, primary dealers provide the NY Fed with weekly reports into the activities of the Treasury market.97Specific Expectations, supra note 94.

Secondary Market – Dealer-Client: The secondary market comprises two major segments. In the dealer-client market, investors like foreign governments or mutual funds come to buy or sell Treasuries. This segment is critical for ensuring that Treasuries are widely available and capable of performing their stabilizing, protective function. Trading volume for a recent single day—July 31, 2024—in this dealer-client segment was about $735 billion.98Treasury Daily Aggregate Statistics – Files, FINRA (Jul. 31, 2024), https://www.finra.org/finra-data/browse-catalog/about-treasury/daily-file [https://perma.cc/N9DX-G6MZ] (choose “July 2024”; then choose “July 31, 2024”). See generally Brain et al., supra note 61.

Primary dealers have a major advantage in the dealer-client market. As large and internationally active financial firms, they possess an ample base of clients with which to transact. This network provides the means by which to operationalize the protective function of Treasuries by ensuring they are widely distributable.99Specific Expectations, supra note 94 (making it a condition for designation that an applicant be able to show that they have been active in making a market in Treasuries); e.g., Joe Rennison, Amherst Pierpont Becomes ‘Primary Dealer’ for US Treasury Debt, Fin. Times (May 6, 2019), https://www.ft.com/content/b3911dc8-7047-11e9-bbfb-5c68069fbd15 [https://perma.cc/7KYB-WNAN] (describing the importance of transacting with a Primary Dealer for certain kinds of investors). Importantly, by dint of their participation in Treasury auctions, primary dealers have sizable inventories of securities that they can transmit.100Kevin McPartland, Greenwich Assocs., U.S. Treasury Trading: The Intersection of Liquidity Makers and Takers 3 (2015), https://www.greenwich.com/fixed-income-fx-cmds/us-treasury-trading-intersection-liquidity-makers-and-takers [https://perma.cc/P7MV-6Z9T]. Indeed, when seeking to bid for Treasuries at auction, primary dealers usually collect indications of interest from major clients beforehand, ensuring that that they are able to capture and meet demand more precisely.101Fleming et al., supra note 95, at 2–3.

The structure of the dealer-client market is based on an over-the-counter design. Clients contact primary dealers (and other dealers) using telephones or computer screens to ask for bids on what they are willing to buy and sell and at what price.102This is a simple description of a request-for-quote system that, in the dealer-to-client segment of the Treasury market, is provided by two major providers, Bloomberg and Tradeweb. This is not an exchange-type system, but a platform that enables an electronic interaction bilaterally between a customer-dealer, or between a customer and multiple dealers to request bids. For discussion, see generally McPartland, supra note 100. It rewards those most capable of developing repeat relationships with leading investors like foreign governments, insurance, or pension funds. By their proximity to Treasury auctions, the ability to anticipate client appetites, and experience, primary dealers generally represent an efficient and informed disseminator of Treasuries across the world.103Rennison, supra note 99 (discussing the gains of primary dealer status).

Secondary Market Interdealer: The interdealer market represents the other main segment of the secondary market.104Brain et al., supra note 61; see McPartland, supra note 100, at 6–7. It helps Treasuries dealers to even out their reserves by selling what they do not need to other dealers or dipping into this market to buy when they face a shortfall. On July 31, 2024, the day chosen above to illustrate recent dealer-client trading, the volume of trading in the interdealer segment was similar in magnitude to that of dealer-client trading, at around $729 billion.105FINRA, supra note 98. See generally Brain et al., supra note 61.

The interdealer market fulfills two key functions. One, it helps reduce the risk that primary dealers and others face gluts or scarcity in their inventory of Treasuries. It provides a mechanism whereby the availability of Treasuries can be modulated between dealers to meet their external client demands. Primary dealers can sometimes face sudden, one-sided demand. With COVID-19 triggering panic in March 2020, investors sought en masse to sell Treasuries in order to get their hands on cash.106Colby Smith & Robin Wigglesworth, US Treasuries: The Lessons from March’s Market Meltdown, Fin. Times (July 28, 2020), https://www.ft.com/content/ea6f3104-eeec-466a-a082-76ae78d430fd [https://perma.cc/B9FN-RSFT] (noting investor surprise at misfiring Treasuries prices). According to Vissing-Jorgensen, sales by foreign investors, mutual funds, and the household sector (including hedge funds) came to around $287 billion, $266 billion, and $194 billion, respectively, in the first quarter of 2020 alone.107Vissing-Jorgensen, supra note 15, at 21; see also Bryan Noeth & Rajdeep Sengupta, Flight to Safety and U.S. Treasury Securities, Reg’l Economist, July 2010, at 18, https://www.stlouisfed.org/-/media/project/frbstl/stlouisfed/files/pdfs/publications/pub_assets/pdf/re/2010/c/treasury_securities.pdf [https://perma.cc/5DKE-DWWK] (showing how Treasuries and their stabilizing features provide a safety buffer against other volatile asset classes).

A market to regulate supply and demand ensures that the dealer-client market is not disrupted because dealers are unable to obtain Treasuries or cash. For example, confronting heavy demand to buy from a sovereign, dealers can go into the interdealer market to supplement thinning reserves. By doing so, they fulfill client demand. The ability to sell excess inventory to other dealers means that dealers face fewer costs in keeping securities on their balance sheets.

Two, the ability to meet client demand smoothly means that the market becomes less vulnerable to sudden spikes or plunges in Treasury prices. When dealers can transact with one another to manage their supply of cash and Treasuries, they can lower costs to clients. The effects of scarcity or oversupply can be managed, helping markets remain more reliable and affordable for investors.108See generally Harkrader & Puglia, supra note 39.

It is worth briefly noting that, from the mid-2000s, the interdealer market has undergone a structural shift to transition from an analog space to one that is now largely automated.109Bouveret et al., supra note 3, at 6–10. But see generally McPartland, supra note 100 (highlighting that a segment of the interdealer market uses voice-based trading but posits that this portion deals with trading off-the-run Treasuries). Its plumbing has transformed from reliance on telephones to the use of fast, artificially intelligent algorithms to drive trading.110Bouveret et al., supra note 3, at 6–9; Bruce Mizrach & Christopher J. Neely, The Microstructure of the U.S. Treasury Market 6–7 (Fed. Rsrv. Bank of St. Louis, Working Paper No. 2007-052B, 2008) (detailing key aspects of Treasury microstructure and the historical reliance on over-the-counter trading). See generally John Bates, U.S. Commodity Futures Trading Comm’n, Algorithmic Trading and High Frequency Trading: Experiences from the Market and Thoughts on Regulatory Requirements (2010), http://www.cftc.gov/ucm/groups/public/@newsroom/documents/file/tac_071410_binder.pdf [https://perma.cc/3YBA-2E7R]. High-frequency trading (“HFT”)—in which Treasuries are being bought and sold in milliseconds or less—dominates, driving around 50–75% of trading volume.111Greg Laughlin, Insights into High Frequency Trading from the Virtu Initial Public Offering 2–4 (Ctr. for Analytical Fin., Univ. of Cal. Santa Cruz, Working Paper No. 11, 2014) (discussing the common strategies used by high-frequency trading (“HFT”) market makers); see U.S. Sec. & Exch. Comm’n, Equity Market Structure Literature Review, Part II: High Frequency Trading 4–7 (2014) (setting out the key features of high-frequency trading). One study showed that the median time between trades in ten-year Treasury notes in 2015 was around ten milliseconds.112Ernst Schaumburg & Ron Yang, The Workup, Technology, and Price Discovery in the Interdealer Market for U.S. Treasury Securities, Fed. Rsrv. Bank of N.Y.: Liberty St. Econ. (Feb. 16, 2016), https://libertystreeteconomics.newyorkfed.org/2016/02/the-workup-technology-and-price-discovery-in-the-interdealer-market-for-us-treasury-securities.html [https://perma.cc/J732-XL98]. A decade earlier in 2006, trading speed for this note stood at around one hundred times slower, with transactions occurring around one second apart.113Id. To be sure, this trend is a secular one. Electronic, automated trading has become the norm in equities and derivatives markets, reflecting growth in computing power, data processing, and artificial intelligence since the mid-2000s.114Johannes Breckenfelder, Competition Among High-Frequency Traders and Market, Ctr. for Econ. Pol’y Rsch.: VoxEU (Dec. 17, 2020), https://cepr.org/voxeu/columns/competition-among-high-frequency-traders-and-market-liquidity [https://perma.cc/U24T-VWZK]; U.S. Sec. & Exch. Comm’n, supra note 111, at 4 (noting that over 50% of all trading volume on listed equities could be attributed to HFT); Gov’t Off. for Sci., Foresight: The Future of Computer Trading in Financial Markets 20–48 (2012) (describing the welfare-enhancing gains for markets owing to algorithmic and HFT). Its extensive embrace within the historically staid Treasury market has nevertheless come as something of a surprise.115U.S. Dep’t of the Treasury et al., supra note 39, at 15–19 (describing surprise by regulators at discovering that interdealer Treasury markets were seeing high levels of HFT trading).

High-speed trading has given rise to sources of instability.116Automation and HFT have brought benefits on several measures. Michael J. Fleming, Measuring Treasury Market Liquidity, Fed. Rsrv. Bank of N.Y. Econ. Pol’y Rev., Sept. 2003, at 62, https://www.newyorkfed.org/research/epr/03v09n3/0309flem/0309flem.html [https://perma.cc/Q69K-5ACV]; see also Jonathan Brogaard, Terrence Hendershott & Ryan Riordan, High Frequency Trading and Price Discovery 5, 32–33 (Eur. Cent. Bank, Working Paper No. 1602, 2013). But see Tobias Adrian, Michael J. Fleming, Daniel Stackman & Erik Vogt, Has U.S. Treasury Market Liquidity Deteriorated?, Fed. Rsrv. Bank of N.Y.: Liberty St. Econ. (Aug. 17, 2015), https://libertystreeteconomics.newyorkfed.org/2015/08/has-us-treasury-market-liquidity-deteriorated.html [https://perma.cc/4HGA-PHBR] (noting that bid-ask spreads suggest ample liquidity but noting deterioration on some other measures). For discussion, see generally George J. Jiang, Ingrid Lo & Giorgio Valente, High-Frequency Trading in the U.S. Treasury Market Around Macroeconomic News Announcements (H.K. Inst. for Monetary Rsch., Working Paper No.19/2018, 2018) (noting that high-frequency trading improves price efficiency around macroeconomic events but diminishes liquidity and market depth); Alain P. Chaboud, Benjamin Chiquoine, Erik Hjalmarsson & Clara Vega, Rise of the Machines: Algorithmic Trading in the Foreign Exchange Market, 69 J. Fin. 2045 (2014) (highlighting rapid market-wide efficiencies but also the risk of correlated automated responses to information). An automated interdealer market has introduced new types of traders with a different profile to primary dealers.117Brain et al., supra note 61. HFT firms tend to be smaller securities firms that specialize in computerized trading across multiple types of assets, such as equities. Leading HFT firms are not household names, despite driving heavy volumes of trading. Firms like KCG, Spirex, XR Trading, or Jump Trading—while not as well-known as J.P. Morgan or Wells Fargo—have risen to become major suppliers of liquidity in the interdealer market.118Crowe, supra note 39.

C.  Centrality of Treasuries in Public Regulation

Owing to their default-free status and perceived liquidity, Treasuries have become the go-to protective asset in public financial regulation.119It should be noted that the protective power of Treasuries came under serious scrutiny in the wake of the collapse of Silicon Valley Bank, Silvergate Bank, and Signature Bank in spring 2023. Even though banks held Treasuries as part of their rainy-day buffers, rising inflation resulted in the value of their Treasuries holdings falling precipitously, such that they could not be relied on to save distressed banks in a crisis. Contemplated reforms responding to the March 2023 banking crisis included provisions designed to rethink how Treasuries ought to be accounted for on bank balance sheets. In the immediate aftermath of the 2023 bank crisis, the Fed offered banks a facility that allowed Treasuries to be valued at par value in order to permit banks to extract cash by collateralizing their Treasuries. A full discussion is outside the scope of this Article. For discussion, see, e.g., Mark Maurer, Banks, Investors Revive Push for Changes to Securities Accounting After SVB Collapse, Wall St. J. (Mar. 20, 2023, 1:44 PM), https://www.wsj.com/articles/banks-investors-revive-push-for-changes-to-securities-accounting-after-svb-collapse-99caa9ce [https://web.archive.org/web/20240714153150/https://www.wsj.com/articles/banks-investors-revive-push-for-changes-to-securities-accounting-after-svb-collapse-99caa9ce]. For information on the Fed’s Bank Term Funding Program, see Press Release, Bd. of Governors of the Fed. Rsrv. Sys., Federal Reserve Board Announces It Will Make Available Additional Funding to Eligible Depository Institutions to Help Assure Banks Have the Ability to Meet the Needs of All Their Depositors (Mar. 12, 2023), https://www.federalreserve.gov/newsevents/pressreleases/monetary20230312a.htm [https://perma.cc/2VR8-JFMS]. Following the 2008 Financial Crisis, with top financial firms collapsing or needing bailouts, the rapid loss of solvency raised worries about how best to avoid a repeat episode.120See, e.g., Banking: Regulatory Reform, Fed. Rsrv. Bank of S.F., https://www.frbsf.org/banking/regulation/regulatory-reform [https://perma.cc/LX8B-JHML]. The causes of the 2008 Financial Crisis are complex.121See, e.g., Michael S. Barr, Howell E. Jackson & Margaret E. Tahyar, Financial Regulation: Law and Policy 59–73 (2d ed., 2021). Reform has sought to address numerous vulnerabilities.122Id. at 64–65 (focusing on the scope of the Dodd-Frank Wall Street Reform and Consumer Protection Act 2010); Claudio Borio, Marc Farag & Nikola Tarashev, Post-Crisis International Financial Regulatory Reforms: A Primer (Bank for Int’l Settlement, Working Paper No. 859, 2020). But to bluntly mitigate the catastrophic effects of firms becoming unable to pay out on short-term debts and triggering a domino of failures, buffers of rainy-day and high-quality liquid assets (“HQLA”) now represent a mainstay of financial regulatory design.123See generally Supervisory Policy and Guidance Topic: Capital Adequacy, Bd. of Governors of the Fed. Rsrv. Sys., https://www.federalreserve.gov/supervisionreg/topics/capital.htm [https://perma.cc/BCJ3-L4TU].

To illustrate, banks are required to maintain a cushion of highly liquid assets that can help them to cover their short-term outflows over a stressed thirty-day period. Importantly, firms should be able to convert these assets into cash within just one day, without these assets losing value. In other words, assets must be highly liquid and their fire sale in stressed circumstances ought not to cause their prices to become distorted. Overall, banks need to keep more than 100% of their expected thirty-day liquidity needs in the form of HQLA.124J.P. Morgan, Liquidity Investors and Basel III 4–5 (2015) (discussing methodologies used to calculate high-quality liquid assets (“HQLA”)). This liquidity-coverage ratio (“LCR”) represents a post-2008 reform hallmark, designed to ensure that firms do not cause contagious failures by failing to make good on their immediate commitments.125See Mark House, Tim Sablik & John R. Walter, Fed. Rsrv. Bank of Richmond, Understanding the New Liquidity Coverage Ratio Requirements 4–5 (2016), https://www.richmondfed.org/-/media/richmondfedorg/publications/research/economic_brief/2016/pdf/eb_16-01.pdf [https://perma.cc/6YB8-TU3J]. See generally Liquidity Risk Management Standards, 12 C.F.R. §§ 329.1–.50 (2020). By preserving sufficient reserves of cash and cash-like assets, firms can feel safe in their ability to pay, while others are reassured that they will be repaid. The comfort of reliable liquidity buffers can work to limit the chances of a destructive run, when firms might try and seize cash and other assets in the worry that their counterparties cannot pay.126E.g., Morgan Ricks, The Money Problem: Rethinking Financial Regulation 102–45 (2016) (describing panics resulting from short-term debt holdings).

Treasuries rank in the highest tier of liquid assets for firms seeking to build their buffers of HQLA alongside pure cash and deposits with the Fed.127Jane Ihrig, Edward Kim, Cindy M. Vojtech & Gretchen C. Weinbach, How Have Banks Been Managing the Composition of High-Quality Liquid Assets?, 101 Fed. Rsrv. Bank St. Louis Rev. 177, 181 (2019). While cash and Treasuries are not exactly equivalent (for example, one has to sell a Treasury to generate cash), regulation assumes that they fall within the bandwidth of the same ultrasafe, ultra-stable, and ultra-liquid asset-type that can meet the need for immediate redemptions.128Daniel K. Tarullo, The September Repo Price Spike: Immediate and Longer-Term Issues, Brookings (Dec. 5, 2019), https://www.brookings.edu/research/the-september-repo-price-spike-immediate-and-longer-term-issues [https://perma.cc/MT9M-2GZS] (highlighting divergences between cash and Treasuries despite similar regulatory treatment). Further, Treasuries are supposed to do be bought and sold quickly without causing serious price distortions. Regulation encourages banks to hold unencumbered Treasuries and does not impose any discounting on the value of the Treasuries held.129J.P. Morgan, supra note 124, at 4–5.

It is widely accepted that the LCR has had a dramatic impact on how banks fund themselves and on the kinds of business that they undertake. By being mandated to keep this thick buffer of HQLA to cover outflows over a stressed 30-day period, banks confront a constant demand to maintain (and have access to) a regular supply of Treasuries and cash. In consequence, they have dramatically increased their Treasuries holdings to reflect efforts at compliance.130See Vladimir Yankov, The Liquidity Coverage Ratio and Corporate Liquidity Management, Bd. of Governors of the Fed. Rsrv. Sys.: FEDS Notes (Feb. 26, 2020), https://www.federalreserve.gov/econres/notes/feds-notes/the-liquidity-coverage-ratio-and-corporate-liquidity-management-20200226.htm [https://perma.cc/XC9A-56VS]. On the other side, firms have sought to also adapt their business lines to reduce or adjust the size of the liabilities to be covered within the 30-day window. Services like offering large deposit holdings to clients have incurred a cost in the form of LCR holdings.131J.P. Morgan, supra note 124, at 4–5. As banks must develop new business lines, they need to keep one eye on the Treasuries/cash market to maintain constant compliance with LCR requirements.

Beyond banks, the significance of Treasuries as a cash-like, highly liquid buffer of value has led to financial firms raising their holdings across the board. For example, as Nellie Liang and Pat Parkinson note, open-ended mutual funds hold as much as 12% of all Treasuries outstanding, while hedge funds maintain around 9%.132Nellie Liang & Pat Parkinson, Enhancing Liquidity of the U.S. Treasury Market Under Stress 6 (Brookings, Hutchins Ctr., Working Paper No. 72, 2020), https://www.brookings.edu/wp-content/uploads/2020/12/WP72_Liang-Parkinson.pdf [https://perma.cc/KYR7-WXLG]. According to Liang and Parkinson, such deep reserves of safe-haven securities in the hands of regulated firms point to a readiness on their part to sell Treasuries en masse in order to raise cash in distress.133Id. at 6–7. See generally Kenechukwu Anadu & Viktoria Baklanova, The Intersection of U.S. Money Market Mutual Fund Reforms, Bank Liquidity Requirements, and the Federal Home Loan Bank System (Fed. Rsrv. Bank of Bos., Risk and Pol’y Analysis Unit, Working Paper RPA 17-05, 2017) (discussing the interaction between money market mutual fund liquidity reforms and bank holdings of Treasuries).

D.  Regulating the Secondary Market

This central place for Treasuries gives multiple major regulators an interest in their workings. Perhaps reflecting this quality, the oversight framework for Treasuries divides authority between several regulators, with none having lead status, but all having a stake in supervision.134Yadav, supra note 19, at 1193–97 (discussing fully the regulatory structure for the Treasuries secondary market).

Rather than having one lead regulator, like the Securities and Exchange Commission (“SEC”) is for equities, oversight of Treasuries is divided between at least five major agencies: the Fed, the NY Fed, the U.S. Treasury, the SEC, the Commodity Futures Trading Commission (“CFTC”) and the Financial Industry Regulatory Authority (“FINRA”).135This framework was set up by the Government Securities Act 1986, 15 U.S.C. § 78o-5 (2018). For a detailed discussion of the framework setting out the spheres of authority of various regulators, see Yadav, supra note 19, at 1193–97. The Fed supervises the banks, the SEC and FINRA oversee the securities firms, while the Treasury and NY Fed ensure surveillance over the auction process. The NY Fed also exercises designation authority for primary dealers, but it is not an official regulator for primary dealers.136Yadav, supra note 19, at 1193–96; Fed. Rsrv. Bank of N.Y., supra note 20. The CFTC regulates derivatives markets that are linked to Treasuries trading—notably, Treasury futures.137Yadav, supra note 19, at 1193–96. Overlapping authority is commonplace in U.S. administrative law. As Jody Freeman and Jim Rossi outline, regulators sharing authority bring unique expertise. But they also face impediments, such as barriers to information sharing and the need for coordination in the completion of everyday oversight.138Jody Freeman & Jim Rossi, Agency Coordination in Shared Regulatory Space, 125 Harv. L. Rev. 1131, 1181–88 (2012); Yadav, supra note 19, at 1177–78.

Likely in view of their risk-free status, the usual bevy of rules that apply to traders in major markets either do not exist for Treasuries—or do so weakly. This latitude is born out of the broad exemptions that Treasuries enjoy under the Securities Act of 1933 and the Securities Exchange Act of 1934.139Monahan, supra note 92. Treasuries are still subject to standard anti-fraud protection under Rule 10b-5 and § 10b of the Securities Exchange Act. On exempt securities, see 15 U.S.C. § 77(c); 15 U.S.C. § 78(n). Rule 10b-5 prohibits deception and manipulation with respect to “any security” and does not exclude otherwise exempted government securities. See also Margaret V. Sachs, Are Local Governments Liable under Rule 10b-5? Textualism and Its Limits, 70 Wash. U. L.Q. 19, 19–26 (1992). Regulators themselves often lack a concrete picture about which rules apply to Treasuries and how they should be implemented.140E.g., Letter from Stephen Luparello, Dir., U.S. SEC Div. of Trading and Mkts., to Robert W. Cook, CEO, FINRA (Aug. 19, 2016), https://www.sec.gov/divisions/marketreg/letter-to-finra-regulation-of-us-treasury-securities.pdf [https://perma.cc/8BKU-8BDT]. Commentators observe that out of the thousands of FINRA rules for equity broker-dealers, around forty-six apply to broker-dealers in Treasury markets.141Monahan, supra note 92; 0150. Application of Rules to Exempted Securities Except Municipal Securities, FINRA, https://www.finra.org/rules-guidance/rulebooks/finra-rules/0150 [https://perma.cc/QP2Q-6NTR] (discussing FINRA provisions applicable to Treasuries broker-dealers).

Interestingly, the most visible divergence from classic securities markets regulation (for example, equity and corporate bonds) lies in the area of reporting and information dissemination. Treasuries have historically lacked a trade-by-trade reporting regime.142Steven T. Mnuchin & Craig S. Phillips, U.S. Dep’t of the Treasury, A Financial System that Creates Economic Opportunities: Capital Markets 73–75 (2017); Brain et al., supra note 61 (noting the historic lack of reporting and the impact of the first year of the law with respect to delivering insights about the market). Since 2017, FINRA-regulated securities firms are required to report their trades. Banks must do so as well.143FINRA, Order Approving Proposed Rule Change Relating to the Reporting of Transactions in U.S. Treasury Securities to TRACE, 81 Fed. Reg. 73167 (Oct. 24, 2016). Despite this reform, however, the 2017 regime has long left serious gaps. Those that did not traditionally fall within the category of either a broker, dealer, or bank were not required to report to FINRA. For example, a number of HFT securities firms have typically not reported directly, and neither have hedge funds.144Barbara Novick, Dan Veiner, Hubert De Jesus, Daniel Mayston, Jerry Pucci, Eileen Kiely, Stephen Fisher & Samantha DeZur, BlackRock, Lessons from COVID-19: Market Structure Underlies Interconnectedness of the Financial Market Ecosystem 8–9 (2020), https://www.blackrock.com/corporate/literature/whitepaper/viewpoint-lessons-from-covid-19-market-structure-november-2020.pdf [https://perma.cc/KE5V-CV4X] (noting the difficulty in procuring information on hedge fund trading activities). See generally Harkrader & Puglia, supra note 39. In February 2024, the SEC passed a series of measures requiring major liquidity providers—regardless of their institutional category—to register with the SEC and to report their trades. The functional aim of such rulemaking arguably lies in broadening the regulatory perimeter to require reporting by and cast a spotlight on those undertaking major Treasuries-related business (for example, high-speed traders and hedge funds). But, the SEC’s actions were met with heavy resistance and a court challenge to their validity.145See U.S. SEC, supra note 43; Duguid, supra note 43; Barbuscia, supra note 43. At least until the SEC’s new regulations are implemented, data in relation to non-reporting firms must be captured indirectly—for example, when a non-reporting trader transacts with one that falls under the general 2017 mandate, or data is supplied by a trading platform.146Brain et al., supra note 61. Since April 2019, regulators updated reporting rules to mandate that interdealer trading platforms be able to identify an HFT trader explicitly. Previously, an HFT trade was not specifically identified in reporting. Harkrader & Puglia, supra note 39. See generally Liz Capo McCormick, U.S. Recommends Release of Treasuries Trading Volume Statistics, Bloomberg Law (Sept. 23, 2019, 10:59 AM), https://news.bloomberglaw.com/banking-law/u-s-recommends-release-of-treasuries-trading-volume-statistics [https://perma.cc/TLM3-WRJS] (detailing the reasons behind the historic lack of transparency in Treasuries markets). Further, public reporting of Treasuries data is fairly light and recent. The public gained access to Treasuries secondary market trading data but only since March 2020. This data has generally presented aggregate totals for weekly trading in different kinds of Treasury securities, moving only to daily aggregate reporting beginning in February 2023.147Treasury Daily Aggregate Statistics – Files, FINRA, supra note 43.

***

In summary, the Treasury market represents a critical pillar of the U.S. economy and financial system. Treasuries are viewed as risk-free. This label, however, conflates two aspects of Treasuries: (1) the likelihood of repayment and (2) their trading in secondary markets. With respect to the former, it is widely accepted that the United States will not default. However, in relation to the latter, the system of trading for Treasuries does present real risks. It reflects a design choice that places primary dealers as centerpieces in intermediation. Crucially, Treasuries fall under a system of oversight that is fragmented and without a lead authority. With reporting data only recently becoming available, and lacking comprehensive coverage, regulators face coordination and information costs when seeking to understand the riskiness of this market and its intermediation.

II.  TREASURIES AND PRIVATE INDUSTRY SELF-REGULATION

In addition to playing an anchoring role in public regulation, Treasuries have become the lynchpin for safeguarding the six-trillion-dollar market for short-term credit between financial firms. Default-free and highly liquid, Treasuries are the choicest type of collateral, capable of being sold rapidly when a borrower cannot pay. In the repurchase market, any number of financial institutions borrow and lend cash/securities to one another to meet their daily funding needs—with Treasuries being the preferred type of collateral. Particularly after the 2008 Financial Crisis, firms have come to depend on Treasuries as collateral in the repo market, with around four trillion dollars dependent on Treasuries to secure debt. Because repo debt is short-term and collateralized using Treasuries, it is “informationally insensitive,” meaning, so safe that due diligence is unnecessary.148Tri Vi Dang, Gary Gorton & Bengt Holmström, The Information View of Financial Crises, 12 Ann. Rev. Fin. Econ. 39, 40 (2020).

This Part describes how Treasuries have become essential to maintaining the system of financial self-regulation that drives over four trillion dollars in daily lending between financial firms. Just as in the secondary market, primary dealers are the major intermediaries in repo operations, performing a variety of functions as lenders, borrowers, and connectors that match clients. In highlighting the centrality of primary dealer intermediation, this Part describes why this task can be challenging and costly in repo markets, with the risk that primary dealers can quickly withdraw and temporarily restrict credit when their job becomes too difficult.149Gary Forton & Andrew Metrick, Securitized Banking and the Run on Repo, 104 J. Fin. Econ. 425, 426–27 (2012) (noting the role of runs in the repo market as a systemic event contributing to the 2008 Financial Crisis); see Daniel K. Tarullo, Banking on Basel 15 (2008); e.g., V.V. Chari & Ravi Jagannathan, Banking Panics, Information, and Rational Expectations Equilibrium, 43 J. Fin. 749 (1988). On post-2008 capital regulation, see, e.g., Section 171, Dodd-Frank Wall Street Reform and Consumer Protection Act of 2010, Pub. L. No. 111-203, § 171, 124 Stat. 1376, 1435–38 (2010). See generally Douglas W. Diamond & Philip H. Dybvig, Bank Runs, Deposit Insurance, and Liquidity, 91 J. Pol. Econ. 401 (1983) (detailing the classic bank run, in which depositors rush to get their money back, putting bank solvency in jeopardy); Basel III Implementation, Fed. Rsrv. Bd. of Governors, https://www.federalreserve.gov/supervisionreg/basel/usimplementation.htm [https://perma.cc/HJF4-79WN]; Cheng & Wessel, supra note 20. These challenges are not easily addressed through the regulatory framework. Unlike the secondary market for Treasuries, the repo market is regulated under a more prudential format, focused on maintaining the safety and soundness of participating firms and transactions. With a far lower emphasis on disclosure, the repo market constitutes an opaque environment by design, obscuring an understanding of its workings and its interlinkages with the Treasuries secondary market.

A.  Private Credit in Financial Markets

The repo market—offering short-term, secured credit—represents a solution to the funding needs of large financial firms. A basic repo transaction works as follows. A Lender (a firm with cash) offers to loan money to a Borrower (a firm needing cash).150Cheng & Wessel, supra note 20. As with any loan, this transaction carries the risk that the Borrower might default. To limit the risk of losses arising from default, the market (1) ensures that the loan is collateralized and (2) keeps the maturity of the loan short (usually overnight, but it can sometimes extend to a month).151Id. The repo market thus represents a market for secured loans. If the Borrower cannot pay back the cash, the Lender can simply sell the securities and recover their money. The collateral that a Borrower provides is in the form of securities, like Treasuries, corporate bonds, mortgage-backed securities, or stock.152Id. Treasuries, unsurprisingly, constitute the most prized form of collateral. In terms of pricing, the value of the collateral exceeds the size of the loan by an amount called the “haircut.” The riskier the collateral, the larger the haircut.153Generally, the haircut reflects the worst-case loss of value of the collateral over the (overnight or longer) life of the loan. Treasuries usually come with a haircut of around 2% for overnight borrowing.154Grace Xing Hu, Jun Pan & Jiang Wang, Tri-Party Repo Pricing, 56 J. Fin. & Quantitative Analysis 337, 345 (2021). A Borrower looking for a $100 overnight cash loan will need to provide the Lender with $102 in Treasuries.155See, e.g., id.

As noted above, the maturity of loans is short, usually overnight. But loans are routinely rolled over.156See Cheng & Wessel, supra note 20. This means that the loan is refreshed as it comes due: the Borrower can keep using the cash, while the Lender keeps the collateral. The short maturity structure gives Lenders the ability to control their exposure. If the Lender senses trouble, it can ask for the loan to be paid back, or it can choose to ask for more collateral to reflect any additional risk posed by the transaction. If the Borrower cannot repay, the Lender can sell the Treasuries. Danger arises for the market if the Lender feels unable to continue lending or rolling over existing loans, forcing a number of its borrowers into distress where they must pay up or find additional securities to keep borrowed cash.157Ricks, supra note 126, at 102–45.

The repo market is mainly divided in two: (1) the bilateral market and (2) the tri-party repo market. Further, repo transactions come in two distinct types: (1) repo trades and (2) reverse repo trades.

In the bilateral repo market, parties transact directly with one another and privately organize their own risk management. By contrast, in the tri-party repo market, the administration and settlement of trades are handled by a third-party firm that intercedes between parties to manage the risk of ensuring the trade executes.158There is also the general collateral finance (“GCF”) repo market, which is an interdealer repo market with the Fixed Income Clearing Corporation (“FICC”) as the central clearing counterparty. In the tri-party repo market, J.P. Morgan and Bank of New York Mellon have historically facilitated clearing and settlement. In tri-party repo, the transactions tend to be secured by a wider range of assets. Because collateral categories are broader, the tri-party repo market is not always helpful for dealers to source specific securities but can be useful as a place to park cash short-term. See Viktoria Baklanova, Cecilia Caglio, Marco Cipriani & Adam Copeland, Off. of Fin. Rsch., The U.S. Bilateral Repo Market: Lessons from a New Survey 2 (2016) (discussing Treasuries trading implications more fully); Baklanova et al., supra note 62, at 5–7; Cheng & Wessel, supra note 20; see also Kahn & Olson, supra note 30, at 4–6 (detailing the composition of the cleared repo market).

The classification of whether a repo transaction represents a repo or reverse repo depends on whether the dealer is borrowing or lending cash in the transaction.159Repo and Reverse Repo Agreements, Fed. Rsrv. Bank of N.Y., https://www.newyorkfed.org/markets/domestic-market-operations/monetary-policy-implementation/repo-reverse-repo-agreements [https://perma.cc/D8ZY-LB9E]. In a repo, the dealer is borrowing cash (and providing collateral). In a reverse repo, the dealer is lending cash (and receiving collateral).160Cheng & Wessel, supra note 20. Figure 1.A shows the size of the bilateral repo market over the period 2013–2023.161See infra Appendix Figure 1.A. There are 2 points to note: (1) the daily size of the bilateral repo market, comprising both repo and reverse repos, has varied from $3.9 trillion to $5.1 trillion over the last ten years and (2) on average, around 75% of the collateral in the bilateral repo segment comprises Treasuries, meaning that Treasury securities valued at about $3 trillion to $4 trillion remain locked up as “passive” collateral to prevent default, thereby reducing the “float” readily available with dealers for secondary market trading.162SIFMA Rsch., supra note 26, at 6. Figure 1.B shows the size of tri-party repo market over the past decade.163See infra Appendix Figure 1.B. For example, in December 2021, the daily size of the tri-party repo market stood at around $3.7 trillion, around $2.5 trillion of which was backed by Treasuries.164Fed. Rsrv. Bank of N.Y., supra note 26. Given the higher risk of direct trading, the bilateral repo market uses Treasuries as collateral in a much larger proportion of its transactions.

The feature that gives the repurchase (repo) market its name describes the legal arrangements that underlie how a typical repo works. The transaction is effectively structured as a sale and repurchase of securities even though it is, for all intents and purposes, a loan. The Borrower (seeking cash) sells its securities to the Lender (for the cash) with the promise to buy them back at a pre-agreed price the next day (or whenever the deal matures). The pre-agreed repurchase price represents the amount of the loan alongside an additional slice of compensation.165SIFMA Rsch., supra note 26, at 3–4. Because the Lender legally owns the securities, it can sell them if the Borrower defaults. The Bankruptcy Code specifically allows repo Lenders to sell collateralized securities even if the Borrower files for bankruptcy. Ordinarily, without such protection, Lenders would be stopped from doing so by the Code’s automatic stay on enforcement actions.166There are a variety of safe harbors for different kinds of financial contracts. Repurchase agreements are defined under 11 U.S.C. § 101(47) (2022). Under the Code, lenders are restricted by the application of the automatic stay, 11 U.S.C. § 362(a) (2022). See also 11 U.S.C. § 547 (2022) (ensuring that preferences are scrutinized); 11 U.S.C. § 365(e)(1) (2012) (stating that so-called ipso facto clauses are unenforceable. These clauses automatically create a condition of default by the fact of the debtor’s bankruptcy filing). For lender protection, lenders might seek out ways to lift the stay or claim adequate protection under § 362(d)(1) (2022). Because repos are allowed to be closed out in the event of a borrower’s bankruptcy filing, repo lenders do not have to face the costs and consequences entailed by seeking adequate protection or looking to lift the stay.

Scholars observe that repo markets tend to function as a “bank-like” system for financial firms.167Gordon & Metrick, supra note 33, at 433; Ricks, supra note 126, at 40. Those that have cash can earn money by lending it and taking collateral. In turn, those that need cash but have securities can offer their securities as collateral in return for access to cash. Repo markets reflect the reality that financial firms are unique. They cannot park millions and billions of dollars in a bank account. Those with cash want this money to generate some return, rather than having it languish unproductively. On the other side, some firms’ asset base focuses on holding securities rather than cash (for example, if they invest heavily in securities as underwriters).168See Manmohan Singh, Collateral Velocity is Rebounding, Fin. Times (May 21, 2019), https://www.ft.com/content/2cc138a5-df70-3b62-9bd5-6fdf76ecac38 [https://perma.cc/9F65-SJXT] (noting the ability of collateral that is pledged by bank clients to be reused as collateral by the bank). The repo market unlocks value by allowing those that need cash to borrow it on a short-term and secured basis, while permitting those with cash to be able to lend it out and make money from this transaction.169Cheng & Wessel, supra note 20.

Secondly and relatedly, repo markets enable financial firms to use leverage if their business model and balance sheet can support it. For example, suppose a firm has 100 Treasuries, each valued at $100. The typical haircut for Treasuries is 2%. The firm can borrow $9,800 against this collateral. It can then use these funds to buy 98 Treasuries, following which it can again use the 98 Treasuries as collateral to borrow $9,600. It can then use these funds to buy 96 Treasuries and so on, potentially achieving up to 50 times leverage.

Equally, the same Treasuries may be used in multiple transactions. Repo markets permit dealers to take Treasuries belonging to a client and to use these as collateral for their own purposes in the repo market.170Often, this can happen in perfectly normal course. For example, the dealer could be functioning as a pure intermediary routing a loan from a municipal corporation to a hedge fund but with each party transacting through the dealer. The dealer will receive a security from the hedge fund as collateral for the loan, and at the same time, pledge the same collateral to the municipal corporation. But there may often not be a one-to-one correspondence, and the ratio of collateral used by the dealer to that available could be greater than 100%. A hedge fund with 100 Treasuries can entrust a Dealer with their safekeeping. Rather than simply let these assets be unproductive in an account, the Dealer and hedge fund agree that the Dealer can use the Treasuries for the Dealer’s private credit needs. In return, the Dealer can offer the hedge fund a line of credit on cheaper terms than what might otherwise have been possible. Because the Dealer can control the Treasuries, it can use them as collateral to borrow funds from a Lender. The Lender, too, can take these same Treasuries and reuse them for more borrowing.171This arrangement describes a prime brokerage agreement in which the Dealer offers hedge funds and other clients a range of services such as a line of credit on cash and securities, trade execution, and so on. A client’s assets are agreed to be pledged with the Dealer as prime broker with the express agreement that the Dealer can reuse this collateral for its own account. For more detail, see, e.g., Richard Comotto, Repo, Re-Use and Re-Hypothecation, ICMA Centre (Dec. 14, 2013), https://icmacentre.blog/2013/12/14/repo-re-use-and-re-hypothecation [https://perma.cc/Q4DS-AE7V]. According to Infante et al., by dint of contractual agreements, primary dealers are generally permitted to reuse the vast majority of the Treasury collateral that they hold for clients. Studies suggest that as much as 85% of all Treasuries collateral may be subject to reuse.172Sebastian Infante, Charles Press & Jacob Strauss, The Ins and Outs of Collateral Re-Use, Bd. of Governors of the Fed. Rsrv. Sys.: FEDS Notes (Dec. 21, 2018), https://www.federalreserve.gov/econres/notes/feds-notes/ins-and-outs-of-collateral-re-use-20181221.html [https://perma.cc/69JZ-9DAZ].

Reusing collateral has a number of important benefits for both dealers and their clients. It means that dealers can provide cheaper intermediation across the board. Rather than having to buy Treasuries and spend cash to do so, a dealer can request permission from its client to borrow their securities. In turn, the client also receives cheaper services.173Sebastian Infante, Liquidity Windfalls: The Consequences of Repo Rehypothecation, 133 J. Fin. Econ. 42, 43 (2019) (detailing that primary dealers can generate gains for themselves by using their intermediary status to negotiate varying terms with varying counterparties and benefiting from differing haircuts). Reuse also means that the market unlocks maximum value from a Treasury. Rather than only be used once in one trade, reuse can extract economic gains when Treasuries are used across multiple transactions.174See Hyejin Park & Charles M. Kahn, Collateral, Rehypothecation, and Efficiency, 39 J. Fin. Intermediation 34, 34 (2019). So long as parties can repay, reuse can help lower the costs of intermediation and accessing repo credit affordably.175John Dizard, The Horror Scenario Lurking in the Plumbing of Finance, Fin. Times (July 23, 2021), https://www.ft.com/content/a0482f69-be5c-4d92-ae59-17a8e2b2cdde [https://perma.cc/FD3D-6PZX] (noting the importance of reusing Treasuries in unlocking credit for financial firms).

But reusing collateral can also be dangerous.176Dealers do have legal restrictions in their ability to reuse collateral, as stipulated by the Federal Reserve’s Regulation T and the Exchange Act Rule 15c3–3, limiting dealers from exceeding 140% of the a client’s balance for the dealer’s proprietary activities. 17 C.F.R. § 240.15c3-3 (2023); 12 C.F.R. § 220 (2023); see Manmohan Singh & James Aitken, The (Sizable) Role of Rehypothecation in the Shadow Banking System 3–5 (Int’l Monetary Fund, Working Paper No. WP/10/172, 2010). While profitable in good times, collateral reuse can amplify distress in crisis. It undermines the assumption that repo markets provide fully secured lending. If the Hedge Fund client demands its Treasuries back, the Dealer faces a problem—as do others along the collateral chain. The Dealer must immediately source the Treasuries to return to the Hedge Fund.177See Manmohan Singh, Senior Economist, Int’l Monetary Fund, Presentation to Brookings Institution, Understanding the Role of Collateral in the Financial System (Feb. 23, 2015), https://www.brookings.edu/wp-content/uploads/2015/02/20150223_collateral_markets_transcript.pdf [https://perma.cc/L733-NJ8V]. If the Dealer has used Treasuries to borrow cash, it must find cash to pay its Lender back and recover the Treasuries. Where the Dealer has failed (like Lehman Brothers), the Hedge Fund can find itself caught up in long legal proceedings to recover the assets.178See generally Manmohan Singh & James Aitken, Deleveraging After Lehman—Evidence from Reduced Rehypothecation (Int’l Monetary Fund, Working Paper No. WP/09/42, 2009).

The fragility of collateral chains was made clear around September 16, 2019, when rates to borrow cash in the repo market spiked, climbing to almost 10% from about 2% in the week prior.179Long, supra note 33; see also Cheng & Wessel, supra note 20. In seeking to understand why, a number of commentators pointed to concerns about the quality of the collateralization—and whether collateral chains of reused Treasuries could be counted on as watertight. Manmohan Singh of the International Monetary Fund estimated that, in the 2018 Treasury repo market, around three separate actors believed that they were entitled to the very same Treasury security.180See Long, supra note 33; see also Singh, supra note 168. This imputed a reuse rate of 2.2. That is, in addition to the actual owner (for example, the Hedge Fund above), 2.2 further firms considered themselves entitled to the Treasury collateral.181Singh, supra note 168; Long, supra note 33. See generally Liz Capo McCormick & Alex Harris, The Repo Market’s a Mess. (What’s the Repo Market?), Bloomberg (Dec. 17, 2019, 9:22 PM), https://www.bloomberg.com/news/articles/2019-09-19/the-repo-market-s-a-mess-what-s-the-repo-market-quicktake [https://web.archive.org/web/20240717071025/https://www.bloomberg.com/news/articles/2019-09-19/the-repo-market-s-a-mess-what-s-the-repo-market-quicktake]. Owing to uncertainties about whether lending was really fully collateralized, primary dealers and others became wary of parting with cash, despite the promise of an almost 10% interest rate on offer that day.182As discussed later, commentators also suggested that post-Crisis regulatory reforms imposed prudential requirements that reduced the ability and incentives of large banks to lend cash. See, for discussion, McCormick & Harris, supra note 181; Long, supra note 33. As this episode makes clear, even though a Treasury can be reused multiple times as collateral, it can only be sold once to cover exposure. In an informationally opaque environment, in which parties do not know if they might be the one caught without viable collateral, it makes sense for primary dealers to stop intermediation and withdraw from the market.

B.  Intermediation in the Repo Market

Primary dealers are the key intermediaries in the Treasuries-backed repo market. According to Copeland et al., primary dealers appear to intermediate around 80% of the bilateral repo market.183Copeland et al., supra note 55. In addition to serving the financing needs of clients, primary dealers also use the repo markets to secure funding for themselves.184See Marco Arnone & Piero Ugolini, Primary Dealers in Government Securities 16–17 (2004) (describing the importance of financial capacity). See generally Viral V. Acharya, Michael J. Fleming, Warren B. Hrung & Asani Sarkar, Dealer Financial Conditions and Lender-of-Last-Resort Facilities, 123 J. Fin. Econ. 81 (2016) (noting the importance of access to the Fed’s emergency lending facilities in the run-up to the 2008 Financial Crisis to preserve continuity in dealer market making); Press Release, Bd. of Governors of the Fed. Rsrv. Sys., Federal Reserve Board Announces Establishment of a Primary Dealer Credit Facility (PDCF) to Support the Credit Needs of Households and Businesses (Mar. 17, 2020), https://www.federalreserve.gov/newsevents/pressreleases/monetary20200317b.htm [https://perma.cc/WJZ4-T6AA] (discussing the Fed’s 2020 emergency facilities); Primary Dealer Credit Facility (2008), Fed. Rsrv. Bank of N.Y., https://www.newyorkfed.org/markets/pdcf.html [https://perma.cc/8LNF-T85A] (discussing the earlier 2008 Primary Dealer support facility). On the essential role of dealer balance sheets for maintaining high-quality intermediation in U.S. Treasury markets, see generally, e.g., Duffie et al., supra note 56.

As intermediaries, primary dealers are tasked with fulfilling a number of functions in the repo market. At the most basic level, they match borrowers with lenders. When one client has cash (for example, a mutual fund), while another needs it (for example, a bank), the primary dealer connects both parties and facilitates the repo transaction (for a price). A more engaged role involves the primary dealer acting as one side of the repo trade for a client, either as borrower or as lender. When doing so, the primary dealer deploys the power of its balance sheet to take risk directly on its books.185Cheng & Wessel, supra note 20.

Primary dealers have unique advantages when it comes to intermediating the Treasuries-backed repo market. Beyond access to government auctions, they also possess positional dominance. For one, as connected nodes in financial markets, with access to networks of global clients, primary dealers represent trusted repositories for client cash and securities. In return to offering services and expertise, dealers acquire access to vast amounts of client securities and cash that can be reused for repo operations. Because the repo market enables collateral reuse, dealers can subsidize the costs of intermediation.

In addition, this central position affords primary dealers some informational advantages. They are well-placed to identify, connect, and transact with counterparties. Primary dealers are likely to have knowledge about which kinds of firms tend to hold sufficient cash (for example, mutual funds) to lend, and who has enough securities to be able to borrow. Repeat relationships can help build trust, deepen knowledge about the client’s financials, and allow the primary dealer to more precisely price the terms of repo debt. Connections with multiple clients can permit primary dealers to fulfill orders for larger volumes of Treasuries/cash in which a dealer can tap and pool assets across clients. Crucially, this ability to tap into a sprawling network of resources can strengthen the financial system because its key intermediaries are positioned to supply liquidity in an elastic way.186See, e.g., Mathias S. Kruttli, Phillip J. Monin, Lubomir Petrasek & Sumudu W. Watugala, Hedge Fund Treasury Trading and Funding Fragility: Evidence from the COVID-19 Crisis 3 (Fed. Rsrv. Bd., Wash. D.C., Working Paper No. 2021-038, 2021) (noting that the largest dealers (G-SIBs) provided 11–13% higher repo funding to hedge funds during the March 2020 crisis).

Finally, primary dealers are active throughout financial markets and supply liquidity in a variety of assets like equities and corporate bonds.187Hendrik Bessembinder, William Maxwell & Kumar Venkataraman, Market Transparency, Liquidity Externalities, and Institutional Trading Costs in Corporate Bonds, 82 J. Fin. Econ. 251, 262 (2006) (highlighting dealer inventory management in corporate bonds). See generally Paul Schultz, Inventory Management by Corporate Bond Dealers (May 11, 2017) (unpublished manuscript), https://papers.ssrn.com/sol3/papers.cfm?abstract_id=2966919 [https://perma.cc/7M3F-XZKR]; Jaewon Choi, Yesol Huh & Sean Seunghun Shin, Customer Liquidity Provision: Implications for Corporate Bond Transaction Costs, 70 Mgmt. Sci. 187 (2024) (discussing the practice of prearranging trades). This broad-based participation in capital markets puts primary dealers in a strong position to use their experience and expertise to better predict demand for repo funding. For example, by being active suppliers of liquidity to the corporate bond market, primary dealers are likely to have a detailed understanding of who the key buyers of bond issues are likely to be (for example, insurance firms or mutual funds). This can provide special insight into possible future pockets of demand for Treasuries/cash collateral in which investors might need short-term cash to purchase a sizable volume of bonds.

C.  Regulating the Repo Market

Despite its short-term and collateralized nature, the repo market is criticized for its structural instability and the profound risk that it poses for the financial system.188See, e.g., Diamond & Dybvig, supra note 149, at 401–04 (discussing banks runs); e.g., Ricks, supra note 126, at 103–40. Scholars argue that the repo market suffers from a similar vulnerability to banks: the chance that it suffers a run in which lenders are frightened enough to recall their short-term debt en masse. According to Gary Gorton and Andrew Metrick, a major catalyst for the 2008 Financial Crisis came from a run in one part of the repo market, making it impossible or expensive for financial firms to continue funding themselves.189See generally Gorton & Metrick, supra note 33. In Gordon and Metrick’s study, the collateral underlying the repo loans was largely comprised of mortgage-backed securities that plunged in value.190Gorton & Metrick, supra note 33, at 430. Even where underlying securities were not as risky, the fear that they could be and that repo borrowers would be unable to repay ramped up what lenders were charging or caused them to call in their loans.191Gorton & Metrick, supra note 33, at 1 (analyzing the bilateral repo market); Saguato, supra note 29, at 116–18. As shown by Copeland et al., the increased margin signaling a run in the repo market was largely confined to the bilateral repo market. In the tri-party repo market, in which repos trading is intermediated by clearing and risk management, such sharp increases in margin did not take place. Adam Copeland, Antoine Martin & Michael Walker, Fed. Rsrv. Bank N.Y., Repo Runs: Evidence from the Tri-Party Repo Market 2–4 (2014). See generally Ricks, supra note 126.

Unlike banks, the repo market lacks preventative structural safeguards, like deposit insurance, that could mitigate the risk of a run.192Gorton & Metrick, supra note 33, at 426–27; see Gary Gorton, Slapped in the Face by the Invisible Hand: Banking and the Panic of 2007 2–4 (2009) (analyzing the role of banking panics and arguing that the repo markets were similarly vulnerable). See generally Baklanova et al., supra note 62; Zoltan Pozsar, Tobias Adrian, Adam Ashcraft & Hayley Boesky, Fed. Rsrv. Bank of N.Y., Shadow Banking (2010). Instead, after 2008, it largely relies on Treasuries collateralization to assure parties that they will be repaid.193Peter Madigan, The Meteoric Rise of Treasuries, BNY Mellon (Sept. 2019), https://www.bnymellon.com/us/en/insights/aerial-view-magazine/the-meteoric-rise-of-treasuries.html [https://web.archive.org/web/20240601183605/https://www.bnymellon.com/us/en/insights/aerial-view-magazine/the-meteoric-rise-of-treasuries.html].

This focus on collateralization reflects the prudential approach that characterizes the regulation of the repo market. Broadly, a slew of capital rules for financial institutions take account of a primary dealer’s repo participation to calculate how much capital it needs.194Cheng & Wessel, supra note 20 (highlighting the discussion around whether the liquidity-coverage ratio can impact repo market regulation).

As detailed in Part I, banks must maintain a supply of highly liquid assets that can help them to remain solvent in the event of a sudden cash drain. The mandate that major financial institutions buffer themselves up with a thick reserve of highly liquid assets reflects the lessons learned in 2008 that underscored the potential for a liquidity crunch, as exemplified by the repo market’s failure.195Yankov, supra note 130.

However, their obvious utility and importance notwithstanding, these liquidity rules have also been blamed by some commentators for amplifying the instability in repo operations. In September 2019, when cash in the repo market seemed to run dry, no bank came forward to take advantage of what would have been a lucrative opportunity to lend (with a rate of 10% on offer). According to some commentators, post-2008 liquidity rules meant that banks did not wish to lend cash because they preferred to maintain high cash reserves and meet their compliance requirements. Importantly, the Fed pays interest on the cash reserves that it holds in its accounts for banks. If these interest payments are sufficiently high, banks might hesitate before using the cash for repo lending.196Cheng & Wessel, supra note 20; see Interest on Reserve Balances, Bd. of Governors of the Fed. Rsrv. Sys., https://www.federalreserve.gov/monetarypolicy/reqresbalances.htm [https://perma.cc/D5WQ-SGTN]. See generally 12 C.F.R § 204 (2023). As Joshua Younger et al. observe, large banks were not short of cash in mid-September 2019.197Joshua Younger, Ryan J. Lessing, Munier Salem & Henry St. John, J.P. Morgan, What Is Preventing the Banks from Policing the Repo Market? 2 (2019). They held around $700 billion in cash reserves—far in excess of what was required of them under law.198Id. On paper at least, they should have been able to direct some of these funds into the repo market and ease the costs of lending. This suggests that dealers preferred to prioritize keeping a thick liquidity buffer, accruing interest in their account with the Fed and waiting out the uncertainty, rather than actively deploying their balance sheet to alleviate it.199Id. (noting that stress testing may favor reliance on cash reserves rather than Treasuries as a way of showing their ability to withstand extreme crisis). But see Kruttli et al., supra note 186, at 18 (showing that G-SIBs were providing higher levels of repo funding to hedge funds during the March 2020 crisis).

In addition to liquidity ratios, primary dealer banks can also become subject to a “capital surcharge” over and above the basic capital buffer that banks maintain—resulting in banks seeking out ways to avoid the full force of paying this extra cost. Under post-2008 rules, the surcharge kicks in when a bank is deemed to be large and systemic.20012 C.F.R. § 217.403 (2015). It should be noted that U.S. regulators are discussing potential reforms to bank capital regimes that may work to increase bank capital buffers, focusing on larger banking firms. A full discussion is outside the scope of this Article. For an outline of proposed reforms, see, e.g., David Wessel, What Is Bank Capital? What Is the Basel III Endgame?, Brookings (Mar. 7, 2024), https://www.brookings.edu/articles/what-is-bank-capital-what-is-the-basel-iii-endgame [https://perma.cc/B4QZ-397V]. The greater the size and interconnectedness of a dealer bank, the greater the likelihood that it faces a higher charge.201See Bank for Int’l Settlements, Basel Comm. on Bank Supervision, The G-SIB Assessment Methodology – Score Calculation 1–2 (2014) (setting out the factors that determine the intensity of a bank’s interconnectedness and systemic size); Wayne Passmore & Alexander H. von Hafften, Are Basel’s Capital Surcharges for Global Systemically Important Banks Too Small?, Bd. of Governors of the Fed. Rsrv. Sys.: FEDS Notes (Feb. 27, 2017), https://www.federalreserve.gov/econresdata/notes/feds-notes/2017/are-basels-capital-surcharges-for-global-systemically-important-banks-too-small-20170223.html [https://perma.cc/9MSP-8UQD] (analyzing and critiquing the BCBS’s methodology for calculating the surcharge). From the standpoint of policy, this systemic surcharge serves the purpose of ensuring the financial markets are better girded against the possibility that a large bank fails because this bank should have a deeper buffer from which to cover its losses. However, as an unintended consequence, it can motivate dealer banks to suddenly reduce the depth of their repo intermediation in order to avoid becoming sufficiently large and interconnected to become subject to a higher capital charge.202Cheng & Wessel, supra note 20.

Industry analysts report that large, systemically important banks routinely seek out ways to show a reduced footprint when it comes time for regulators to assign scores for the purposes of the surcharge.203See generally Joseph Abate, Barclays, GSIB Score: Repo Diet (2019). These assessments usually take place at year-end, meaning that large dealer banks may be incentivized to reduce their repo activities at the end of the year. Under this regime, banks have incentives to reduce their systemic activities to just below the threshold at which a higher charge would apply. Because activities in the repo market constitute one signal of a bank’s interconnectedness, heavily limiting the depth of its involvement can help a bank to reduce the capital surcharge it faces.204See generally id. Further, owing to the short-term nature of repo lending, dealers can precisely time their retraction to quickly closeout repo transactions before being rated.205See generally Adam Freedman & Francisco Covas, The GSIB Surcharge and Repo Markets, Bank Pol’y Inst. (Nov. 26, 2019), https://bpi.com/the-gsib-surcharge-and-repo-markets [https://perma.cc/RRB4-7XHZ]. This kind of behavior—while perhaps rational for any single dealer—clearly poses problems for the market as a whole. Repo markets can experience a broad fall in the intensity of intermediation around times when dealers are to be assessed for a surcharge.206Id. Even if the market can predict that such an eventuality will occur, episodic illiquidity can still create periods of fragility in which firms struggle to get the repo loan they need at an acceptable price. Indeed, even the fact of anticipating such liquidity-draining milestones can constitute a self-fulfilling prophecy. Firms may be less willing to come forward with their cash and Treasuries to trade assuming likely frictions in the market. Whenever dealers decide to scale down their intermediation, even if temporarily, it can introduce a broad slowdown in the flow of credit across the financial system.

In summary, private self-regulation in financial markets looks to Treasuries to protect firms and the system against default. Private lending between firms in the repo market relies on primary dealers for intermediation. Despite being collateralized using Treasuries, however, the repo market suffers from built-in risks. It is opaque by design. Reuse of Treasuries collateral creates a source of instability in crisis. Short-term financing can dry up quickly. Dealers are free to withdraw or reduce intermediation. That being said, with around four trillion dollars in daily lending backstopped by Treasuries, the financial system is entrenched in its belief that Treasuries constitute the protective safe asset to anchor private industry self-regulation.

III.   THE FALSE PROMISE OF TREASURIES

This Part argues that systematic reliance on Treasuries in public and private financial regulation is internally in tension. First, we show that the secondary market and the repo market are inextricably linked and interdependent such that loss of function in one market can affect the other. Secondly, primary dealer intermediation, underpinning both markets, is subject to a slew of costs and problems. Opacity is pervasive. This prevents primary dealers from gaining a full picture of the risks. This increases the challenge facing primary dealers to navigate the internal conflict between the repo and the secondary market operations. With trillions of dollars in Treasuries and cash collateral locked-in to support the repo market, the secondary market for trading Treasuries can face a shortfall, especially during crises. In seeking to resolve this tension, primary dealers are likely to favor intermediating in the market in which they will gain the most, economically and reputationally. Alternatively, if neither market provides lucrative gains, primary dealers will rationally have every reason to withdraw intermediation altogether.

Additionally, this Part observes that the regulatory system is ill-placed to recognize the risks of an interconnected repo and secondary market for Treasuries. The Treasury market is overseen by a panoply of agencies. Their approaches to oversight diverge. Cooperation costs are built-in to a fragmentated system of oversight.207Yadav, supra note 19, at 1173. Regulatory incapacity increases the dangers of risky intermediation for Treasury market fragility and casts further doubt on the ability of Treasuries to function as safe assets in public and private financial regulation.

A.  Opacity, Information Costs, and Monitoring

Opacity in the repo market and the secondary market adds systematic information costs to primary dealer intermediation. A lack of full and real-time information limits monitoring and makes it harder for primary dealers to anticipate demand on their balance sheets. If the costs of opacity become too much, primary dealers may limit or withdraw intermediation to one or both markets.208McCormick & Harris, supra note 181; Saguato, supra note 29, at 113–15 (discussing repo market opacity). See generally Long, supra note 33.

Primary dealers face uncertainties from a number of sources in intermediating both Treasury secondary trading and repo markets. First, both markets are home to a diverse set of users whose needs for cash and securities can vary unexpectedly. The repo market exemplifies this vulnerability.

Taking data from the periodic reports that primary dealers provide to the NY Fed, it becomes clear that Treasuries’ exposure of primary dealers is much greater in the repo markets relative to the Treasury secondary market.209The NY Fed provides data on primary dealers that is updated weekly. This data includes the overall positions and transactions of primary dealers in the Treasury secondary market, and their positions in repos and reverse repos, collateralized not only by Treasuries, but also by other assets. Figures 2, 3.A, 3.B, and 4 are based on data on primary dealers from January 2016 to December 2023. See infra Appendix Figures 2, 3.A, 3.B & 4. We utilize these figures in our discussion in this Part and use this data to develop the charts. Primary Dealer Statistics, Fed. Rsrv. Bank of N.Y., https://www.newyorkfed.org/markets/counterparties/primary-dealers-statistics [https://perma.cc/HJ5B-GTT3]. Figure 2 shows the average daily primary dealer exposure to outstanding repos and reverse repos over the period from 2016 to 2023.210Figure 2 relates to the positions of primary dealers in repo markets. It plots several attributes of interest relating to the exposure of primary dealers as a group, each measured in billions of dollars: (1) the weekly outstanding repo and reverse repo positions collateralized by Treasuries and (2) the weekly outstanding repo and reverse repo positions collateralized by all assets. The difference between the repo and reverse repos held by primary dealers measures effectively the net borrowing from the repo market of primary dealers as a group. See infra Appendix Figure 2. Similarly, Figures 3.A and 3.B show, respectively, the average daily primary dealer inventory exposure in the secondary market, and the average daily trading volume in the secondary market.211Figures 3.A and 3.B relate to activities of primary dealers in the Treasury secondary market. See infra Appendix Figures 3.A & 3.B. Figure 3.A plots for the eight-year period the net inventory exposure of primary dealers in Treasuries, and also their net inventory exposure to all assets. Similarly, Figure 3.B plots the transaction volume of primary dealers in Treasuries, and also their transaction volume across all assets. All figures are in billions of dollars. The data underlying the charts show that the average daily collateral exposure of primary dealers in the Treasury-backed repo market averaged $1.91 trillion in 2020 and $1.84 trillion in 2021. In contrast, the average net primary dealer exposure to the Treasuries’ secondary market totaled only around $244 billion in 2020 and $159 billion in 2021. Even the average daily trading volume of primary dealers in the Treasury secondary market—that includes both buying and selling by primary dealers—was only around $500 billion in both 2020 and 2021.

Importantly, the repo market also exerts large unpredictable demands on dealer balance sheets. By contrast, the secondary market is steadier. The data underlying Figure 2 shows that the average daily collateral exposure of primary dealers in the Treasury-backed repo market varies wildly and unpredictably from 1 week to the next. In 2020, this week-to-week difference varied from under $1 billion to $385 billion during the year and averaged around $72 billion. The average daily collateral used by primary dealers for the Treasury-backed repo market also continued to vary substantially from one week to the next in 2021, with weekly changes exceeding $100 billion about 20% of the time. Further, these position changes in the repo market are not correlated from week-to-week, reflecting constantly shifting market-wide needs.212See generally Narayan Y. Naik & Pradeep K. Yadav, Risk Management with Derivatives by Dealers and Market Quality in Government Bond Markets, 58 J. Fin. 1873 (2003) (showing greater predictability in government bond dealer positions in the United Kingdom).

By contrast, the secondary market is much less dramatic. Using data underlying Figure 3.A, it saw average weekly variation in primary dealer exposures of only around $17 billion in both 2020 and 2021, with maximum weekly variation of only $46 billion. These figures reflect a consistent pattern over time. Figure 4 shows the sizable extent by which changes in primary dealer exposures to the repo market dominate changes in primary dealer exposures to the Treasury secondary market between 2016–2023.213Figure 4 encompasses both the Treasury secondary market and the repo market. It plots weekly changes in the total outstanding repo and reverse repo Treasury collateral of primary dealers against the weekly changes in the exposure of primary dealers to Treasuries due to their market-making role in Treasury secondary markets. See infra Appendix Figure 4.

Relatedly, primary dealers are also subject to the vagaries of how other dealers use Treasuries and cash in repo intermediation. Primary dealers do not intermediate all bilateral repo transactions. Copeland et al. estimate that around 20% of transactions are not intermediated by primary dealers.214Copeland et al., supra note 55. While primary dealers occupy an outsize and influential position, their perch only allows them a partial view. Consequently, primary dealers confront blind spots about how these other firms use collateral and cash. This can create further difficulties for primary dealers in seeking to estimate the Treasuries and cash required to sustain repo operations as well as Treasury secondary markets.215Id. For example, hedge funds have emerged as active and intriguing players in repo markets. According to one 2021 study, hedge funds have doubled their exposure to Treasuries to $2.4 trillion between 2018–2020, holding around $1.4 trillion in Treasuries in mid-2019, compared to the largest banks that held only around $524 billion at that time.216Alexandra Scaggs, Hedge Funds Now Dominate the Treasury Market. They Failed Their First Test., Barron’s (May 22, 2021), https://www.barrons.com/articles/suspect-behind-recent-treasury-market-dysfunction-highly-leveraged-hedge-funds-51621625376 [https://perma.cc/M9CU-BYRC]. In addition to using the repo market to borrow to fund themselves, hedge funds have also taken over some of the trading and liquidity supplying functions traditionally performed by major dealers.217Id.

Opacity attaching both to the repo market and to hedge funds has precluded a clear understanding of what kinds of risks hedge funds pose for dealers. For example, hedge funds are well-known for taking on debt to pursue trading strategies.218Id. Hedge fund participation raises the danger that leveraged funds become a credit risk for dealers that fund them. In addition, owing to their smaller balance sheets, hedge funds may need to sell Treasuries and also pull back quickly from providing liquidity to the rest of the market.219See generally Kruttli et al., supra note 186. The March 2020 crisis has shone a spotlight on the potentially destabilizing role of hedge funds in Treasuries.220Scaggs, supra note 216. See generally Kruttli et al., supra note 186; Barth & Kahn, supra note 46. It also highlights the pervasive danger for dealers from corners of the market they are less familiar with yet whose activities can nevertheless dramatically impact their own.

Second, primary dealers face difficulties from their participation as intermediaries in capital markets more broadly. Crucially, primary dealers are active liquidity suppliers to risky assets.221See generally Jonathan Brogaard & Yesha Yadav, The Broken Bond Market (Vanderbilt Law Research Paper No. 21-43, 2022), https://papers.ssrn.com/sol3/papers.cfm?abstract_id=3941941 [https://perma.cc/2EKG-DGR4] (discussing the extensive role of dealers in corporate bond markets generally). Figure 3.A shows that about one-third of net primary dealer exposure has been in risky non-Treasury securities. Figure 3.B shows that primary dealer trading volume in non-Treasuries is comparable to that in Treasuries, such that risks arising out of trading in non-Treasury assets are similar in volume to those arising from Treasuries.

Further, Figure 2 shows that the exposure of primary dealers to non-Treasuries collateral in repos and reverse repos is about 20% to 25%. Non-Treasury collateral generally comprises mortgage and asset-backed securities.222Supply shocks to primary dealers’ cash flows can also arise from episodic volatility in the short-term borrowing and lending rates implied by repo transactions. These directly impact the costs and risks they face in managing inventory for market making. As Gary Gorton and Andrew Metrick observe, these securities pose especially high risks where parties rush to liquidate their loans in fear of the collateral losing value quickly.223Gorton & Metrick, supra note 33, at 426. See Saguato, supra note 29, at 106–07, 116–18 (noting the risk of runs from low-quality securities in 2008). This non-Treasury segment can heighten the risk of a repo run and force primary dealers to limit their intermediation across the board, including in Treasuries repo and secondary markets.

Third, information costs are exacerbated by difficulties in the ability of primary dealers to understand whether the Treasuries collateral they hold is actually viable. In other words, can this collateral be traced and readily sold for cash? Lengthy collateral chains, formed by reusing Treasuries multiple times, muddy understanding of whether this collateral is available in a default. Information sources are scant. Primary dealers do not have to report data on their use of client collateral in their weekly disclosures to the NY Fed.224Singh, supra note 177; see Bd. of Governors of the Fed. Rsrv. Sys., Reporting Guidelines for Preparing the FR 2004 Primary Government Securities Dealers Reports 12, https://www.federalreserve.gov/reportforms/forms/FR_200420130331_i.pdf [https://perma.cc/SC3F-WSUH]. This means that they do not need to tell the NY Fed about how they use collateral that clients entrust to them in safekeeping. This leaves firms (and regulators) to guesstimate exposures.225See generally Off. of Fin. Rsch., supra note 37; Infante et al., supra note 172.

The perceived safety of the Treasuries-backed repo—and the difficulty of mapping out collateral chains—can act as a disincentive for primary dealers and others to invest in information gathering. Even if they do go to the trouble of mapping out the risk, the constantly evolving nature of repo market exposures means that this map of underlying collateral chains can change quickly.

Information and monitoring costs in the repo and secondary market for Treasuries contribute to imperfections in intermediation.226To be sure, opacity is improving in certain parts, notably, the cleared repo markets and efforts by regulators to gather more data since around 2019. See Kahn & Olson, supra note 30, at 1–2. Higher costs in understanding and pricing risk can result in intermediation becoming more selective, expensive, and governed by private interests at a cost to the public good. In response to these costs, primary dealers may temporarily cut off credit in repo, including to one another. Without the ability to fund themselves using repo operations (for example, to borrow cash), they may also withdraw from the secondary market and stop buying and selling Treasuries to investors. Critically, incomplete transparency in repo and the secondary markets can shield primary dealers that withdraw intermediation. They may be quicker to leave the market whenever they see fit owing to the impossibility of being publicly identified as the dealer that stopped supplying liquidity.

B.  Conflict Between Public and Private Regulation

Primary dealers also face a conflict when intermediating cash and Treasuries between the repo and secondary markets, particularly during crisis. Supporting the health of one market (for example, maintaining systemic stability in repo) can come at the expense of the other (providing liquidity to the secondary market).

This task of accomplishing successful intermediation across both markets is complicated by a number of factors. Primary dealers must decide how they allocate the “free float” of Treasuries available to them. They must examine the volume of Treasuries that is freely available and decide how much of this float should be allocated between the secondary and repo markets.

This calculus reveals the depth of the internal conflict at play between the secondary and repo market. Critically, the free float of Treasuries is reduced by the large amounts of cash and securities that are “locked-in” in the repo market.227Lam et al., supra note 45, at 55–57; Ding et al., supra note 45, at 237–38; see, e.g., Kuan-Hui Lee, The World Price of Liquidity Risk, 99 J. Fin. Econ. 136, 138 (2011) (discussing the relevance of the free float of securities to liquidity and pricing). This means that large volumes of this float become passively captured as collateral in the repo market.

Specifically, during 2020, the daily average Treasuries transaction volume in the secondary market was around $603 billion. Compared to this, the dollar volume of primary dealer Treasury collateral in repos during 2020 was $1.9 trillion, and the dollar volume of such collateral in reverse repos was $1.7 trillion. Taken together, without reuse, Treasuries valued at a daily average of about $3.6 trillion were captured as passive collateral in the repo contracts of primary dealers in just the bilateral repo market. This is about twelve times the average exposure of primary dealers, and six times their daily Treasury trading volume in the secondary market. This inference is not specific to 2020. Rather as seen in Figures 2, 3.A, and 3.B, these trends are persistent. High amounts of captured Treasuries float—necessary to repo operations—drastically reduce the volume of Treasuries that are available for trades in the secondary market.228This data is estimated from information submitted to the NY Fed on Form FR 2004 by primary dealers. Fed. Rsrv. Bank of N.Y., supra note 209.

These restrictions create difficulties for intermediation by primary dealers. Captured repo collateral imposes rigidity on primary dealers that reduces how fully they can supply liquidity to the secondary market during crisis. The secondary market can experience sudden and unexpected pressure from investors to perform. For example, total aggregate trading in the Treasury secondary market in the weeks of March 6, 2020, and March 13, 2020, came to around $5.7 and $4.9 trillion, respectively—an especially turbulent 2 weeks during which the Treasury market essentially stalled.229Trade Reporting and Compliance Engine (TRACE), FINRA, https://www.finra.org/filing-reporting/trace/data/trace-treasury-aggregates [https://perma.cc/GKR8-BZMM] (choose “LOAD MORE”; then choose “Week of March 2, 2020” and “Week of March 9, 2020). By contrast, as secondary trading activity normalized, it started seeing approximately half the volume by summer 2020.230Id. (choose “LOAD MORE”; then choose “Week of July 27, 2020”).

Trillions in passive repo collateral create a source of fragility for the secondary market, generating logistical and financial difficulties for primary dealers: (1) dealers might have to constantly warehouse a reserve of Treasuries to support Treasury secondary trading during periods of high demand or (2) they can risk having to buy Treasuries during a crisis in order to meet demand. The first option creates costs because dealers have to allocate capital for buying and maintaining Treasuries supply, limiting profits from intermediation. In the second case, they run the risk that Treasuries become costlier to source, potentially resulting in reduced margins for their business. A third option always remains on the table: to reduce intermediation and avoid the need to source expensive Treasuries during difficult periods. This trade-off requires primary dealers to balance their private interests with public ones. When Treasuries’ float is limited and demand in secondary trading exceeds existing reserves, the incentives of dealers to remain committed to Treasuries trading diminishes.

The coexistence of the secondary and repo markets—both intermediated by primary dealers—thus reveals real structural tensions. The growth of repo lending, demanding higher volumes of Treasuries float, can result in a corresponding decrease in securities available to lubricate the secondary market. This trade-off creates a complex problem for policy. If repo lending represents a desirable and efficient form of funding for financial institutions, ensuring it is done safely is of paramount concern. At the same time, a growing reliance on repo operations for financial institutions results in fragilities for the secondary market and the regulation that depends on it.

Moreover, as intermediaries for both markets, primary dealers have incentives to prioritize the needs of one market over another depending on private preferences. This favoring of one market over another can be motivated by a number of reasons. For example, primary dealers may see larger profits, lower risk, and reputational gains from ensuring the continuity of repo lending than from supplying expensive liquidity to the secondary market. Repo markets are far larger and primary dealers earn fees for matching counterparties.231See Copeland et al. supra note 55. It is one in which primary dealers dominate and have repeat relationships with major clients. Moreover, dealers are themselves beholden to the repo market for their own financing needs. For dealers, then, there is a lot to gain from seeing a growth in the size of the repo financing market—even if this means periodic retreats from the more competitive secondary market for Treasuries in which primary dealers have lost ground to high-speed electronic traders.232See Yadav, supra note 19, at 1207–15.

The consequences of how primary dealers navigate this trade-off is a critical matter for public policy. If primary dealers are motivated to step away from intermediating in secondary markets, their actions call into question the view that Treasuries trade in a market that is deeply and constantly liquid. Rather, it points to one that is chronically vulnerable to the private preferences of its key intermediaries who are unlikely to continue offering resilient tradability at a cost to themselves. More broadly, for public and private financial regulation to remain credible, its intermediation must be able to deliver continued lucrative profit to its major dealers.

C.  A Unique Configuration of Unknown Unknowns

As detailed above, primary dealers navigate a tricky and costly task in intermediating across both the repo and secondary market for Treasuries. Opacity limits the ability of a dealer to build a real-time understanding of the activity in the repo market external to that dealer—most importantly, where Treasuries collateral is located, and whether it can be captured and sold in an emergency. Motivation to monitor is low. The secondary market’s limited historic reporting also reduces sight of pockets of disruptive trading—and the arrival of greater competition between primary dealers and newer high-speed automated traders lowers the attractiveness of the secondary market as a place to do vibrant business. Importantly, intermediation represents a source of conflict. Dealers are caught between preserving trillions in passive collateral in the repo market to maintain its safety and soundness—and using cash and Treasuries to supply liquidity to the secondary market. Especially if collateral reuse results in uncertainty about the quality of collateral, primary dealers have every incentive to ring-fence the Treasuries and cash they have for repo operations, even if demand in secondary markets is spiking. Critically, despite dependence on the services of primary dealers to preserve intermediation, they can withdraw from both spaces whenever costs and uncertainties become too high.233Scaggs, supra note 32.

This combination of risks represents an unprecedented set of problems for intermediation in the repo and secondary markets. It leaves dealers and policymakers facing many unknowns.

First, as shown in Part I, the repo market has grown its reliance on Treasuries collateral sharply following the 2008 Financial Crisis as a way to privately regulate short-term credit between firms. Whereas an earlier era looked to a variety of riskier assets like mortgage-backed securities, the last decade has observed a marked shift in the direction of Treasuries as favored collateral.234Gorton & Metrick, supra note 33, at 430 (noting the reliance on mortgage-backed securities as collateral in repo markets). With Treasuries now collateralizing around $4 trillion of repo debt, concerns about locking-in and ring-fencing collateral carry special salience given the potential for catastrophic damage to financial stability if this collateral reserve becomes unstable.

Second, post-Crisis regulation also puts special weight on Treasuries as a mandatory asset for capital buffers. As discussed in Part I, Treasuries are particularly important for post-Crisis public regulation, with financial institutions required to maintain deep buffers of high-quality liquid assets. This signal reliance by public regulation raises the stakes for primary dealers to ensure the secondary market is supplied with trading opportunities for those firms that need to liquidate their Treasuries in a financial crunch or to buy Treasuries when a safe asset is needed. As detailed by Vissing-Jorgensen, mutual funds sold more Treasuries in 2020 than they did after the 2008 Financial Crisis, owing to the thicker reserves of Treasuries they held coming into 2020.235Vissing-Jorgensen, supra note 15, at 24–25.

On the other side, detailed in Part II, post-Crisis reforms also impose these liquidity requirements and capital surcharges on primary dealers themselves and necessitate compliance with regulations that protect these firms from becoming too big to fail. According to some commentators and scholars, these rules can also make primary dealers more hesitant to supply liquidity to the repo and the secondary market, depleting reserves of cash and Treasuries, and falling out of compliance. Importantly, scholars also note that a stricter compliance environment following post-Crisis reforms has reduced primary dealer motivation to support intermediation—and opened the door for other firms to enter the fray. While a broader trend across debt markets, commentators note that non–primary dealers (for example, hedge funds) have stepped into the breach to supply liquidity more actively.236See Kruttli et al., supra note 186, at 1–2. The increased participation of hedge funds in the U.S. Treasury liquidity supply has heightened the stakes for regulators and hedge funds looking to challenge the SEC’s February 2024 rulemaking. For hedge funds, the ability to remain outside the reporting perimeter has arguably allowed greater strategic flexibility for firms to determine how they might deploy Treasuries intermediation as a trading technique. For discussion, see, e.g., Duguid, supra note 43; Barbuscia, supra note 43. If this trend continues, primary dealers could face more information gaps arising from the activities of those that are new to the market as well as greater competition that diminishes their profits from intermediation.

Third, as outlined in Part II, the secondary market for Treasuries has experienced radical shifts, as primary dealers have lost ground to high-speed traders in the interdealer segment. According to one study of the major interdealer trading platform, BrokerTec, eight out of the top ten traders on the venue came from the ranks of HFT firms, rather than primary dealers. Primary dealers have continued to dominate the dealer-client segment. But this rapid waning of their professional power shows that that the secondary market has become more crowded with new entrants, and the incentives of primary dealers to take on costs to keep trading are quickly becoming weaker in the face of competition.237See Yadav, supra note 19, at 1208–15.

Putting these factors together, Treasuries intermediation has newly evolved into an especially complex, contradictory, and costly prospect for primary dealers, policymakers, and regulation. It is also foundational to financial markets and their stability post-2008. This coming together of regulatory need and operational complexity in managing intermediation requires that regulators be equipped to spot and address the risks at the heart of a straining Treasury market structure. As argued below, this is far from the case given the highly fragmented state of current regulatory design.

D.  A Breakdown in Regulation

The regulatory framework to oversee the repo and secondary market for Treasuries is ill-equipped to respond to the vulnerabilities underlying their market structure. Supervisory approaches for repo and Treasuries markets are divided between a “securities” model on the one hand (for secondary trading) and a prudential one on the other (for repo). This leaves regulators unable to develop a consolidated approach to oversight that recognizes the interdependence between the repo financing market that relies on Treasuries collateral and secondary trading that needs Treasuries to be capable of being bought and sold to realize their value quickly, cheaply, and at fair prices.

The regulatory framework for secondary trading in Treasuries is institutionally fragmented without any overarching coordination mechanism to guide rulemaking and supervision.238See id. As detailed in Part I, unlike equities markets that, for example, fall primarily within the jurisdiction of a single regulator (the SEC), Treasuries lack a single lead overseer. Oversight is shared between at least five major bodies: the U.S. Treasury, NY Fed, the Fed, the SEC, and CFTC. FINRA also oversees securities broker-dealers and is instrumental in data collection from reporting firms after 2017.239Id. at 1193–99, 1219–22 (detailing the framework for regulating Treasuries under the Government Securities Act of 1986 and analyzing the implications); see also Jerry W. Markham, Regulating the U.S. Treasury Market, 100 Marq. L. Rev. 185, 199–208 (2016).

 Fragmentation raises serious concerns in the context of an interconnected, internally conflicted repo and secondary trading market.240See Yadav, supra note 19, at 1193–99. First, information sharing becomes hobbled by institutional barriers and bureaucratic divergences in how information is collected and analyzed.241Id. at 1219–22 (noting the effects of bureaucratic divisions). Consider the so-called “Flash Rally” in the Treasury secondary market. On October 15, 2014, the Treasury secondary market experienced around thirty minutes of aberrant, anomalous trading at the start of the trading day, characterized by prices surging to some of their highest historic levels for inexplicable reasons.242U.S. Dep’t of the Treasury et al., supra note 39, at 15–19. Eventually, prices reverted to normal but not without first sending capital markets into chaos and confusion.243Id. Regulators undertook a thoroughgoing, yearlong joint investigation into the event’s possible causes and implications.244Id. at 1–2. While the final report did not unearth any smoking gun, the investigation itself was illuminating. During the inquest, commentators singled out the legal and logistical difficulties experienced by different agencies in collecting and sharing information with one another.245Ryan Tracy & Andrew Ackerman, The New Bond Market: Regulators Scramble to Keep Up, Wall St. J. (Sept. 23, 2015, 8:02 PM), https://www.wsj.com/articles/the-new-bond-market-the-u-s-treasury-struggles-to-keep-up-1443027850 [https://perma.cc/DM37-FA8W]; U.S. Dep’t of the Treasury et al., supra note 39, at 15–20. The CFTC, for example, required time to conclude an information-sharing agreement in order to forward its data to other regulators.246Tracy & Ackerman, supra note 245. The report further revealed that regulators lacked information to such a degree that they were shockingly unaware of major transformations underway in the Treasury market, specifically, the shift to high-speed, automated trading from a primary-dealer dominated, more analog interdealer market.247U.S. Dep’t of the Treasury et al., supra note 39, at 15–19.

In other words, fragmentation points to serious institutional challenges for regulators seeking to understand the interconnected machinations of the repo and secondary markets.248For detailed discussion, see Yadav, supra note 19, at 1219–22. Agencies may not feel comfortable or be permitted to share information on those they supervise.249Tracy & Ackerman, supra note 245. The 2017 trade reporting regime, instituted in the wake of the Flash Rally, requires that covered securities firms report their data to FINRA.250Harkrader & Puglia, supra note 39 (stating that FINRA-regulated broker-dealers are required to report their trades to FINRA, excluding hedge funds). Banks, on the other hand, have to provide reports of their trading to banking regulators, suggestive of the especially sensitive nature of bank exposures.251Trade Reporting and Compliance Engine (TRACE), FINRA, https://www.finra.org/filing-reporting/trace [https://perma.cc/NC8G-KTWQ]; Harkrader & Puglia, supra note 39. To harmonize the process, FINRA and the Fed have engaged in a yearslong dialogue on coordination of data collection, under which FINRA could acquire bank-reported data as an agent of the Fed.252See Press Release, Bd. of Governors of the Fed. Rsrv. Sys., Federal Reserve Board Announces Plans to Enter Negotiations with FINRA to Potentially Act as Collection Agent of U.S. Treasury Securities Secondary Market Transactions Data (Oct. 21, 2016), https://www.federalreserve.gov/newsevents/pressreleases/other20161021a.htm [https://perma.cc/BQ2W-WQ68].

Different regulatory regimes mean that regulators each have varying amounts of information on those they supervise.253For detailed discussion, see Yadav, supra note 19, at 1219–22. Whereas primary dealers banks and broker-dealers are overseen by various dedicated banking and securities regulators, like the Fed, the SEC, and FINRA, other major participants like hedge funds are regulated on a much looser basis.254Kruttli et al., supra note 186, at 1–2 (noting that hedge funds are much less regulated than broker-dealers and provide fewer disclosures). By design, hedge funds fall under a lighter touch, more opaque regulatory regime with fewer disclosures.255Id. To be sure, post-2008 rulemaking does extend the regulatory perimeter to cover their activities more fully than before. Notably, bigger hedge funds must provide disclosures on various types of exposures in financial markets in a bid to help regulators map out their systemic footprint.256Nabil Sabki & Nadia Sager, Five Lessons for Form PF, Prac. Compliance & Risk Mgmt. for Sec. Indus., July–Aug. 2013, at 35, 35 (highlighting information that must be disclosed and its purposes). But the intensity of their overall regulatory scrutiny is generally far less intense than that faced by banks, broker-dealers, or mutual funds.257Kruttli et al., supra note 186, at 1–2; FINRA, supra note 251; Harkrader & Puglia, supra note 39. For detailed discussion, see Yadav, supra note 19, at 1219–22; Novick et al., supra note 144, at 8–10. Importantly, within the Treasuries market, hedge funds have generally fallen outside of the reporting obligation for their secondary market trades because they do not fall under the category of broker-dealers or banks.258Novick et al., supra note 144, at 7–8 (noting the negative impact of hedge fund non-reporting in Treasury markets). This is expected to change, at least for the most active hedge fund Treasuries traders, as the SEC’s new registration and reporting rules take effect. Given the enormity of their exposure to Treasuries and the potential scale and impact of their activities, their historic exclusion from reporting has left regulators without a valuable and essential repository of data. In addition to hedge funds, many high-speed automated traders also do not qualify as FINRA broker-dealers—though again, this may change for more active players as the SEC rulemaking takes effect.259Harkrader & Puglia, supra note 39; Yadav, supra note 19, at 1219–22. The erstwhile regulatory regime has nevertheless allowed many such firms to avoid direct reporting of trades in the interdealer market. It is worth highlighting that even though the SEC has taken steps to bridge these gaps by passing new rules to encompass hedge funds and HFT firms within a registration and reporting regime, their chances of future success appear uncertain in view of industry resistance to reforms and potentially drawn-out court challenges.260Duguid, supra note 43; Barbuscia, supra note 43. A 2019 supplement to the reporting regime requires Treasuries trading platforms to identify traders in records.261Yadav, supra note 19, at 1197. As such, regulators can get information on particular securities firms should they need it.262Id. at 1219–22; Harkrader & Puglia, supra note 39. However, reporting to regulators is not direct—and they must absorb costs to get data on an ex post basis.263Yadav, supra note 19, at 1219–22.

Gaps in information and roadblocks to cooperation have limited the ability of regulators to share insights on the major risks to Treasury secondary and repo markets. And fragmentation in regulatory design and pockets of opacity are essentially fatal to the enterprise of constructing a picture of the vulnerabilities affecting intermediation and developing ex ante constraints to control the risks.264For detailed analysis and background, see id. at 1219–22.

In addition to fragmentation, Treasury repo and secondary markets also operate under systems of oversight that diverge in their methodological approaches. As detailed in Part II, the Fed and the NY Fed represent the prudential end of the regulatory spectrum. Focusing on safety and soundness, a prudential approach ensures preservation of systemic safety and soundness as its critical mission. This can mean less emphasis on disclosure and transparency, for example, and more on ensuring that markets remain insulated from the risk of sudden runs and default on credit.265See About the Fed, Bd. of Governors of the Fed. Rsrv. Sys., https://www.federalreserve.gov/aboutthefed.htm [https://perma.cc/KY59-9HUQ] (“The Federal Reserve . . . promotes the stability of the financial system and seeks to minimize and contain systemic risks through active monitoring and engagement in the U.S. and abroad; promotes the safety and soundness of individual financial institutions and monitors their impact on the financial system as a whole.”). By contrast, the SEC and FINRA represent quintessential securities markets regulators, offering deft expertise in building efficient and transparent trading markets and protecting investors.266What We Do, U.S. Sec. & Exch. Comm’n, https://www.sec.gov/Article/whatwedo.html [https://perma.cc/R353-R77U] (“[O]ur mission . . . [is] protecting investors, maintaining fair, orderly, and efficient markets, and facilitating capital formation.”); What We Do, FINRA, https://www.finra.org/about/what-we-do [https://perma.cc/34RG-H869]. Instead of financial stability as the core guiding mission, securities market regulators nurture trading markets, underpinned by the dissemination of information, efficient price formation, and capital allocation.267U.S. Sec. & Exch. Comm’n, supra note 266. To be clear, these are generalizations. The SEC, for example, also focuses on financial stability (e.g., by regulating the stability of money market funds that constitute an essential part of the repo market).268Money Market Funds, U.S. Sec. & Exch. Comm’n, https://www.sec.gov/spotlight/money-market.shtml [https://perma.cc/HZ7F-XGEJ]. The Fed regularly engages with securities markets to ensure that the infrastructure, such as exchanges, is protected against collapse.269Colleen Baker, The Federal Reserve’s Supporting Role Behind Dodd-Frank’s Clearinghouse Reforms, Harv. Bus. L. Rev. Online 177, 178–80 (2013), https://www.hblr.org//wp-content/uploads/sites/18/2013/04/Baker_The-Federal-Reserves-Supporting-Role.pdf [https://perma.cc/LD7U-9G3X] (detailing the financial and supervisory support that the Federal Reserve provides to securities clearinghouses). These generalizations, however, aide in understanding key differences between the purposes and approaches of regulators tasked with overseeing secondary and repo markets for Treasuries.

This divergence can explain why regulators have failed to connect the shared risks facing Treasury repo and secondary markets, and to oversee both in a more consolidated way. Neither the prudential nor the securities-based model neatly fits the secondary or the repo market. For a start, the interdealer secondary market—a fairly classic securities market with heavy and liquid daily turnover—holds enormous systemic implications for the economy. If this market stops working, like in March 2020, a swath of economic actors cannot meet critical prudential needs. Concretely, reliance by public regulation on Treasuries’ liquidity (e.g., HQLA) ties the proper functioning of the interdealer market to the prudential survival of any number of financial firms and the larger system.

Yet the regulatory methods used to oversee interdealer Treasuries trading fit neither a prudential nor a capital markets paradigm and leave risks exposed. Trade-by-trade reporting is of recent vintage (2017)—and only for regulators. Public reporting is limited—with data released only in aggregate form. This reticence to widely disclose potentially sensitive Treasuries trades recognizes the systemic quality of the market. However, other regulatory aspects undermine this focus on curbing systemic risks. Perhaps most importantly, lightly regulated actors are afforded ample latitude to trade in secondary markets without having to report their activities. Hedge funds, especially, are a case in point. But high-speed securities trading firms are another. Now firmly dominant in the interdealer market, such high-speed trading firms have not fallen within the regulatory regime for broker-dealers. Therefore, they have not been subject to reporting rules (but see above for anticipated changes in response to new regulatory measures).270See generally Harkrader & Puglia, supra note 39. Crucially, they have typically also been able to skirt other measures designed to address prudential risks—notably, capital requirements on broker-dealer firms that require safekeeping of rainy-day assets.271Elad L. Roisman, Comm’r, U.S. Sec. & Exch. Comm’n, Remarks at U.S. Treasury Market Conference (Sept. 29, 2020), https://www.sec.gov/news/speech/roisman-us-treasury-conference-2020-09-29 [https://perma.cc/6LAN-PVD8]. This leads to a possibility that highly influential traders have been transacting with only a thin base of capital, making them sensitive to losses and liable to exit rather than continue supplying liquidity especially during crisis.272Id.

Similarly, the regulatory strategy for overseeing repos fails to account for the complex dynamic between Treasuries repo and the secondary market. As detailed in Part II, repo markets are overseen through a decidedly prudential lens. Capital buffers help safeguard against runs and collapse.273See discussion and sources cited supra Section II.C. Collateral plays a pivotal role in reducing default risk.274See discussion and sources cited supra Section II.C. Because of this collateral and the fear of runs, real-time detailed disclosure is limited.275See discussion and sources cited supra Section II.C. Yet despite this focus on safety and soundness, the workings of the repo market fail to account for the role of the secondary market in maintaining the repo market’s smooth workings.276See discussion and sources cited supra Section II.C. This interconnection exists for a number of reasons: (1) if secondary markets experience illiquidity, Treasuries’ prices can become unstable and distorted, impacting the viability of Treasuries as collateral; (2) repo lenders that wish to liquidate Treasuries will find themselves unable to do so in an illiquid secondary market; and (3) if primary dealers cannot buy and sell Treasuries in secondary markets, they may lack the ability to source cash and securities to fulfill repo lending. As seen in March 2020, for example, firms selling Treasuries en masse caused secondary trading to stall and badly disrupted securities prices.277See discussion and sources cited supra note 1. With the value of Treasuries directly tied to the viability of firm liquidity buffers, a lack of attention to the securities market undermines the functioning of the prudential one.

In summary, this Part shows that public and private regulation’s reliance on Treasuries is subject to a number of failures arising from a flawed system of intermediation. We show that the Treasury-backed repo and secondary trading markets are connected by a common intermediary: the primary dealer. In entrusting maintenance of the trading and repo markets to primary dealers, public and private regulation has failed to account for a number of costs that mean liquidity in both markets becomes tenuous. Primary dealers incur information and monitoring costs, navigate conflict between the needs of the repo versus the secondary market, and attend to their own private business preferences. These challenges are particularly dangerous owing to the needs of a financial regulatory system that puts Treasuries at the center both in public oversight and private self-regulation. In its second contribution, this Part argues that the regulatory framework for the repo and secondary markets is fragmented, inadequate, and insufficiently adaptive to provide consolidated supervision of a connected set of markets. The result is a Treasury market that is relied on for its resilience, but one whose foundations are poorly understood and subject to rapid erosion.

IV.  PATHWAYS TO STABILITY

This Article shows that the Treasury market suffers from fragilities in intermediation that makes it unstable and unreliable, casting doubt on the assumption used by regulation to place Treasuries at the center of financial stability. To begin remedying the structural deficiencies identified in this Article, we outline three proposals. Our focus lies in enabling public and private actors to strengthen the quality of liquidity and improve their understanding of the market’s risks ex ante.278Manmohan Singh, Collateral Reuse and Balance Sheet Space 12 (Int’l Mon. Fund, Working Paper No. WP/17/113, 2017) (highlighting the pressure on dealer balance sheets to absorb repo market exposures—and the impact of regulations on balance sheet capacity). We recognize that if the Treasury market fails—like it did in March 2020 and in September 2019—it will be a near certain recipient of ex post federal emergency assistance. Indeed, in January 2022, regulators announced the creation of a permanent standing facility to lend securities and cash to repo market participants when the need arises.279Gara Afonso, Lorie Logan, Antoine Martin, William Riordan & Patricia Zobel, The Fed’s Latest Tool: A Standing Repo Facility, Fed. Rsrv. Bank of N.Y.: Liberty St. Econ. (Jan. 13, 2022), https://libertystreeteconomics.newyorkfed.org/2022/01/the-feds-latest-tool-a-standing-repo-facility [https://perma.cc/5U3Q-95K7]. Our focus is on taking first steps to develop strong ex ante mechanisms to improve information flows, enhance liquidity, and ensure that primary dealers are well supervised even within a highly fragmented regulatory framework. We suggest (1) developing greater transparency and information sharing in repo and Treasuries trading markets, (2) encouraging major liquidity suppliers—both primary dealers and key HFT traders—to invest in maintaining the liquidity of the market, even in times of distress, and (3) bringing greater consolidation and coordination to the regulatory framework, and requiring regulators to link supervision of the Treasury secondary markets and the Treasuries-backed repo markets more systematically.280In December 2023, the SEC approved the introduction of a mandate for central clearing for Treasuries trades in both secondary and repo markets. This mandate imposed a requirement on firms that are members of a clearinghouse to subject their Treasuries trades to risk management by a central clearinghouse. Nonmembers (for example, hedge funds and HFTs) would not be required to centrally clear their trades if they only transact with one another. At least in theory, such a mandate can improve data collection and risk mapping within U.S. Treasuries trading. Nevertheless, gaps remain, for example for trades executed between nonmembers of a clearinghouse. In addition, this proposal is far away from implementation. Its scale is ambitious, and it is unclear how the implementation process may impact how effectively a clearinghouse may resolve concerns surrounding opacity and risk management for U.S. Treasuries and collateralization. A full discussion of this proposal is outside the scope of this Article but will be addressed in further scholarship by the authors. For an outline and discussion of the clearing rule, see, e.g., Press Release, U.S. Sec. & Exch. Comm’n, SEC Adopts Rules to Improve Risk Management in Clearance and Settlement and Facilitate Additional Central Clearing for the U.S. Treasury Market (Dec. 13, 2023), https://www.sec.gov/news/press-release/2023-247 [https://perma.cc/T657-SCRU]. See also U.S. SEC Adopts Rules Requiring Central Clearing in the U.S. Treasury Market, Sidley (Dec. 21, 2023), https://www.sidley.com/en/insights/newsupdates/2023/12/us-sec-adopts-rules-requiring-central-clearing-in-the-us-treasury-market [https://perma.cc/RB5H-27BT].

A.  Transparency and Prudential Safety

As detailed in Part III, information gaps are endemic within the repo and secondary markets for participants as well as regulators. These gaps obscure an understanding of how repo and secondary trading intersect and what risks are created by dint of this connection. Reform must begin by developing reliable mechanisms for improving transparency and information flows as a first step toward empowering regulators and market participants.

Information gaps are deeply embedded throughout the Treasury market, in both the secondary trading and repo market. Reforms in 2017 and 2019 have brought reporting to secondary markets. But it is limited by significant gaps in coverage (for example, excluding hedge funds and high-speed securities firms). Public, real-time transparency is restricted. Repo markets, opaque by nature, lack systematic, up-to-date reporting.281Infante et al., supra note 172. This allows private parties to avoid thorough due diligence. But it is far from obvious that it is protective in all cases. Collateral reuse creates opaque chains that instill a potentially false sense of confidence in which multiple parties all count on owning a single security. Further, opacity has costs even if transparency also comes with downsides. Market participants may overreact during crises, lacking information, and not knowing with which firms the problems lie.282Long, supra note 33. Opacity also constrains how flexibly primary dealers manage inventories, respond to the behavior of a variety of clients as well as unknown dealers that are also active in supplying liquidity.

As a first matter, we propose increasing the information and scope of reporting available to regulators and its participants.283Id. This applies to both the repo as well as the secondary market. For the repo market, this represents a paradigm shift in approach. However, we believe that it offers a much-needed lever for those in the market to take steps to assess supply and demand of Treasuries/cash more precisely. It also lowers the cost of public surveillance. Regulators remain stymied in their ability to capture real-time data on repo exposures, particularly for bilateral exposures. As Victoria Baklanova writes, supervisors are left to the grind of painstaking and patchy data collection practices that require them to piece together information from weekly or quarterly mandatory disclosures, on-the-ground examinations, or informal reporting by financial firms.284Viktoria Baklanova, Off. of Fin Rsch. Brief Series 15-03, Repo and Securities Lending: Improving Transparency with Better Data 3–6 (2015), https://www.financialresearch.gov/briefs/files/OFRbr-2015-03-repo-sec-lending.pdf [https://perma.cc/EB7P-WCFF]. Such data collection is costly, quickly out-of-date, and imposes analytical costs on account of its lack of standardization and comprehensive coverage.285Id. To be sure, since late 2019, this situation is improving. Regulators have ramped-up data collection in cleared segments of the repo market. In May 2024, the Office of Financial Research approved a new rule to enhance reporting and data collection in the bilateral repo market.286Press Release, Office of Financial Research, OFR Adopts Final Rule for Data Collection of Non-Centrally Cleared Bilateral Transactions in the U.S. Repurchase Agreement Market (May 6, 2024), https://www.financialresearch.gov/press-releases/2024/05/06/ofr-adopts-final-rule-for-data-collection [https://perma.cc/X74D-HNZ8]. But market-wide, real-time data gathering remains elusive for now, and the outcome of future efforts remains uncertain.287On data gathering efforts, see generally Kahn & Olson, supra note 30.

Reporting in this market has a number of benefits. It requires dealers to develop mechanisms to record their repo trades on a real-time, granular basis ex ante, to track the collateral that attaches to a particular repo, and to determine whether collateral attaching to it might be subject to reuse. Reporting can help create systematization in relation to capturing exposures and discipline about understanding the robustness of the collateral. In addition, it can create incentives for primary dealers to be more diligent with respect to understanding how collateral is sourced, whether it might be subject to reuse, how many times, and what the risks of such reuse might be during a period of distress. Importantly, we believe that such information ought to be shared regularly between regulators and the primary dealers (at least) as a group. Key intermediaries ought to develop mechanisms whereby they circulate insights about their repo exposures to one another on a regular basis with the goal of understanding collective exposures, the robustness of collateralization, and the potential market availability of cash and securities in case of need. This allows market participants to share emerging concerns, prepare for problems, and for regulators to also be ready to deal with the consequences of fallout.

Invariably, there will be pushback on a proposal to create transparency in prudential spaces. It goes against the grain of conventional wisdom in regulating prudential risks.288See, e.g., Infante et al., supra note 172. But, despite attachment to the status quo, regulators have begun to soften their stance on keeping utmost secrecy in banking and prudential areas. For example, in banking, regulators are now increasingly revealing some of the results of bank stress tests.289Daniel K. Tarullo, Are We Seeing the Demise of Stress Testing?, Brookings: Up Front (June 25, 2020), https://www.brookings.edu/blog/up-front/2020/06/25/stress-testing [https://perma.cc/2Y5L-R8H4] (highlighting the tension between transparency and opacity in bank stress test reporting). Even in the repo market, some public reporting has emerged for its cleared segments.290Off. of Fin. Rsch., OFR U.S. Repo Market Data Release Methodology for DVP Cleared Repo (2021), https://www.financialresearch.gov/data/files/2021-04–Methodology-DVP.pdf [https://perma.cc/8UUS-6ZHW]. While far from full transparency, this easing of traditional fetters against disclosure in banking can hint at potential openness to real-time reporting and information sharing. In addition, regulators and market participants might also balk at the cost of enabling Treasury market transparency given the interconnected complexities of the repo market and its daily size. There is also the ever-present concern that too much disclosure could result in triggering the exact externalities that everyone seeks to avoid—a run that results in a catastrophic drain on the market’s liquidity and forces regulators to have to step in and stop the bleed.

Nevertheless, such concerns are not insurmountable, and while downsides exist, the costs embedded in the status quo are also high. Importantly, the repo market is not hermetically sealed. It does allow for some pockets of reporting (albeit not to the public). In particular, the tri-party repo market—that relies on a formal system of clearing and settlement—allows for greater reporting, collateral tracking, and unraveling of the complexity inherent in trades.291Baklanova, supra note 284, at 3–6. Stated differently, the market is amenable to systematization if parties so choose. In addition, the costs of recording trades, and tracking and reporting collateral, should not be prohibitively daunting. Primary dealers and others already do risk management as individual firms, though not on a standardized basis.292Id. Notably, regulators routinely look to dealers self-reporting their activities as a means of gaining insights about the market. Surveys are commonplace as part of public efforts to study the ins-and-outs of the bilateral marketplace from those that inhabit it most closely.293See, e.g., id.; Infante et al., supra note 172. Moreover, a real-time data repository for the bilateral repo market would save both market participants and regulators from having to perform expensive data collection, analysis, and extrapolation of the possible state of the market on a given day. Rather than guesstimates, parties could rely on a more standard and reliable reserve of information from which to understand an already complex market.

Perhaps most importantly, the centrality of Treasuries to stability means that opacity presents an incalculably high cost in which the market suffers on account of being poorly understood and inadequately protected. As made clear in March 2020, the dislocation in the market cast a pall of doubt among market participants about the resilience of Treasuries during a global crisis.294Smith & Wigglesworth, supra note 106. Seen from this perspective, failure to understand the market and its dynamics carries not just financial costs, but also implies larger damage from the standpoint of political economy. Finally, to avoid the potential for sudden runs (transparency, it should be noted, may also avoid runs if dealers and others better understand their exposures), data circulation around the market may be staggered and delayed.295This is the case, for example, for data published on cleared repos. OFR U.S. Repo Markets Data Release Information, Off. of Fin. Rsch., https://www.financialresearch.gov/short-term-funding-monitor/datasets/repo [https://perma.cc/9LYM-59QU]. For example, the repository would provide data to a closed loop of recipients (potentially the major dealers) and do so with a delay (perhaps circulating information at intervals during the day, or perhaps at the end of each day). In other words, while transparency and reporting may appear daunting at first glance, there are ways of structuring it that can allow for some aggregation and promote a careful, calibrated approach to information consumption, collation, and analysis.

B.  Increasing Resilience in Intermediation

A lack of information can fuel a race to the exit by intermediaries, resulting in liquidity draining quickly and causing distress for investors as well as firms needing to fund themselves. More reporting can provide clarity to dealers when it comes to pricing their risks. But it leaves open the possibility that they exit the market at even small signs of trouble. To ensure dealer engagement in maintaining Treasury market resilience, we suggest exploring tools to incentivize market makers to assume an affirmatively active role during periods of crisis—especially in the secondary market. As highlighted by the events of March 2020, the secondary market can face enormous strain during crisis as investors rush in to transact in Treasuries. Resilience here—when the market does not buckle under stress—helps ensure that Treasuries can perform their regulatory role as safety buffers. For the secondary market, such a duty would cover both primary dealers as well as high-speed security firms. Both types of dealers are vulnerable to exiting the market rapidly, causing a decline in available liquidity and distortion in prices.296Cheng et al., supra note 1; Claire Jones, More ‘Money’ Treasuries Would Calm Repo Markets, Fin. Times (Feb. 11, 2020), https://www.ft.com/content/a710474b-3ff5-42fc-b9ab-83325e878716 [https://perma.cc/LX6C-JFTM].

An affirmative duty on key dealers to remain trading can help to build more certainty around liquidity provision in the Treasury market. To be sure, regulators face a trade-off in introducing such affirmative duties. Imposing higher transaction costs on traders can discourage them from entering the market or encourage them to pass the costs of liquidity onto investors that use the Treasury market. On the other hand, affirmative duties can be beneficial, especially given the importance of securing liquidity for Treasuries. If dealers can simply leave depending on their own preferences, they will be likely to exit exactly when the Treasury market is most necessary—during a crisis. Investors may come to regard the perception of its fail-safe liquidity as illusory, primed to dry up whenever danger strikes.

Historically, regulators have not required primary dealers to keep the market going in a crisis—perhaps assuming that they would do so anyway. Given their long-assumed dominance, perhaps this assumption could be justified. But it cannot hold now. As detailed in this Article, the Treasury market as a whole is facing new pressures, created by heavy dependence by public and private regulation on its services. In addition, the arrival of high-speed trading firms as well as nontraditional types of repo intermediary (for example, hedge funds) add to the pressures facing primary dealers and can contribute to how they navigate private decisions about continuing to provide liquidity.

An affirmative duty to maintain market liquidity can offer greater certainty that dealers make a real effort to stay, rather than simply exiting the market. Such a duty would require an affirmative obligation on traders to remain, particularly during crises.297Our thanks to Kumar Venkataraman for insights into this proposal. Anand and Venkataraman make the economic case for establishing affirmative market maker obligations in stock markets as a way to prevent volatility and price discontinuity. Amber Anand & Kumar Venkataraman, Market Conditions, Fragility, and the Economics of Market Making, 121 J. Fin. Econ. 327, 348 (2016).

A duty to remain trading—applied to the most active dealers (primary dealers and high-speed securities firms alike)—can help to preserve price continuity and assure investors of resilient liquidity. Most importantly, it can motivate those charged with staying to play a role in monitoring and safeguarding operations through ex ante private oversight. Those that must trade, under a new duty, in all conditions ultimately bear the costs of any fallout from disruptive trading strategies or other traders that are creating outsize risks for others. This duty ought to prompt dealers to pay attention to the quality, sophistication, and reliability of their trading behavior. In this way, a duty to remain changes the trade-off governing the behavior of large Treasuries traders. Faced with the prospect of bearing potentially heavy losses if the market goes awry, taking risks starts to look more costly. Currently, easy exit and light-touch regulation have made taking careless or deliberate risks a low-cost option.298Yadav, supra note 19, at 1227–30.

To be clear, an affirmative duty to remain is not an unlimited one, forcing firms to comply to the point of making themselves insolvent, fighting enormous fires in Treasuries at a cost of their own existence. For example, some nontraditional dealers like HFT traders tend to be smaller and less capitalized.299Roisman, supra note 271. They cannot be expected to deplete their entire balance sheet to remain trading. Bigger bank dealers will have a more intensive obligation owing to their capacity to remain trading longer. That being said, a duty to remain is also likely to result in otherwise thinly capitalized firms to have to develop deeper capital buffers in readiness. Those subject to a duty will be the most active traders. Ensuring that they are better buffered provides assurance that those charged with maintaining Treasuries intermediation have the capacity to do so.

Practically, firms are likely to resist such a duty. If they have to pay large sums to selflessly protect the market, they will rationally demand a large ex ante price from the U.S. Treasury for their commitment. And regulators might consider how they ought to compensate traders that become subject to this duty. For example, one option might be to afford them special access to information on Treasuries trading order flow and repo operations as a way to help them to calibrate their risk. As above, this can lower the costs of monitoring and also help dealers to modulate their supplies of Treasuries and cash for secondary trading.

Importantly, this idea is not new to markets—earlier eras had once demanded that a designated group of traders withstand losses to protect markets in times of stress. Those that earned the designation also enjoyed certain privileges and status as a result.300See generally Lawrence R. Glosten & Paul R. Milgrom, Bid, Ask and Transaction Prices in a Specialist Market with Heterogeneously Informed Traders, 14 J. Fin. Econ. 71 (1985). While such a duty may not be critical in other markets in which liquidity provision is voluntary, introducing it for Treasuries is more than justified given the crucial importance of ensuring trading continuity in Treasuries in crises. During a crisis, affirmative liquidity provisions would clearly provide greater assurance of resilience. It also forces dealers to more fully confront the responsibility that comes with the fact of transacting in securities whose workings possess near existential significance for the global economy, further helping to strengthen market integrity.

C.  Fixing the Breaks in Regulation

Developing better information flows and ensuring the market’s resilience must also be accompanied by reform at the level of public oversight.301Yadav, supra note 19, at 1238–44 (setting out a detailed proposal for consolidation in oversight under the FSOC). This Article advocates for this proposal and also includes greater focus on accounting for the unaddressed subject of the interlinkages between repo and Treasuries trading. This Article points to the need to develop a coordinated and hybrid approach to oversight for the Treasury market that is capable of overcoming tension between different regulatory philosophies (prudential versus market based). Remedying the ill effects of institutional fragmentation is necessary as a condition precedent to more fully understanding how the market works, identifying the risks and producing a set of rules that can mitigate structural vulnerabilities at the intersection of the repo and secondary markets.

As argued in this Article, the need for coordination between regulators takes on urgency in light of the interlinkages connecting Treasury repo and secondary markets. With a common set of intermediaries—the primary dealers—Treasuries-backed repo and the secondary markets depend on one another for each to be able to fulfill its respective mission. A patchwork system of oversight makes little sense within an ecosystem in which the trading of high-speed securities firms impacts the liquidity of collateral propping up the four-trillion-dollar Treasuries-backed repo market; or where the enormous collateral and cash needs of the repo market put the resiliency of the Treasury trading markets in jeopardy. Rules to simply govern one or the other market by itself are not enough. Rather, this Article makes clear that the Treasury market exists as a whole, underpinned by the trading and funding mechanisms working together to deliver what is universally recognized as the lynchpin of the world’s financial order.

A near-term fix to the problem of regulatory fragmentation and ad hocism lies in making FSOC expressly into a coordinating supervisory agency for Treasury and repo markets.302Id. at 1241–43 (proposing the FSOC as a coordinating overseer for the Treasury market). Created by post-2008 rulemaking, the FSOC is designed to create a layer of consolidation over the patchwork of U.S. financial regulators. With the 2008 Financial Crisis showcasing systemic interconnections in financial markets, the FSOC’s creation offers an administrative response to the risks of agencies working just on single parts of an otherwise entangled system.303See generally About FSOC, U.S. Dep’t of the Treasury, https://www.treasury.gov/initiatives/fsoc/about/Pages/default.aspx [https://perma.cc/6YEC-YCK2]. The FSOC has been a controversial overseer since its establishment. For discussion, see Hilary J. Allen, Putting the “Financial Stability” in Financial Stability Oversight Council, 76 Ohio State L.J. 1087, 1090–95 (2015) (noting the propensity for the FSOC to have gaps and breakdowns); Daniel Schwarcz & David Zaring, Regulation by Threat: Dodd-Frank and the Nonbank Problem, 84 U. Chi. L. Rev. 1813, 1851–53 (2017) (highlighting the significance of deterring systemic risk development through the FSOC); Christina Parajon Skinner, Regulating Nonbanks: A Plan for SIFI Lite, 105 Geo. L.J. 1379, 1389–93 (2017) (noting the expansive powers of the FSOC in designation). By requiring the FSOC to bring multiple regulators together, it provides a way to ensure that regulators share information, develop a plan for reform, scrutinize and debate their own supervisory methodologies, and arrive at a mode of overseeing Treasuries that recognizes the linkages between trading and repo markets. As shown in this Article, this means developing a more hybrid regulatory strategy that is capable of moving beyond blunt prudential versus securities market approaches.

Introduction of the FSOC as a coordinating regulator for Treasuries is only a first step toward creating a governance model for public oversight. Even with the FSOC, agencies may struggle to work together. They may fail to share data or coordinate. Opacity may hamper attempts to understand how risks move between repo and Treasuries trading. In the absence of a strong system of supervision, the Treasury market may well just be left to depend on the Fed’s ex post interventions in a crisis. But disruptions to U.S. Treasury secondary and Treasury-backed repo markets (for example, in March 2020) show that the current fragmentation and disorganization between regulators is untenable and harmful. Coordination through FSOC begins a process of deeper institutional reform.304For example, the SEC itself put out a detailed proposal to provide thoroughgoing reform of Treasuries trading platforms. The systemic importance of such platforms would point to the importance of prudential regulators also being involved. Press Release, U.S. Sec. & Exch. Comm’n, SEC Proposes Rules to Extend Regulations ATS and SCI to Treasuries and Other Government Securities Markets (Sept. 28, 2020), https://www.sec.gov/news/press-release/2020-227 [https://perma.cc/YA3E-TRND]. Further, systematized transparency (for example, through disclosure and reporting) offers a way to help bridge the difficulties faced in developing a hybrid approach to overseeing the interlinkages between Treasuries-backed repo and Treasuries trading markets. With all regulators able to share in data from both repo markets and Treasuries, understanding interdependencies should become practicable. Information—in addition to saving collection costs and bridging institutional hurdles to communication—can foster collective focus on a connected marketplace. This approach of co-opting banking and securities regulators and ensuring greater coordination through the FSOC offers a way out of the bifurcated approaches that treat repo and trading markets as basically distinct and subject to separate modes of scrutiny.

This Part proposes a three-part solution to place Treasury markets on a stronger footing to better withstand the weight this market carries for the financial system and the economy. It proposes first developing stronger information flows to increase reporting and transparency, affording primary dealers greater ease in monitoring exposures as well as giving regulators a clearer idea about the market’s structural weaknesses in real time. In addition, an affirmative obligation on major dealers to remain trading creates confidence in the resiliency of liquidity across the marketplace, especially during crises. Finally, we advocate for regulatory oversight that can bridge fragmentation and offer a more consolidated, coordinated system of supervision. The FSOC provides a convening authority. But, looking forward, fixing the fractures in regulation would help to ensure that the Treasury market’s overseers are well positioned to match the realities of its critical importance to financial market stability.

CONCLUSION

Financial stability rests on a central idea that Treasuries represent a bulwark against distress, representing the foremost risk-free asset anywhere on the globe. Free of default risk and trading in a market with supposed plentiful liquidity, public and private regulation are anchored to Treasuries for their function and assume that Treasuries will protect firms and markets from collapse. In this Article, we show why this assumption is incorrect. While Treasuries themselves are viewed as risk-free, the market that distributes them is not. It is pervasively subject to flawed intermediation. Importantly, the demands of public and private industry regulation are internally in conflict, crystallizing the harms of faulty intermediation. Despite their importance, these risks in the secondary and repo markets remain undertheorized and poorly understood, leaving Treasuries perpetually at risk of failing to perform their protective role. Without real reform, the first steps to which we outline here, we worry that Treasuries cannot live up to their reputation, undermining their promise for regulation as the anchor in financial system stability.

APPENDIX: FIGURES

Figure 1.A.  Bilateral Repo Market Collateral Outstanding ($ Trillions)

Sources: SIFMA Rsch., supra note 26 (providing figures for the bilateral repo and reverse repo markets through 2021); Sec. Indus. & Fin. Mkts. Assoc., supra note 21 (providing data through July 2024).

Figure 1.B.  Tri-Party Repo Market Collateral Outstanding ($ Billions)

Source: Fed. Rsrv. Bank of N.Y., supra note 26 (select “Total” and “US Treasuries excluding Strips”).

Figure 2.  Primary Dealer Repos and Reverse Repos Outstanding ($ Billions)

Source: Fed. Rsrv. Bank of N.Y., supra note 209 (providing raw data on trades and positions of primary dealers).

Figure 3.A.  Secondary Market Trading of Primary Dealers: Daily Inventory Risk Exposure ($ Billions)

Source: Fed. Rsrv. Bank of N.Y., supra note 209 (providing raw data on trades and positions of primary dealers).

Figure 3.B.  Secondary Market Trading of Primary Dealers: Daily Trading Volume ($ Billions)

Source: Fed. Rsrv. Bank of N.Y., supra note 209 (providing raw data on trades and positions of primary dealers).

Figure 4.  Daily Changes in Primary Dealer Treasury Holdings: Repo Market vs. Treasury Secondary Market ($ Billions)

Source: Fed. Rsrv. Bank of N.Y., supra note 209 (providing raw data on trades and positions of primary dealers).

97 S. Cal. L. Rev. 1349

Download

* W. Ross Johnston Chair and Professor of Finance, University of Oklahoma.

† Professor of Law and Milton R. Underwood Chair, Associate Dean & Robert Belton Director of Diversity, Equity and Community, Vanderbilt Law School. We are deeply grateful for thoughtful comments, insights, and conversations. We thank Dan Awrey, Jonathan Brogaard, Peter Conti-Brown, Chris Brummer, Nakita Cuttino, Anna Gelpern, Evan Gerhard, Patricia McCoy, Mitu Gulati, Rory van Loo, Michael Kang, Lev Menand, Robert Rasmussen, Morgan Ricks, Paolo Saguato, Nadav Shoked, Danny Sokol, Kumar Venkataraman, Jialan Wang, Mark Weidemeier and participants at the Wharton School of Business Conference on Financial Regulation, the Vanderbilt Law School Annual Conference on Central Banking and Financial Regulation and faculty workshops at the University of Illinois Business School, Northwestern Law School, and USC Gould School of Law. Runzu Wang and Doris Zhou at the University of Oklahoma provided excellent research assistance.

War and Coercion

Compelled service in hostile forces is prohibited by International Humanitarian Law. In the context of an international armed conflict, it is a war crime to compel prisoners of war (“POWs”) or other protected persons to serve in the forces of a hostile power and to compel participation in military operations against the person’s own country or forces. However, conscription—or compelled service in military forces—of a state’s own citizens is not prohibited under international law. In fact, conscription, some aspects of which are regulated by International Human Rights Law, is generally legitimate.

This asymmetry—whereby compelling protected persons to fight or serve in the forces of a hostile power is a war crime, but compelling one’s own citizens is not—has puzzling implications. Take the example of Russia’s invasion of Ukraine. It is a war crime for Ukraine to compel Russian POWs to fight on behalf of Ukraine, even though Ukraine is fighting a lawful war of self-defense. Yet, it is not a war crime for Russia to compel its own citizens to fight, even though Russia is fighting an unlawful war of aggression.

Can we make moral sense of this asymmetric regime regarding compelled service in armed forces? Is the regime morally coherent? In order to make moral sense of the regime, two arguments must succeed. First, we must argue that it matters greatly whether individuals are compelled to fight in hostile forces or in the armed forces of their own state. Second, we must argue that the nature of the war they are compelled to serve in—whether the war is legal or illegal—does not matter at all.

This Article argues that the second argument cannot but fail, but it is possible to argue that compelled service in hostile forces is morally wrong and often morally worse than compelled service in the armed forces of one’s own state. It is morally worse because it is morally worse to harm those who are vulnerable and defenseless, like those who have fallen into the hands of a party to the conflict. And it is morally wrong because noncitizens lack duties to fight on behalf of other states. However, what makes compelled service in hostile forces morally wrong also makes conscription morally wrong. That is, what is wrong about compelled service in hostile forces is also present in the state’s conscription of its own citizens.

This Article thus argues that the current regime concerning compelled service in armed forces is, in fact, morally incoherent. To render the regime morally coherent, international law should (1) appropriately distinguish between conscription to serve in legal wars and conscription to serve in illegal wars, and (2) generally prohibit compelled service in armed forces.

INTRODUCTION

Russia has been conscripting men from occupied Crimea to serve in Russian armed forces for several years. During the ninth conscription campaign, which ended in June 2012, at least 3,300 men from Crimea had been enlisted, bringing the total of forced conscripted men to at least 18,000.1Off. of the U.N. High Comm’r for Hum. Rts., Rep. on the Human Rights Situation in Ukraine: 16 May to 15 August 2019, ¶ 111 (2019), https://www.ohchr.org/sites/default/files/Documents/Countries/UA/ReportUkraine16Feb-15May2019_EN.pdf [https://perma.cc/VL53-T4MQ].

Conscription in occupied Crimea was still ongoing in 2019.2Crimea: Conscription Violates International Law, Hum. Rts. Watch (Nov. 1, 2019, 12:00 AM), https://www.hrw.org/news/2019/11/01/crimea-conscription-violates-international-law [https://perma.cc/8MCU-YCTU]; see Marten Zwanenburg, Ukraine Symposium—Forced Conscription in the Self-Declared Republics, Lieber Inst. (Aug. 8, 2022), https://lieber.westpoint.edu/forced-conscription-self-declared-republics [https://perma.cc/2ACS-UTE9]. By 2022, The Guardian reported that men in the Donbas region were being forcibly conscripted to serve in the armed forces of the self-declared Donetsk Peoples Republic and Luhansk Peoples Republic.3Zwanenburg, supra note 2 (citing Peter Beaumont & Artem Mazhulin, ‘They Hunt Us Like Stray Cats’: Pro-Russia Separatists Step Up Forced Conscription as Losses Mount, Guardian (July 20, 2022, 09:01 EDT), https://www.theguardian.com/world/2022/jul/20/pro-russian-separatists-step-up-forced-conscription-as-losses-mount [https://perma.cc/9ACP-ZPVC]). At the same time, Russia has been conscripting its own citizens to fight in Ukraine.4Sarah Dean & Rob Picheta, Russia Admits Conscripts Have Been Fighting in Ukraine, Despite Putin’s Previous Denials, CNN (Mar. 9, 2022, 7:27 PM), https://www.cnn.com/2022/03/09/europe/russia-conscripts-fighting-ukraine-intl [https://perma.cc/N65R-VE43].

In times of war, conscription of individuals in occupied territory and the state’s conscription of its own citizens share an important feature. They both involve a severe restriction on individuals who are called on to fight, and possibly to kill and die, on behalf of the state. However, although they share this important feature, international law treats them differently. In fact, one might say that there is an asymmetry in how international law treats compelled service (or conscription) to serve in armed forces. Conscription of protected persons to serve in hostile forces, when done by an occupying or detaining power, is a war crime under International Humanitarian Law (“IHL”).5Hague Convention (II) with Respect to the Laws and Customs of War on Land and Its Annex: Regulations Concerning the Laws and Customs of War on Land art. 44, July 29, 1899, 32 Stat. 1803, 1 Bevans 247 [hereinafter 1899 Hague Convention]; Hague Convention (IV) Respecting the Laws and Customs of War on Land and Its Annex: Regulations Concerning the Laws and Customs of War on Land art. 23(h), Oct. 18, 1907, 32 Stat. 1803, 1 Bevans 247 [hereinafter 1907 Hague Convention]; Geneva Convention (IV) Relative to the Protection of Civilian Persons in Time of War art. 51, ¶ 1, Aug. 12, 1949, 6 U.S.T. 3516, 75 U.N.T.S. 287 [hereinafter Geneva Convention (IV)]; Geneva Convention (III) Relative to the Treatment of Prisoners of War art. 130, Aug. 12, 1949, 6 U.S.T. 3316, 75 U.N.T.S. 135 [hereinafter Geneva Convention (III)]; Rome Statute of the International Criminal Court arts. 8(2)(a)(v), 8(2)(b)(xv), July 17, 1998, 2187 U.N.T.S. 38544 [hereinafter Rome Statute]. Conscription of the state’s own citizens to fight a war is, however, not only not a war crime, but is also recognized by international law as the state’s prerogative.6See, e.g., U.N. High Comm’r for Refugees, Guidelines on International Protection No. 10: Claims to Refugee Status Related to Military Service Within the Context of Article 1A(2) of the 1951 Convention and/or the 1967 Protocol Relating to the Status of Refugees, ¶ 5, U.N. Doc. HCR/GIP/13/10/Corr.1 (Nov. 12, 2014) [hereinafter Guidelines on International Protection No. 10].

Perhaps surprisingly, this asymmetry has received almost no attention in international legal scholarship. Perhaps even more surprisingly, the crime of compelled service in hostile forces and the ethics of conscription have also received very little attention, both in international legal scholarship and political theory. Yet the asymmetry regarding compelled service in armed forces (or the asymmetry regarding conscription) present in international law has some puzzling implications.

Take the example of Russia’s invasion of Ukraine. Russia’s conscription of its own citizens to fight its unlawful war of aggression is permitted by international law.7Something that Russia is in fact doing. Dean & Picheta, supra note 4. By contrast, it is a war crime for Ukraine to compel Russian prisoners of war (“POWs”) to fight on behalf of Ukraine, even though Ukraine is fighting a lawful war of self-defense. Even more so, if Russian citizens voluntarily decided to join Ukrainian armed forces to fight against Russian aggression—and some of them have8Michael Schwirtz, They Are Russians Fighting Against Their Homeland. Here’s Why., N.Y. Times (Feb. 12, 2023), https://www.nytimes.com/2023/02/12/world/europe/russian-legion-ukraine-war.html [https://perma.cc/6T94-M8BP]. —they are not protected by international law and could be prosecuted by Russia. In fact, in September 2022, Russia toughened up penalties for voluntary surrender to enemy forces, desertion, and refusal to fight by up to ten years in prison.9Russia Stiffens Penalty for Desertion; Replaces Top General, Al Jazeera (Sept. 24, 2022), https://www.aljazeera.com/news/2022/9/24/putin-toughens-penalty-for-surrender-refusal-to-fight-in-ukraine [https://perma.cc/N4GT-NQ3Z].

International law thus distinguishes between compelled service in hostile forces and compelled service in the armed forces of one’s own state—prohibiting the first but allowing the latter—but fails to distinguish between compelled service in legal wars and compelled service in illegal wars. The distinction drawn by international law suggests that there is a normatively relevant difference between compelled service in hostile forces and compelled service by one’s own country—a difference significant enough to permit the latter and make the first a war crime. And it also suggests that there is no normatively relevant difference based on whether the wars one is forced to fight are legal or illegal.

This asymmetry demands a justification. We ought to try to make sense—moral sense—of the international legal regime on compelled service in armed forces. This is not the same as attempting to explain why the regime is the way it is. That explanation might be historical in character if, for example, states could agree regarding the prohibition on compelled service in hostile forces but could not agree—or did not want to agree—regarding the state’s conscription of its own citizens or regarding the relevance of whether the wars individuals are compelled to serve in are legal or illegal. And that historical explanation might provide a moral justification for the adoption of the prohibition on compelled service in hostile forces.10For distinction and argument, see Marcela Prieto Rudolphy, The Morality of the Laws of War: War, Law, and Murder 74–85 (2023). For example, if we think compelled service in armed forces is always wrong and should be prohibited, but states could only agree to prohibit compelled service in hostile forces, we might argue that it is better to prohibit one morally wrong behavior than to prohibit nothing at all.

However, the question this Article is concerned with is not a question about the historical explanation of the current international regime on conscription, nor a question about whether we can justify its adoption. It is a question about its content. Can we make moral sense of this asymmetric regime regarding compelled service in armed forces during times of war? Is the regime morally coherent?

This Article thus brings moral and political philosophy to bear on international law.11For others who have done so, see generally Adil Ahmad Haque, Law and Morality at War (2017); Arthur Ripstein, Kant and the Law of War (2021); Jeremy Waldron, Torture, Terror, and Trade-Offs: Philosophy for the White House (2010); Philipp Gisbertz-Astolfi, Reduced Legal Equality of Combatants in War, 35 Ethics & Int’l Affs. 443 (2021). It is concerned with the relationship between international law and morality and, in particular, with the question of how law ought to be if it wishes to be morally coherent. By moral coherence, I am referring to the idea that a legal regime (like the regime on conscription) should be complete; that is, it should equally prohibit behaviors that are similarly wrongful instead of failing to prohibit things that are as, or more, wrongful than the behaviors it already prohibits.12Richard Wasserstrom, The Laws of War, 56 Monist 1, 7–8 (1972). I am thus not concerned with the regime’s integrity or coherence in the Dworkinian sense. See Ronald Dworkin, Law’s Empire 176–275 (1986). And a legal regime should also “make moral sense,” that is, it should be morally intelligible, in the sense that it does not fail to take into account important moral reasons in favor of and against prohibiting certain behaviors.

In order to make moral sense of the international legal regime on conscription, two arguments must succeed. First, we must argue that it matters greatly whether individuals are compelled to fight in hostile forces or in the armed forces of their own state. Second, we must argue that the nature of the war that individuals are compelled to serve in—whether the war is legal or illegal—does not matter at all.

These are difficult arguments to make. This Article will argue that the second argument is impossible to make; it cannot but fail. Whether individuals are forced to fight legal or illegal wars is morally significant and should be accounted for. The first argument is more plausible: it is possible to show that compelled service in hostile forces is often morally worse than compelled service in the armed forces of one’s own state and that compelled service in hostile forces itself is morally wrong. It is morally worse because it is morally worse to harm those who are vulnerable and defenseless, like those who have fallen into the hands of a party to the conflict, and because it is morally worse to be coerced to fight against those we care about. And it is morally wrong because noncitizens lack duties to fight on behalf of other states.

However, these arguments fail to support the current regime. This is so because the fact that compelled service in hostile forces is morally worse than the state’s conscription of its own citizens cannot show, on its own, that the latter is morally permissible. The fact that something is morally worse than something else says nothing about whether what is morally better is permitted. Thus, showing that compelled service in hostile forces is often morally worse than the state’s conscription of its own citizens cannot explain why compelled service in hostile forces is prohibited, but the state’s conscription of its own citizens is allowed. And the second argument, which shows that compelled service in hostile forces is morally wrong, also fails to explain why conscription of a state’s own citizens is morally permissible. This is so because citizens often lack duties toward their own states to kill and die on its behalf. That is, what is wrong about compelled service in hostile forces is also present in the state’s conscription of its own citizens.

This Article makes then three claims, which are related but logically independent from each other. First, it is morally wrong to conscript individuals to fight wars of aggression, regardless of whether the citizen’s state or hostile forces do so. Second, compelled service in hostile forces is often morally worse than the state’s conscription of its own citizens and is also morally wrong. Third, conscription by the state, even to fight legal wars, is also often morally wrong. This Article thus concludes that the current regime concerning compelled service in armed forces is, in fact, morally incoherent.13Wasserstrom, supra note 12, at 7–8. It is morally incoherent because it is incomplete and cannot be made sense of.14Id. It is incomplete because it fails to criminalize or prohibit conduct (the state’s conscription of its own citizens) that is similarly wrong when compared to what it already criminalizes (compelled service in hostile forces). And it cannot be made sense of because it fails to distinguish between legal and illegal wars, so that states are allowed to compel their own citizens to fight illegal wars.

The fact that the regime on compelled service is morally incoherent does not mean that the compelled service in hostile forces prohibition (“CSHF prohibition”) lacks value or is morally misguided. The CSHF prohibition, this Article will argue, protects fundamental rights and interests. However, it is incomplete. It is that incompleteness that makes the current regime incoherent. To render the regime morally coherent, international law should (1) appropriately distinguish between conscription to serve in legal wars and conscription to serve in illegal wars, and (2) generally prohibit compelled service in war. The latter might seem entirely utopian. It might also make it harder for states to fight wars—possibly even lawful ones. But lawful (and just) wars are the exception, and conscription to fight in war imposes a severe restriction on individuals’ freedom. Even if international law never comes to prohibit the state’s conscription of its own citizens, there are powerful moral reasons for doing so. The fact that such a prohibition is utopian is not a (moral) reason against it.15See David Estlund, Utopophobia, 42 Phil. & Pub. Affs. 113 passim (2014).

The remainder of this Article proceeds as follows. Part I gives a brief overview of the international regime on compelled service in armed forces, distinguishing between compelled service in hostile forces and the state’s conscription of its own citizens. Part II argues that the regime’s failure to distinguish between conscription to serve in legal wars and conscription to serve in illegal wars cannot be defended. At the very least, international law should prohibit states from conscripting their own citizens to fight wars of aggression, and the illegal nature of the war should be an additional aspect of the crime of compelled service in hostile forces. Part III tries to make sense of the fact that international law considers compelled service in hostile forces to be significantly worse than the state’s conscription of its own citizens. It argues that although there is a plausible case for why compelled service in hostile forces is more wrongful, or morally worse, than the state’s conscription of its own citizens, this argument cannot support the latter’s permissibility. And some of the arguments that show why compelled service in hostile forces is morally wrong put into question the permissibility of conscription generally. Part IV discusses what it would take for the international legal regime on conscription to be morally coherent. It argues that (1) international law should make it a war crime for states to conscript their own citizens to fight in illegal wars; (2) the unlawfulness of the war in compelled service in hostile forces should be an additional aspect of the crime; and (3) there is a pro tanto reason for international law to generally prohibit conscription to fight in war.

I.  THE INTERNATIONAL LEGAL REGIME ON CONSCRIPTION

A.  Compelled Service in Hostile Forces

Compelled service in hostile forces was not always prohibited and was, in fact, a common practice across different cultures.16See Rita J. Simon & Mohamed Alaa Abdel-Moneim, A Handbook of Military Conscription and Composition the World Over 7 (2011) (noting that Greek and late Roman armies conscripted young men from enemy nations that had been defeated or from tribes that had signed a treaty to remain outside the empire). POWs—that is, those combatants who had been captured, due to surrender or injury, by enemy powers—were thought to have forfeited their lives by surrender or capture, and, in practice, they were often required to join the forces of their captors.17Michael Walzer, Obligations: Essays on Disobedience, War, and Citizenship 148 (1970).

The CSHF prohibition was introduced into treaty law with the Hague Regulations of 1899. Article 44 of the Hague Convention of 1899 prohibits the compulsion of the population of occupied territory to take part in military operations against their own country.181899 Hague Convention, supra note 5, at art. 44. The Hague Convention of 1907, Article 23(h), prohibits compelling the nationals of the hostile party to “take part in the operations of war directed against their own country, even if they were in the belligerent’s service before the commencement of war.”191907 Hague Convention, supra note 5, at art. 23(h). This provision is limited to nationals of the hostile party.20Zwanenburg, supra note 2.

Later, the Geneva Conventions also included the CSHF prohibition. Under the Fourth Geneva Convention, it is a grave breach for an occupying power to compel protected persons to serve in its armed or auxiliary forces.21Geneva Convention (IV), supra note 5, at art. 51. Protected persons are “those who at a given moment and in any manner whatsoever, find themselves, in case of a conflict or occupation, in the hands of a Party to the conflict or Occupying Power of which they are not nationals.”22Id. at art. 4. And Article 147 states that it is a grave breach to compel “a protected person to serve in the forces of a hostile Power,”23Id. at art. 147. which is not limited to situations of occupation, but applies generally in the context of international armed conflicts (“IACs”).24Zwanenburg, supra note 2. The Third Geneva Convention, relative to the “Treatment of Prisoners of War,” provides that compelling a POW or a protected person to serve in the forces of a hostile power is a grave breach of the conventions.25Geneva Convention (III), supra note 5, at art. 130. Note, however, that while compelled service in hostile forces is a war crime, enlistment that is the result of pressure or propaganda is a violation of the Fourth Geneva Convention, but not a war crime.26Zwanenburg, supra note 2.

The CSHF prohibition is also contained in the statutes of the International Criminal Tribunal for the Former Yugoslavia (“ICTY”) and the International Criminal Court (“ICC”).

The statute of the ICTY expressly included “compelling a prisoner of war or a civilian to serve in the forces of a hostile power” as part of the grave breaches of the 1949 Geneva Conventions under its jurisdiction.27Statute of the International Tribunal for the Prosecution of Persons Responsible for Serious Violations of International Humanitarian Law Committed in the Territory of the Former Yugoslavia Since 1991 art. 2(e), Sept. 2009, https://www.icty.org/en/documents/statute-tribunal [https://perma.cc/Z7T8-W94W]. However, there were no convictions that relied exclusively on a violation of this provision, and no indictments for compelling POWs or civilians to serve in the forces of a hostile power.28William A. Schabas, The UN International Criminal Tribunals: The Former Yugoslavia, Rwanda and Sierra Leone 252 (2006).

The Rome Statute for the ICC also includes the prohibition in question. It distinguishes four categories of war crimes29Knut Dörmann, War Crimes Under the Rome Statute of the International Criminal Court, with a Special Focus on the Negotiations on the Elements of Crimes, in 7 Max Planck Yearbook of United Nations Law 341, 344 (Armin von Bogdandy et al. eds., 2003).: (1) grave breaches of the Geneva Conventions in the context of IACs;30Id. (2) other serious violations of IHL contained in the Hague Conventions, Additional Protocol I to the Geneva Conventions, and the 1925 Geneva Gas protocol;31Id. (3) serious violations of Article 3 common to the Geneva Conventions in the context of non-international armed conflicts (“NIACs”);32Id. and (4) other violations of IHL in the context of NIACs.33Id.

The selection of war crimes to be ultimately included in the Rome Statute was based on two considerations: first, the norm should be part of customary international law (“CIL”); and second, the violation of the norm should give rise to individual criminal responsibility under CIL.34Id. at 345.

The crime of forced service in hostile forces is enshrined in Articles 8(2)(a)(v) and 8(2)(b)(xv) of the Rome Statute. The former states that it is a war crime to compel a POW or other protected persons to serve in the forces of a hostile power in the context of an IAC.35Rome Statute, supra note 5, at art. 8(2)(a)(v). The latter makes it a war crime to compel participation in military operations against a person’s own country or forces in the context of an IAC.36Id. at art. 8(2)(b)(xv). The expression “forces” should be given a broad interpretation, and forced service is prohibited not only regarding forces hostile to the individual’s own country, but also regarding allied countries and forces.37See Dörmann, supra note 29, at 374.

The first provision—Article 8(2)(a)(v)—effectively combined the language of the Geneva Conventions with Article 23 of the 1907 Hague Regulations.38Id. The second one is based solely on Article 23 of the 1907 Hague Regulations.39Id.

Finally, the CSHF prohibition has now crystallized into CIL, at least in the context of IACs. The International Committee of the Red Cross’s (“ICRC”) statement on the latter includes, in Rule 95, the prohibition on uncompensated or abusive forced labor, and it specifies that compelling persons to serve in the forces of a hostile power is a specific type of forced labor that is prohibited in IACs.40Jean-Marie Henckaerts & Louise Doswald-Beck, International Committee of the Red Cross, Customary International Humanitarian Law, Volume I: Rules 330, 333 (2005) [hereinafter ICRC Rule 95].

Many countries incorporate similar prohibitions in their military manuals and criminal codes.41Id. at 331. And, in 2005, the Israeli Supreme Court found that the IDF’s “Early Warning” procedure was at odds with international law, in part because it ran afoul of “a basic principle, which passes as a common thread running through all of the law of belligerent occupation,” consisting of “the prohibition of use of protected residents as part of the war effort of the occupying army.”42HCJ 3799/02 Adalah v. GOC Central Command, 60(3) PD 1, ¶¶ 24–25 (2005) (Isr.). The “Early Warning” procedure stipulated that IDF soldiers who wished to arrest a Palestinian suspected of terrorist activity may be assisted by a local Palestinian resident, who would warn the arrestee of possible harm to themselves or those present when the arrest took place.43Id. at 1. The procedure could only be used when it posed no risk to the Palestinian resident, and the latter consented to it,44Id. ¶ 7. but the Court found that, given the inequality between the occupying force and the local resident, consent was unlikely to be real.45Id. ¶ 24.

B.  Conscription of the State’s Own Citizens

Although forced labor is prohibited under International Human Rights Law, conscription is not treated as an instance of it.46Hum. Rts. Council, Rep. of the Off. of the High Comm’r for Hum. Rts. on Conscientious Objection to Military Service, ¶ 2, U.N. Doc. A/HRC/9/24 (Aug. 20, 2008) [hereinafter U.N. Doc. A/HRC/9/24]. Conscription—or compelled service in military forces—of a state’s own citizens is not prohibited under international law, except in the case of children, in which case it is a war crime.47Rome Statute, supra note 5, § 8(2)(e)(vii).

Conscription is well accepted in international law regarding a state’s own citizens. Voluntary enlistment and service in foreign forces is also not a violation of international law.48William W. Fitzhugh & Charles Cheney Hyde, The Drafting of Neutral Aliens by the United States, 36 Am. J. Int’l L. 369, 370–71 (1942). At the Hague Conference in 1907, the German delegation wanted to incorporate an article stating that belligerent parties could not ask neutral persons to render them war services, even if it was voluntary, which was supported by the United States.49Id. at 370. The proposal did not succeed. More recently, in 2022, Russia issued a law that facilitates the attainment of Russian citizenship for foreigners who voluntarily enlist in the Russian army for at least a year.50Al Jazeera, supra note 9.

Regarding the conscription of noncitizens with permanent residence, the law is less settled. In fact, the United States drafted permanent residents at least during the Korean and Vietnam Wars. In both, the United States required military service of every non-U.S. male citizen admitted to permanent residency and actually residing in the United States for more than a year.51Walzer, supra note 17, at 108–09. Often, the United States conscripted resident foreigners unless they agreed to forfeit future claims to citizenship.52Id. at 108.

During the course of the two World Wars, the main Allied belligerents also conscripted nationals of other states, much to the protest of the states in question.53Clive Parry, International Law and the Conscription of Non-Nationals, 31 Brit. Y.B. Int’l L. 437, 439 (1954). Opposition was based on the general principles of international law, but more often, it was based on treaties that ensured states would not conscript each other’s foreign nationals.54Id. Sometimes, treaties were signed with the opposite purpose: in the course of World War II, the Allies entered into treaties with the United States to secure that nationals of the Allies residing in the United States would serve in either the forces of the United States or of their own countries.55Id. at 442.

Thus, it is fairly clear that conscription of resident noncitizens is not a war crime under international law, even though it might be prohibited on account of bilateral international treaties. But it is unclear whether a prohibition on conscripting resident noncitizens has crystallized in CIL or whether it remains a rule of comity, as suggested in the 1970s by Frank Upham and Charles E. Roh, Jr.56Charles E. Roh, Jr. & Frank K. Upham, The Status of Aliens Under United States Draft Laws, 13 Harv. Int’l. L.J. 501, 501–02 (1972).

Regarding a state’s own citizens, excepting children, conscription is well accepted by international law, both as a general practice in times of peace and as a practice in times of war. The practice of conscription itself has an old history, and after the two World Wars, it remained the norm in many countries.57Simon & Abdel-Moneim, supra note 16, at 19. With the end of the Cold War, the debate about universal military conscription regained force again, and at the beginning of the 1990s, France, the Netherlands, and Belgium abandoned the system of conscription, the universal draft, or both.58Id. In the process of European reintegration, military conscription largely disappeared.59Mitchell A. Belfer, Conscription and European Security: A Theoretical First-Step, 1 Cent. Eur. J. Int’l & Sec. Studs. 28, 28 (2007). Belfer suggests that this has to do with the fact that security identities in post-Cold War Europe are increasingly forged by cosmopolitan values.60Id.

Nonetheless, debates around conscription still occur in the public space, and, in the United States, resurfaced during the wars in Afghanistan and Iraq.61Simon & Abdel-Moneim, supra note 16, at 20. Indeed, some commentators worried that reliance on all-volunteer forces (“AVF”) would be insufficient to satisfy the demands of war in those countries.62Id.

The decreasing practice of conscription in many countries has led some to speak of a “crisis of conscription.”63Rafael Ajangiz, The European Farewell to Conscription?, in 20 The Comparative Study of Conscription in the Armed Forces 307, 308 (Lars Mjøset & Stephen van Holde eds., 2002). There have also been increases in figures for conscientious objection in some countries like Italy, Spain, and Germany.64Id. But protest and resistance to the draft or to conscription have been common in different contexts and cultures,65Id. and the discourse of crisis is disputed. Leander and Joenniemi, for example, argue that the landscape of conscription is not homogenous, and the so-called crisis of conscription can take different forms, not all of which involve abolishing conscription.66Anna Leander & Pertti Joenniemi, Conclusion: National Lexica of Conscription, in The Changing Face of European Conscription 161, 161 (Pertti Joenniemi ed., 2016). Further, Leander is skeptical that conscription as a practice is coming to an end.67Anna Leander, Drafting Community: Understanding the Fate of Conscription, 30 Armed Forces & Soc’y 571, 572–73 (2004).

At least legally, conscription, understood as service in military forces during times of peace or during times of war (in the latter case, the practice might be called “the draft”), is still recognized as the state’s prerogative, following from the state’s right to self-defense and sovereignty.68See, e.g., Guidelines on International Protection No. 10, supra at note 6, ¶ 5. Conscription by non-state groups is, by contrast, prohibited.69Id. ¶ 7. In this Article, I do not discuss the issue of conscription by non-state actors. Nonetheless, some of the arguments I discuss in Part III would apply, with some modifications, in this context.

Nonetheless, in recent years, a right to conscientious objection has started to crystallize.70See generally, e.g., U.N. Doc. A/HRC/9/24, supra note 46. Both the Human Rights Committee and the UN Human Rights Council have recognized a right to conscientious objection to military service, based on the right to freedom of thought, conscience, and religion enshrined in Article 18 of the Universal Declaration of Human Rights and the International Covenant on Civil and Political Rights.71Rachel Brett, Quaker U.N. Off., International Standards on Conscientious Objection to Military Service 3 (2011).Human Rights Council reiterated the view that there is a right to conscientious objection to military service, even though a number of states still do not recognize it.72Hum. Rts. Council, Rep. of the Off. of the High Comm’r for Hum. Rts. on Approaches and Challenges with Regard to Application Procedures for Obtaining the Status of Conscientious Objector to Military Service in Accordance with Human Rights Standards, ¶ 4, U.N. Doc. A/HRC/41/23 (May 24, 2019) [hereinafter U.N. Doc. A/HRC/41/23]. And the European Court of Human Rights, which, previous to 2000, did not recognize the right to conscientious objection to military service, has also adopted the view that conscientious objection is an aspect of the right to freedom of thought, conscience, and religion.73Elizaveta Chmykh, Grazvydas Jasutis, Rebecca Mikova & Richard Steyne, Legal Handbook on the Rights of Conscripts 60 (2020).

By contrast, selective conscientious objection, which accepts the legitimacy of some military action but objects to particular instances of it, is not recognized as a right under international law.74U.N. Doc. A/HRC/41/23, supra note 72, ¶ 26. Still, Amnesty International has adopted cases of selective conscientious objection, which have arisen in places like South Africa and Israel.75Edy Kaufman, Prisoners of Conscience: The Shaping of a New Human Rights Concept, 13 Hum. Rts. Q. 339, 349 (1991).

Finally, in some cases, the consequences following from objecting or evading conscription can amount to persecution for the purposes of being recognized as a refugee. In its 2014 guidelines on the issue, the Office of the United Nations High Commissioner for Refugees (“UNHCR”), consistent with international law, recognized the rights of states to require citizens to perform military service for military purposes, as well as the rights of states to impose penalties on those who avoid or desert military service, provided that “their desertion or avoidance is not based on valid reasons of conscience” and that the penalties and associated procedures in question comply with international standards.76Guidelines on International Protection No. 10, supra note 6, ¶ 5. In the context of refugee status, the UNHCR guideline states that persecution against draft evaders, deserters, or conscientious objectors might occur in certain circumstances, such as if there is a risk of threat to life or freedom or other serious human rights violations.77Id. ¶ 14. In those cases, those who selectively object to participating in military service in a conflict contrary to the basic rules of human conduct or in an unlawful conflict and those who object to the means and methods of warfare would be covered, provided that certain circumstances obtain.78Id. ¶¶ 21–30. See also Daniel Davies, Which Russians Fleeing Military Service Should Be Recognized as Refugees? The Answer Is More Complicated and More Interesting Than Politicians Think, Opinio Juris (Oct. 17, 2022), https://opiniojuris.org/2022/10/17/which-russians-fleeing-military-service-should-be-recognized-as-refugees-the-answer-is-more-complicated-and-more-interesting-than-politicians-think [https://perma.cc/RFN6-5JYE]. And the Court of Justice of the European Union held in 2020 that conscription in a conflict characterized by the repeated and systematic commission of war crimes and crimes against humanity can be assumed to involve the commission of such crimes.79Tom Dannenbaum, Mobilized to Commit War Crimes?: Russian Deserters as Refugees, Part II, Just Sec. (Sept. 27, 2022) (citing EZ v. Bundesrepublik Deutschland, No. C-238/19 (E.C.J. 2020)), https://www.justsecurity.org/83269/russian-deserters-as-refugees-part-two [https://perma.cc/Y26K-36QK]. The Court also concluded that there is a strong presumption that the prospect of punishment for refusal to fight in such circumstances would amount to persecution for the purposes of refugee status.80Id.

In sum, while compelled service in hostile forces is a war crime, states have the right to demand of their own citizens that they serve in their military forces during times of war and peace. The only exceptions to the scope of that right are the conscription of children and the right to conscientious objection.

In other words, international law draws a significant distinction between compelled service in hostile forces and compelled service in the armed forces of one’s state. And it also fails to distinguish between legal and illegal wars. As a result, the current regime makes it a war crime to compel protected persons to fight in hostile forces, even if the latter are engaged in lawful wars. And it makes it a state’s prerogative to conscript its own citizens to serve in its armed forces, regardless of whether the wars they are forced to fight are legal or illegal.

There is thus a question of whether the international legal regime on conscription can be rendered morally coherent. In order to do so, we would need to successfully claim that (1) the nature of the wars that people are forced to fight is irrelevant and (2) that it is much worse to be forced to fight by hostile forces than it is to be forced to fight by one’s own state. Let us take each of these arguments in turn.

II.  LEGAL AND ILLEGAL WARS

The international law on conscription fails to distinguish between illegal and legal wars. This suggests that this distinction is normatively irrelevant—that for the purposes of conscription, whether in hostile forces or the armed forces of one’s state, the fact that the war is legal or illegal does not matter at all—and does not alter our moral evaluation of the facts. This claim, however, is highly implausible. Consider the following implications of the regime.

First, under the current regime, compelled service in hostile forces is a war crime. This will be the case regardless of whether those hostile forces are engaged in lawful or unlawful uses of force. That is, under the present regime, it would be equally wrong for Ukraine to compel Russian POWs to fight against Russia as for Russia to compel Ukrainian POWs to fight against Ukraine.

Second, because the state’s prerogative to conscript its own citizens also fails to distinguish between legal and illegal wars, under the present regime, a state that conscripts its own citizens to fight an unlawful war of aggression commits no international crime. In fact, the state’s behavior is arguably not even prohibited by international law. As a result, under the present regime, Russia acts permissibly when it conscripts its own citizens to fight against Ukraine.

Third, because the prerogative to conscript individuals belongs only to the state, conscription by non-state armed groups is prohibited by international law. This implies that if non-state armed groups were operating in Ukraine or Russia and forcing individuals to fight against Russia, they would be acting as wrongfully as non-state armed groups conscripting individuals to fight for Russia.

Finally, the regime also has implications for the treatment of individuals who voluntarily join foreign forces. Under international law, service in foreign forces is not prohibited, but it is also not protected. As a result, individuals who join foreign forces to fight a legal war—as some Russians are doing right now81Schwirtz, supra note 8.—do not violate international law. Yet, international law does not protect them from the sanctions that their states might impose on them, including criminal punishment.

The current regime suggests that the nature of the wars one is forced to fight does not matter at all or does not matter enough to make a significant normative difference. It suggests that what matters is solely the identity of those who coerce individuals to fight: one’s state or hostile states. But this is highly implausible. Suppose the CSHF prohibition is justified, in the sense that it is true that compelling service in hostile forces is wrong. That is, suppose the CSHF prohibition is a mala in se offense in international criminal law.

The present regime suggests that compelled service in hostile forces that are fighting a legal war is as bad or equally wrong as compelled service in hostile forces fighting a war of aggression. But this cannot be true. Even if we agree that coercion from a third-party is wrongful or that a third-party lacks the authority to coerce us to do something, it matters what one is coerced to do.

Suppose B, A’s neighbor, is upset that A severely mistreats his dog. B has observed that A often hits his dog, fails to feed him for several days at a time, leaves him outside chained to a wall when it is extremely cold or hot, and so on. Although B has spoken to A multiple times about the issue and has volunteered to adopt the dog, A refuses to alter his behavior or give the dog away. Eventually, B decides to take matters into her own hands. She goes to A’s house brandishing a gun and tells A that if he does not immediately release the dog to her care, she will shoot him. A, afraid for his life, relinquishes the dog.

Now suppose D, C’s neighbor, has a dog-fighting ring. D has observed that C owns a small dog that would be the perfect bait in dog fights. D has spoken to C repeatedly and has offered increasingly higher amounts of money to C so that she sells him the dog. But C does not wish to sell her dog, no matter how much money she is offered. Eventually, D, upset by C’s multiple rejections and her love for the dog, decides to proceed anyway. He shows up at C’s house brandishing a gun and tells her that she must release the dog to his care or give him $6,000 so he can get a similar dog for his fighting ring. C does not want to relinquish her dog and knows that sustaining or contributing to dog-fighting is morally wrong. However, afraid for her life, she gives D the money.

In both examples, B and D have violently coerced their neighbors to do something they did not wish to do, and for that reason, we might say that they have acted wrongly.82Note that if we think that in both cases coercion is wrong, this will put some pressure on the permissibility of conscription generally. I will come back to this in Part III. But it would be absurd to say that B’s and D’s behavior is, all things considered, equally wrong. B has forced A to do what was morally right, that is, what he should have done anyway: give the dog away. By contrast, D has forced C to commit a wrongful act: give money to a dog-fighting ring. Even if we think that both have acted wrongly in coercing their neighbors to do something, we would say that D’s behavior is, all things considered, worse, morally speaking, than B’s behavior. From the viewpoint of the coerced neighbors, we would also say that C’s situation is worse than B’s: unlike B, who was forced to do what he had a duty to do, C was forced to do something that she not only lacked a duty to do, but was also in fact prohibited from doing (or had a weighty reason not to do).

The same applies to the CSHF prohibition. One could perhaps respond that once protected persons are being compelled into service in hostile forces, the wrongfulness of the act is so high that it is irrelevant whether individuals are compelled to fight legal or illegal wars—that there is no sense in which either of the two is significantly or relevantly morally worse than the other. However, this position is too quick. It might be that there is no point in legally regulating them differently, but, in the example above, it seems that the fact that C has been forced to do something morally wrong is an additional aspect of the crime. It is not normatively insignificant.

It is thus not true that being coerced to fight in an illegal war is equally bad as being coerced to fight in a legal war. This is so because international law itself draws a clear and normatively significant distinction between legal and illegal wars. A legal war is a war fought to uphold the international order. By contrast, an unlawful war or a war of aggression is precisely the opposite.

In fact, aggression is considered by some to be the “crime of all crimes.” It is not only prohibited by the UN Charter, but also constitutes an international crime that entails the individual responsibility of those who plan it and conduct it.83U.N. Review Conference of the Rome Statute of the International Criminal Court, Adoption of Amendments on the Crime of Aggression, U.N. Doc. RC/Res. 6, Annex I (June 11, 2010), https://treaties.un.org/doc/source/docs/RC-Res.6-ENG.pdf [https://perma.cc/U9VF-YKW8]. Accordingly, several international legal scholars have developed a view that explains why we have criminalized aggression and why aggressive war is wrong. Mégret, and later, Mégret and Redaelli, have defended a human rights characterization of the crime: aggressive war constitutes a massive violation of the human rights of citizens of both the victim state and the aggressor state, including combatants.84Frédéric Mégret & Chiara Redaelli, The Crime of Aggression as a Violation of the Rights of One’s Own Population, 9 J. on Use Force & Int’l L. 99, 110–13 (2022); Frédéric Mégret, What Is the Specific Evil of Aggression?, in The Crime of Aggression: A Commentary 1398, 1440–41 (Claus Kreß & Stefan Barriga eds., 2017). Dannenbaum has argued something similar. He contends that international law’s “criminalization of aggression is not just a formal prohibition, but also an expression of aggression’s wrongfulness from the international legal point of view.”85Tom Dannenbaum, The Crime of Aggression, Humanity, and the Soldier 2 (2018). The core criminal wrong of the crime of aggression is, according to Dannenbaum, the unjustified killing and human violence it entails86Id. at 265.: an aggressive war results in unjustified displacement, killing, and harming of thousands of individuals. Finally, more recently, Saira Mohamed has pointed out that international law has no language to name the wrong that is committed by a state such as Russia, which fights an aggressive war by relying on conscription.87Saira Mohamed, We Want You: Conscription and the Law in Russia’s War of Aggression, 37 Berlin J. 52, 52 (2023–24). She argues that in such circumstance, individuals do not have a duty to fight for their state and, in fact, should be protected against doing so.88Id. at 54.

Given what makes aggression wrong—unjustified violence against countless individuals—it is impossible to argue that the distinction between legal and illegal wars lacks relevance in the context of conscription.

Perhaps one could make the following argument in defense of the international regime’s failure to consider the nature of the wars that individuals are coerced to fight: whether a war is legal or illegal is not what matters. What matters is whether a war is morally justified—whether a war is just.

Just war theory has long distinguished between just and unjust wars, arguing that the first kind are justified, while the second kind are not.89See generally Ripstein, supra note 11; Jeff McMahan, Killing in War (2009); Francisci de Victoria, De Indis et de Ivre Belli Relectiones (Ernest Nvs ed., 1917); Hugo Grotius, On the Law of War and Peace (Stephen C. Neff ed., 2012). Generally speaking, a just war is one that has a just cause and meets certain requirements concerning proportionality and necessity and is fought in a just manner (for example, by distinguishing between civilians and combatants).90See generally Cécile Fabre, Cosmopolitan War (2012); Ripstein, supra note 11; Jeff McMahan, Just Cause for War, 19 Ethics & Int’l Affs. 1 (2005). Self-defense and defense of others (that is, humanitarian interventions) are widely accepted amongst contemporary just war theorists as just causes for war.91See, e.g., McMahan, supra note 90, at 46; Fabre, supra note 90, at 51–52; Ripstein, supra note 11, at 104. By contrast, an unjust war is a war that is morally prohibited. It involves inflicting morally unjustified harm and death on countless innocent individuals.

In just war theory, then, the distinction between just and unjust wars is the distinction that matters. Unjust wars are morally prohibited and involve the commission of grievous moral wrongs. By contrast, just wars are morally justified. Thus, one could argue that the distinction between legal and illegal wars is normatively irrelevant; that we should concern ourselves with the distinction, at the level of morality, between just and unjust wars. But, of course, even if this argument is correct, it cannot work as a defense of the international regime on conscription. The latter fails to distinguish at all on the basis of the character or nature of the wars that individuals are conscripted to fight. It fails to distinguish between legal and illegal wars, and it also fails to distinguish between just and unjust wars.

Further, there is some overlap between what makes a war just and what makes a war legal. This overlap is, however, not perfect.92Prieto Rudolphy, supra note 10, at 14–15. Self-defense is recognized both as a just cause for war and a legal instance of the use of force in international law.93See U.N. Charter art. 51. However, humanitarian interventions, which are widely recognized as a just cause for war, are not clearly legal uses of force under international law, unless they are authorized by the United Nations Security Council (“UNSC”).94Id. at art. 2, ¶ 7, art. 39. Additionally, under the U.N. Charter, the UNSC can authorize the use of force, and it could potentially do so in circumstances in which just cause, necessity, or proportionality are lacking. That is, a legally authorized use of force by the UNSC could be an instance of an unjust war.95Prieto Rudolphy, supra note 10, at 14–15.

The fact that the overlap between the two is imperfect cannot, of course, support the conscription regime’s lack of concern for whether the wars are legal or illegal, just or unjust. It does, however, provide reasons to modify the jus ad bellum—that is, the legal rules on resort to force—to make it more coherent with the distinction between just and unjust wars.96Id. And because the overlap between legal and just wars is not perfect, the implications of the regime on conscription as it pertains to just and unjust wars merits attention as a separate set of implications. The current regime entails that states are free, under international law, to conscript their own citizens to fight unjust wars, while compelled service in hostile forces remains a war crime, even when the war individuals are conscripted to fight is a just one.

Nonetheless, in the remainder of this Article, I will speak indistinctly of legal/just wars and illegal/unjust wars. Because this Article and the arguments focus on wars of self-defense and wars of aggression, which are examples of legal and just wars and illegal and unjust wars, respectively, it is unnecessary to keep making the distinction. All the arguments I make are applicable to both. But in those areas in which there is no overlap, and we cannot assume that a war is just merely because it is legal, the arguments I make are applicable only to the distinction between just and unjust wars.

In sum, if wars of aggression are deeply morally wrongful, the failure of the international legal regime on conscription to incorporate that distinction renders some aspects of the regime morally incoherent: they cannot be “rendered intelligible” in a moral sense.97Wasserstrom, supra note 12, at 7–8. This is so because, in at least three instances, the regime fails to criminalize or prohibit conduct that is worse than, or as bad as, compelled service in hostile forces.98Id.

First, the regime makes it so compelled service in hostile forces is equally bad regardless of whether the war one is compelled to fight is legal or illegal. Given that a war of aggression is a grave violation of the international order and a clear instance of an unjust war, it should be a worse crime to compel protected persons to fight in hostile forces in pursuit of a war of aggression than to compel them to fight in hostile forces in pursuit of a war of self-defense. It seems, as Ryan suggests, that it would be “an additional aspect of the crime” to compel service in hostile forces in a war of aggression.99Cheyney Ryan, Moral Equality, Victimhood, and the Sovereignty Symmetry Problem, in Just and Unjust Warriors: The Moral and Legal Status of Soldiers 131, 143 (David Rodin & Henry Shue eds., 2008).

Second, the same problem arises with the state’s prerogative to conscript its own citizens. If aggression is the crime of all crimes, how can it be that conscription of a state’s own citizens to fight an illegal war is allowed just as a state’s conscription of its own citizens to fight a legal war? Given that conscription is already a grave intrusion into one’s personal freedom, nearly equivalent to an obligation to kill and die for the state,100See generally Walzer, supra note 17 (discussing conscription, Walzer consistently refers to an obligation to “die” for the state). it seems that the kind of war citizens are conscripted to fight should be relevant. At the very least, conscription to fight legal wars and conscription to fight illegal wars should not be equally allowed.

This issue has been addressed by some scholars. Tom Dannenbaum and James Pattinson have argued that there should be a right to object to deployment in illegal wars.101Dannenbaum, supra note 85, at 312; James Pattison, The Legitimacy of the Military, Private Military and Security Companies, and Just War Theory, 11 Eur. J. Pol. Theory 131, 149 (2011). Dannenbaum has also argued that international law should grant refugee status to those who refuse to fight in illegal wars.102Dannenbaum, supra note 85, at 312. More recently, he has argued that states have a legal obligation to recognize the refugee status of Russian troops who flee to avoid participating in a war of aggression, including those facing conscription.103Dannenbaum, supra note 79. He claims that the unlawful nature of the war should be enough to ground refugee status for Russian citizens who desert or flee Russia in order to avoid conscription.104Tom Dannenbaum, The Legal Obligation to Recognize Russian Deserters as Refugees, Just Sec. (Mar. 2, 2022), https://www.justsecurity.org/80419/the-legal-obligation-to-recognize-russian-deserters-as-refugees [https://perma.cc/2W2U-5NZ8]. Although the crime of aggression does not entail the international criminal responsibility of mid- and low-level soldiers, aggression’s wrongfulness lies in the fact that it causes widespread death and destruction without legal justification.105Id. And I have argued in previous work that individuals should have a right to object to deployment in unjust wars, that international law should grant refugee status to those who refuse to fight in unjust wars, and that we should modify the jus ad bellum, too, so that it better conforms to the morally relevant distinction between just and unjust wars.106Prieto Rudolphy, supra note 10, at 262–68.

However, even if one might be able to defend these conclusions through a progressive interpretation of international law, as Dannenbaum does,107Dannenbaum, supra note 85, at 332. at the moment, a right not to fight in illegal wars is not recognized by international law, states remain free to conscript their own citizens to fight in wars of aggression, and refugee status has not been extended to those who refuse to fight in illegal wars. In fact, Russian citizens who have fled Russia to avoid conscription have been met with varying responses. While Canada recently granted refugee status to a Russian man who had fled his country,108Russian Man in Canada Who Received Conscription Notice to Fight in Ukraine Granted Refugee Status, Radio Can. Int’l (Jan. 18, 2023, 6:11 AM), https://ici.radio-canada.ca/rci/en/news/1949154/russian-man-in-canada-who-received-conscription-notice-to-fight-in-ukraine-granted-refugee-status [https://perma.cc/ZS63-Y3UF]. Norway has hesitated to do so;109Thomas Nilsen, Norway Hesitates on Granting Asylum for Russians Fleeing Army Draft, Barents Observer (Jan. 24, 2023), https://thebarentsobserver.com/en/borders/2023/01/norway-hesitate-asylum-russians-fleeing-army-draft [https://perma.cc/NG9S-JNRL]. Latvia, Lithuania, and Estonia have said they will not offer refuge to fleeing Russians;110Andrius Sytas, Baltic Nations Say They Will Refuse Refuge to Russians Fleeing Mobilisation, Reuters (Sept. 21, 2022, 11:06 AM), https://www.reuters.com/world/europe/latvia-says-it-wont-offer-refuge-russians-fleeing-mobilisation-2022-09-21 [https://perma.cc/GFA9-69RL]. and Poland has begun to turn away Russian citizens at the border.111Id.

Third, compare compelled service in hostile forces to fight a legal war—a war crime under IHL—with the state’s conscription of its own citizens to fight an illegal war—permitted under IHL. The current regime suggests that it is significantly worse to be compelled by hostile forces to fight a legal war than it is for the state to conscript its own citizens to fight an illegal war. But this should be, at the very least, controversial. Outside of this context, we do not think that the moral and legal status of actions people are forced to do is irrelevant, and, further, we certainly do not think that being forced to do something illegal and immoral is less bad than being forced to do something legal and morally justified, purely based on the identity of who is coercing us into doing so.

Perhaps the identity of who is coercing individuals is relevant in this context; perhaps the state is specially positioned to demand certain things from its citizens, and I will return to this in Part III. But to defend this aspect of the regime, the claim that must be defended is not only that the identity of who is coercing individuals matters, but also that it matters much more than whether individuals are being coerced to fight legal or illegal wars.

Finally, the fact that the international legal regime on conscription fails to distinguish between legal and illegal wars not only fails to capture something that is normatively significant. It is also self-defeating; that is, it is bad at achieving what international law presumably aims to achieve. If one of the goals of international law, or the jus ad bellum, is to achieve or sustain peace,112See Marcela Prieto Rudolphy, Who Is at War? On the Question of Co-Belligerency, 25 Y.B. Int’l Humanitarian L. 141, at 152 (Heike Krieger et al. eds., 2022). a regime that allows states to “generate soldiers for war-making”113Ryan, supra note 99, at 141. independent of whether they are doing so to uphold the international regime or to breach it, seems likely to generate more and longer wars than a regime that prohibited states from conscripting its citizens to fight in illegal wars.

In sum, the fact that the international regime on conscription does not pay attention to the nature of the wars that individuals are coerced to fight cannot be made sense of, morally speaking. The current regime treats equally things that are morally dissimilar in a relevant way.

International law should thus distinguish between legal and illegal wars. In the context of the state’s conscription of its own citizens, this implies that states should be prohibited from conscripting individuals to fight illegal wars or wars of aggression. In the context of compelled service in hostile forces, this implies that the illegal nature of the war one is compelled to fight should be an additional aspect of the crime.

But there is still a remaining dimension of the regime that demands justification and has not been entirely undermined by the arguments so far. Recall that justifying the present regime on conscription requires the success of two arguments. First, that the distinction between legal and illegal wars does not matter at all. This argument has failed. And second, that compelled service in hostile forces is significantly worse than compelled service by one’s state. This argument is necessary to justify the fact that compelled service in hostile forces, even to fight a legal war, is a war crime while the state’s conscription of its own citizens to fight a legal war remains the state’s prerogative. Engaging with this argument will be the task of Part III.

III.  HOSTILE FORCES AND THE STATE

In Part II, I argued that the international legal regime on conscription is morally incoherent in a particular way: it treats equally, either by permitting both or criminalizing both, things that are morally dissimilar. It matters whether the wars that individuals are coerced to fight are just or unjust, legal or illegal.

If we accept the arguments made in Part II and their implications, we should accept that there is a powerful pro tanto reason for international law to prohibit the state’s conscription of its own citizens to fight illegal wars and to make the illegal nature of the war an additional aspect of the crime of compelled service in hostile forces.

However, having accepted these implications, there is a second aspect of the regime whose moral coherence remains to be proved: whether compelled service in hostile forces is both wrong and significantly worse than compelled service in the armed forces of one’s state. This argument is necessary to render morally intelligible the fact that compelled service in hostile forces to fight legal wars is a war crime, while compelled service in the armed forces of one’s state to fight legal wars is the state’s right.

Some of the arguments I will explore will also have implications for compelled service in illegal wars. As I just noted, Part II provides a pro tanto reason for international law to prohibit compelled service in illegal wars. Although I think that Part II provides a conclusive, and not just a pro tanto, reason for international law to prohibit the state’s conscription of its own citizens to fight illegal wars, the case in favor of prohibition does not rest solely on that argument. It can also rely on some of the arguments that follow.

Nonetheless, most of the arguments in this Part attempt to explain why international law treats compelled service in hostile forces to fight legal wars differently from a state’s conscription of its own citizens to fight legal wars. Perhaps there is something particularly wrong about compelling persons who are in custody or in occupied territory to serve in hostile military forces—something wrong that is present in these circumstances but is not present when states conscript their own citizens to fight wars. If this argument exists, then we can explain at least this aspect of the international legal regime on conscription.

The complexity of making moral sense of this aspect of the regime is increased by the fact that compelled service in hostile forces is not merely prohibited by IHL, but is a war crime, that is, a crime that entails individual, and not just state, responsibility. Whatever justification is provided, it must be able to account not only for the CSHF prohibition itself, but also for its criminalization as part of international criminal law. However, there is no unitary theory of war crimes nor a satisfactory definition of them.

War crimes belong to the general category of international crimes. International crimes are one of the few areas of international law in which duties are imposed directly on individuals who might be personally responsible for their conduct. Not every human rights violation is an international crime. Louise Arbour, former UN High Commissioner of Human Rights, explains the distinction in this way: “Human rights law violations are actions and omissions that interfere with the birthright of all human beings—their fundamental freedoms, entitlements and human dignity. Humanitarian crimes are, in essence, crimes that are so heinous that they shock the human conscience.”114Judith Blau & Alberto Moncada, It Ought to Be a Crime: Criminalizing Human Rights Violations, 22 Socio. F. 364, 365–66 (2007) (footnote omitted).

The idea that international crimes are a particular category of very heinous acts, one that shocks the conscience of humankind, is quite common. Some authors, however, have offered a different theoretical account of international crimes.

David Luban, for example, has provided a theory of crimes against humanity, understanding them as an assault on a particular aspect of human beings, namely, our character as political animals.115David Luban, A Theory of Crimes Against Humanity, 29 Yale J. Int’l L. 85, 90 (2004). Larry May, as we will see below,116See infra notes 161–67 and accompanying text. has developed a theory about war crimes that relies on notions of honor and the vulnerability of certain individuals during war.117See generally Larry May, War Crimes and Just War (2007). As I will argue in Section III.D, May’s theory can explain why compelled service in hostile forces is often worse than the state’s conscription of its own citizens, but it cannot explain the permissibility of conscription.

There is, however, significant uncertainty and confusion about the definition of “war crimes,” and the term is often used in various and sometimes contradictory ways.118Oona A. Hathaway, Paul K. Strauch, Beatrice A. Walton & Zoe A.Y. Weinberg, What Is a War Crime?, 44 Yale J. Int’l L. 53, 68 (2019). This is probably partly explained by the fact that the concept of grave breaches, and the idea of a war crime itself, is a body of law “whose normative provisions were drawn from different, and somewhat inconsistent, treaty provisions.”119Schabas, supra note 28, at 233.

The most common definition of war crimes is “violations of the laws of war that incur individual criminal responsibility.”120Charles Garraway, War Crimes, in Perspectives on the ICRC Study on Customary International Humanitarian Law 377, 377 (Elizabeth Wilmshurst & Susan Breau eds., 2007). The international criminal tribunals, as well as the ICC, have coalesced around three basic elements of war crimes: (1) an armed conflict; (2) a nexus between the acts of the accused and the armed conflict; and (3) knowledge of the armed conflict.121Schabas, supra note 28, at 227–40.

Other accounts of war crimes, like that of Hathaway, Strauch, Walton, and Weinberg, require that the violation of the laws of war is also serious.122Hathaway et al., supra note 118, at 55. It is generally assumed that all grave breaches of the Geneva Conventions are serious, but the seriousness of other violations might depend on the nature of the infraction itself and, perhaps, the manner in which the prohibition is broken.123Garraway, supra note 120, at 385. This is somewhat ambiguous, both in the statute of the ICTY and in the Rome Statute, in which the Elements of Crimes introduce elements to Article 8 that require the act to be sufficiently serious to justify international condemnation.124Id. (citing Rome Statute, supra note 5, at art. 8(2)(b)(vii), reprinted in The International Criminal Court, Elements of Crimes and Rules of Procedure and Evidence 754 (Roy S.K. Lee ed., 2001)). There might also be breaches of the laws of armed conflict that do not qualify as serious and meriting international condemnation, but that should be the subject of domestic proceedings.125Id. at 386.

The most accepted definition of war crimes (violations of IHL that entail individual criminal responsibility) fails to offer a theory of criminalization. That is, it fails to guide criminal tribunals and other organs in determining what a war crime is and lacks a deep underlying justification.126Hathaway et al., supra note 118, at 54. It is not clear at all that this is what the definition is trying to do—perhaps it aims simply to give a positivist account of war crimes. But the truth is that we do not agree on a theory of criminalization, both at the domestic level127See Nicola Lacey, Historicising Criminalisation: Conceptual and Empirical Issues, 72 Mod. L. Rev. 936, 941 (2009). For some theories of criminalization, see, e.g., Victor Tadros, The Ends of Harm: The Moral Foundations of Criminal Law (2011); Antony Duff, The Realm of Criminal Law (2018); The Boundaries of the Criminal Law (R.A. Duff et al. eds., 2010). and at the international one. The purpose of International Criminal Law itself is also deeply contested.128See generally Immi Tallgren, The Sensibility and Sense of International Criminal Law, 13 Eur. J. Int’l L. 561 (2002); Bill Wringe, Why Punish War Crimes? Victor’s Justice and Expressive Justifications of Punishment, 25 L. & Phil. 159 (2006); David Tolbert & Marcela Prieto Rudolphy, Transitional Justice in the 21st Century: History, Effectiveness, and Challenges, in The Oxford Handbook on Atrocity Crimes 581 (Barbora Holá et al. eds., 2022). This poses some difficulties in terms of the arguments I will offer. It might be plausible to conclude that some things should be forbidden by international law, but something else is presumably required in order for a behavior to constitute a war crime. I will come back to this at the end, but I cannot do anything but leave the issue somewhat open. To do otherwise would require developing a theory of criminalization in the context of IHL.

For now, the main challenge is to explain why compelled service in hostile forces might be morally worse than the state’s conscription of its own citizens. I will address several different but somewhat related arguments: (a) compelled service in hostile forces is irrational; (b) the CSHF prohibition aims to incentivize surrender and disincentivize longer wars; (c) citizens feel loyalty toward their own states and that loyalty should be respected; (d) protected persons are in a particularly vulnerable situation vis-à-vis their captors, which makes it morally worse to compel them into service; and (e) unlike foreigners, citizens have duties toward their own states to fight legal wars.

A.  Compelled Service in Hostile Forces Is Irrational

One possible argument why compelled service in hostile forces is significantly different from the state’s conscription of its own citizens is that individuals forced to fight for hostile or enemy forces would likely defect or surrender as soon as possible and, generally speaking, would make for bad fighters.

This might be true in certain circumstances. Some individuals feel strong loyalty toward their states and armies and if compelled to fight might in fact decide to do so badly. However, this difference between compelled service in hostile forces and the state’s conscription of its own citizens cannot explain why compelled service in hostile forces is a war crime. It only shows why it is a bad idea for any given state to compel protected individuals to fight in hostile forces—that is, it shows why it is likely irrational to do so. But mere irrationality does not provide a conclusive argument in favor of the international criminalization of compelled service in hostile forces. There are plenty of things that states do that might be a bad idea, but bad ideas are not enough to “shock the conscience”129See supra note 114 and accompanying text. of humankind.

Further, conscripted soldiers fighting for their own state might also be bad fighters, particularly in comparison to professional armies. They might be reluctant to kill or engage in combat or lack relevant training. Thus, if the CSHF prohibition was justified because protected persons are likely to make bad fighters, conscription would be put into question for similar reasons. But the fact that individuals might make bad fighters is, in any case, not enough to explain why compelling individuals to fight is wrong or should be prohibited. Again, it only shows why it is a bad idea.

One might object at this stage that whether compelled protected persons make good fighters is irrelevant. What the CSHF prohibition aims to achieve is to provide incentives to surrender. Let us examine this argument.

B.  POWs and Incentives to Surrender

Privileged combatants can become POWs and are consequently entitled to certain rights and protections.130See generally Geneva Convention (III), supra note 5. POWs are those who have fallen into the power of the enemy and belong to one of the categories specified in Article 4 of Geneva Convention III.131Id. at arts. 4, 13, 42. Hors de combat, that is, persons who are in the power of an adverse party, express a clear intention to surrender, and have been rendered unconscious or otherwise incapacitated by wounds or sickness,132Protocol Additional to the Geneva Conventions of 12 August 1949, and Relating to the Protection of Victims of International Armed Conflicts (Protocol 1) art. 41, June 8, 1977, 1125 U.N.T.S. 3 [hereinafter API]. are also entitled to certain protections against mistreatment.133See generally Geneva Convention (III), supra note 5; Geneva Convention (II) for the Amelioration of the Condition of Wounded, Sick and Shipwrecked Members of Armed Forces at Sea, Aug. 12, 1949, 75 U.N.T.S. 85 [hereinafter Geneva Convention (II)]. Among those protections is the prohibition against compelled service in hostile forces.134See supra Section 1.A.

One might thus argue that, if combatants knew that after surrendering or becoming wounded or incapacitated they might be compelled by the adverse party to serve in its armed forces, they would be less likely to surrender and more likely to fight until the very end. If everyone did that, this might, in turn, make wars longer.

The CSHF prohibition then is concerned with providing individuals with incentives to surrender. In doing so, it aims to make wars shorter, and perhaps in making wars shorter, it reduces suffering. Of course, this rationale would not explain why the CSHF prohibition also applies to those in occupied territory; it is limited to POWs.

The idea that the goal of IHL is to reduce suffering (to make wars more humane) is familiar and is often referred to as “the humanitarian view.”135See Prieto Rudolphy, supra note 112, at 147–48; Haque, supra note 11, at 38. If this is right, and the incentives work in this way, then we might be able to explain why it makes sense to provide individuals with certain protections when they surrender, such as the prohibition on compelled service in hostile forces.

The humanitarian view has been the object of a variety of objections, both as a justification of the legal equality of combatants136See, e.g., Prieto Rudolphy, supra note 10, at 61–74; Haque, supra note 11, at 38–43. and as a justification of the law on co-belligerency.137See Prieto Rudolphy, supra note 112, at 149–53. In the context of the international legal regime on conscription, the humanitarian view can explain why the CSHF prohibition is a reasonable rule to have, but it cannot explain why compelled service in hostile forces is worse than the state’s conscription of its own citizens. In fact, if the state’s conscription of its own citizens is likely to make wars longer or to increase suffering, there would be at least a pro tanto reason to prohibit the practice.

Note, as well, that the humanitarian account is also likely to provide an additional reason for prohibiting states’ conscription in illegal wars. If we rely on accounts of what people have incentives to do, we should, presumably, reduce incentives to wage and participate in illegal wars.

C.  Loyalty

A third argument that might explain why there is something different between compelled service in hostile forces and the state’s conscription of its own citizens is that the former violates the ties of loyalty that bind individuals to their own states, while conscription by one’s own state does not. Compelled service in hostile forces entails fighting against one’s own state and betraying one’s loyalty to it.

The importance of loyalty to one’s state is not unheard of in the context of war. Levinson, for example, discusses the case of Ernst von Weizsaecker, State Secretary of the German Foreign Ministry.138Sanford Levinson, Responsibility for Crimes of War, 2 Phil. & Pub. Affs. 244, 260–65 (1973). Von Weizsaecker had internally opposed the war, but the issue of why he did not inform the Russian ambassador of Hitler’s plans against Russia in 1941 came up during the Nuremberg trials.139Id. at 262–63. If he had done so, he would have put himself in great danger, would not have changed Hitler’s policy, and would have led to greater German losses:

The prosecution insists, however, that there is criminality in his assertion that he did not desire the defeat of his own country. The answer is: Who does? One may quarrel with, and oppose to the point of violence and assassination, a tyrant whose programs mean the ruin of one’s country. But the time has not yet arrived when any man would view with satisfaction the ruin of his own people and the loss of its young manhood. To apply any other standard of conduct is to set up a test that has never yet been suggested as proper, and which, assuredly, we are not prepared to accept as either wise or good.140Id. at 263 (citation omitted).

This argument about loyalty was not just an idiosyncratic belief of the Nuremberg judges. The ICRC database on customary IHL states that the reasoning behind the CSHF rule is “the distressing and dishonourable nature of making persons participate in military operations against their own country—whether or not they are remunerated.”141ICRC Rule 95, supra note 40, at 334. Bothe also finds the rationale of the rule in avoiding “bringing a detained person or a person in occupied territory into an unbearable loyalty conflict.”142Michael Bothe, War Crimes, in The Rome Statute of the International Criminal Court: A Commentary 379, 394 (Antonio Cassese et al. eds., 2002). And the ICTY, in interpreting the concept of “civilians” for the purposes of the Fourth Geneva Convention, relied on the idea of “genuine bonds of loyalty and allegiance,” dismissing, however, the requirement of nationality.143Schabas, supra note 28, at 247. In the ICTY, the issue as to who counted as civilians for the purposes of the Fourth Geneva Convention arose because the latter refers to those who find themselves “in the hands of a Party to the conflict or Occupying Power of which they are not nationals.”144Id. This was pressing in the case of Bosnian Muslims, who could not be considered protected persons under the Convention because their persecutors were also of Bosnian nationality.145Id.

In the philosophical literature, arguments from loyalty have been discussed in the context of the crimes of treason and espionage.146See generally Youngjae Lee, Punishing Disloyalty? Treason, Espionage, and the Transgression of Political Boundaries, 31 L. & Phil. 299 (2012); Cécile Fabre, The Morality of Treason, 39 L. & Phil. 427 (2020).

Arguments that rely on loyalty have, however, limited purchase. Loyalty can be understood as the “practical disposition to persist in an intrinsically valued (though not necessarily valuable) associational attachment, where that involves a potentially costly commitment to secure or at least not to jeopardize the interests or well-being of the object of loyalty.”147John Kleinig, Loyalty, Stan. Encyc. Phil. (Mar. 22, 2022), https://plato.stanford.edu/archives/sum2022/entries/loyalty [https://perma.cc/ZA55-3PBP]. In institutional settings, “loyalty can simply take the form of commitment and willingness not to undermine institutions which do well by us and, thereby, not to harm our fellow community members.”148Fabre, supra note 146, at 442.

One of the issues with a loyalty-based argument is that loyalty can often be morally problematic. This is so because individuals might experience loyalty toward all sorts of projects and associations, some of which will have no value at all, or even negative value. This would be the case with individuals who experience loyalty toward, say, racist associations, criminal organizations, and so on.

Perhaps loyalty-based arguments can succeed if we can argue that loyalty toward one’s own state (or nation) is valuable to the point of requiring protection from international law. Yet, although some states perform valuable functions, it might be better if individuals felt stronger loyalty toward other fellow human beings, rather than to their own state. This kind of loyalty also seems more consistent with some of international law’s cosmopolitan aspirations.149See generally Roland Pierik & Wouter Werner, Cosmopolitanism in Context: Perspectives from International Law and Political Theory (2010) (exploring the ideal of cosmopolitanism and its strained relationship with existing international legal institutions). And relying on this account of loyalty might provide an additional argument as to why compelled service in hostile forces to fight illegal wars should be a war crime. When states do compel such service, they are asking individuals to engage in conduct that is disloyal to states that are engaged in legal wars and to the international legal system.

In any case, even if we concede that betraying one’s loyalty to one’s state is presumptively wrongful, that presumption can be overridden by other considerations—mainly, whether one’s state is engaged in violating the fundamental rights of others.150Fabre, supra note 146, at 444–45. Fabre, in fact, argues that under certain circumstances, individuals are not only morally permitted but might also have a pro tanto duty to commit informational treason. Id. Yet, this is precisely what states engaged in aggressive wars are doing. Recall that we are trying to answer why compelled service in hostile forces to fight a legal war is both wrong and significantly worse than the state’s conscription of its own citizens to fight a legal war. This would require arguing that an individual’s loyalty toward a state fighting a war of aggression should be protected by making compelled service in hostile forces a war crime. However, that individual’s loyalty is surely misguided from the viewpoint of the international legal system itself, and it is, effectively, loyalty toward an actor engaged in serious legal (and moral) wrongdoing.

Perhaps we can argue that the objective value of an individual’s loyalty is irrelevant. It only matters whether the individual in fact experiences ties of loyalty to their own state. If they do, that loyalty should be protected. But this argument is both over and underinclusive. It is overinclusive because it would require international law to protect individuals’ loyalty toward their own state in a wider range of circumstances, extending far beyond service in hostile forces. And it is underinclusive because it would require international law not to concern itself with instances in which that loyalty is not present. That is, if particular individuals felt no special ties of loyalty toward their own state, or if they had no state, the CSHF prohibition could not be justified in their case; states would do nothing morally wrong if they chose to compel those foreigners to serve in their armed forces. This seems to miss something important.151A variation of the loyalty argument might rely on commitment, whereby individuals might have voluntarily committed to fulfill duties of conscription to their own state. The argument runs into similar problems as loyalty-based ones when applied to conscription. On the obligation to obey the law and commitments, see generally Felipe Jiménez, Legality, Legal Obligation, and Commitment (n.d.) (unpublished manuscript), https://law.ucla.edu/sites/default/files/PDFs/Law_and_Philosophy/Felipe_Jimenez-Legality_Legal_Obligation_and_Commitment.pdf [https://perma.cc/B5UF-3QHT].

It might be the case that many individuals do feel strong ties of loyalty to their own states, even if they are misguided in doing so. This might counsel for excusing their behavior in certain circumstances. But it is difficult to argue that international law should encourage that loyalty when it is misguided by making compelled service in hostile forces a war crime.

This is not to say that loyalty is completely irrelevant in all circumstances. Individuals, for example, might be loyal to members of their own family or their loved ones, or they might have special duties to protect them from harm.152See generally Seth Lazar, Associative Duties and the Ethics of Killing in War, 1 J. Prac. Ethics 3 (2013) (discussing the limited relevance of associative duties of protection in the context of killing in war). It is understandable, from that perspective, why forcing someone to harm their own loved ones would be worse than forcing them to harm a stranger. Since fighting against one’s own country might involve harming one’s loved ones, it is plausible to argue that, in those cases, compelled service in hostile forces is worse than compelled service in the armed forces of one’s state; thus, it might be wrongful, under certain circumstances, for states to compel individuals to harm their loved ones.

However, ties of loyalty and duties of protection in close interpersonal relationships are insufficient to fully account for the CSHF prohibition due to two reasons.

First, this is not the kind of relationship that exists between states and their own citizens, nor is it the kind of relationship that exists between citizens themselves.153For a similar argument in the context of the difference between loyalty toward the state and toward those with whom we have close interpersonal relationships, see Lee, supra note 146, at 310–12. Further, interpersonal relationships and ties of loyalty do not necessarily overlap with citizenship in a given state—individuals might have family and loved ones in other countries, and, in the context of civil conflict, in which states can coerce their own citizens to fight, ties of loyalty to the state will be conflicted and, sometimes, nonexistent. Finally, even if we grant that individuals can draw deep personal meaning from their association to their state, their nation, their community, and so on, the strength of those ties pales in comparison to those in close interpersonal relationships.

At this point, one might object that it is unclear why interpersonal relationships should be so different from the kind of relationships that exists between individuals and their communities (however expansively we define the latter). After all, people draw profound meaning from their membership in those communities and often feel bound to act in certain ways on account of that membership. I find this kind of argument unpersuasive, partly because I am committed to a more cosmopolitan view of our moral obligations and the sources that give meaning to our lives. Nonetheless, even those who reject strong versions of cosmopolitanism should be persuaded by the second reason why ties of loyalty and associative duties are insufficient to account for the CSHF prohibition.

This is so because even the ties of loyalty and duties of protection that arise in interpersonal relationships are not impervious to the impartial demands of morality. That is, individuals are not morally allowed to do anything they can to protect their loved ones (or the members of their communities) from harm nor are they allowed to be partial to the interests of their loved ones in all possible circumstances. In the context of the CSHF prohibition, it can hardly be the case that associative duties and ties of loyalty toward one’s own state and its institutions—which seem categorically different or, at the very least, weaker, from those present in interpersonal relationships—could override moral obligations not to wrongfully harm others. Or, put differently, it cannot be the case that it is morally wrong for individuals to fight a war of self-defense against their own states. If hostile forces are prohibited from coercing individuals into doing so—as I think they are—it cannot be merely because doing so involves making them breach duties of loyalty or protection toward their own states. Those duties will most often be inexistent or overridden by duties not to wrongfully harm others (as individuals often do when fighting in an illegal war).

In sum, while we can acknowledge that individuals often feel loyalty toward their own states and they also have duties of protection toward their loved ones, this argument only explains why fighting against one’s own state can be worse, from an agent-relative perspective, than fighting against other states. But alone, outside of limited circumstances in which fighting for hostile forces might involve breaching duties of protection toward one’s loved ones, it cannot explain why compelled service in hostile forces is morally prohibited.

D.  Mistreatment and Vulnerability

A different way in which we can argue that there is a significant difference between the state’s conscription of its own citizens and compelled service in hostile forces relies on the idea that POWs and persons in occupied territory are in a particularly vulnerable position. This argument is, of course, underinclusive—it can only explain what is wrong with compelled service in the case of POWs and those in occupied territories, but it cannot explain what is wrong with compelling “the nationals of the hostile party to take part in the operations of war directed against their own country.”154Rome Statute, supra note 5, at art. 8(2)(b)(xv).

A vulnerability-based argument has two dimensions. One is more obviously concerned with the incentives at play that explain why POWs or individuals in occupied territory are more likely to be treated in objectionable ways. The second one focuses on what is particularly wrong with harming or coercing vulnerable individuals. Both dimensions of the argument are related. The first one aims to explain why the CSHF prohibition is required to protect certain individuals who are likely to be the victims of objectionable treatment. The second one aims to explain why certain forms of treatment are objectionable.

The first part of the argument worries, then, about the CSHF prohibition’s role in protecting individuals in contexts in which states might have incentives to treat them in objectionable ways. We might say, for example, that if states were allowed to conscript protected persons to fight, they would likely use them in cruel ways, as cannon fodder or diversions, in missions that have little chance of succeeding, and so forth.

Incentives are likely to operate in this way. In fact, a study by Valentino, Huth, and Croco shows that highly democratic states face great pressure to reduce the human costs of war and are thus more likely than other states to employ strategies that minimize military casualties.155See generally Benjamin A. Valentino, Paul K. Huth & Sarah E. Croco, Bear Any Burden? How Democracies Minimize the Costs of War, 72 J. Pol. 528 (2010). States—namely, democratic states—might thus have incentives to protect their own citizens and avoid high casualties, but they are less likely to have similar incentives regarding foreigners, to whom they are not politically accountable. And we have powerful reasons to impede states from employing individuals in these objectionable ways.

However, this argument alone, although plausible, cannot meaningfully distinguish between the state’s conscription of its own citizens and compelled service in hostile forces. Simply as a matter of history, the CSHF prohibition was prior to any human rights standards that might have governed how states treated their own soldiers and conscripts, which only developed after World War II.156Peter Rowe, The Impact of Human Rights Law on Armed Forces 5–6 (2006).

Going beyond this historical point, the argument about incentives cannot explain why compelled service in hostile forces is meaningfully different from the state’s conscription of its own citizens. It might be true that states often have fewer incentives to use their own citizens as cannon fodder, send them into missions certain to fail, and so on. But there are a range of circumstances in which states will, in fact, have few incentives to protect their own citizens. For example, authoritarian states are less responsive to public opinion, and as a result might be more willing to employ their own citizens in these ways. States might also employ their own citizens as cannon fodder as a war tactic in a desperate attempt to avoid defeat, particularly in the case of citizens from oppressed minority groups.

Thus, although this argument can provide a good reason for having the CSHF prohibition, it also provides good reasons for having a prohibition on conscription of the state’s own citizens. If the CSHF prohibition is justified because incentives are likely to make states use individuals in objectionable ways during combat, there would be a reason to enact a similar prohibition in all contexts in which the incentives to use individuals in those objectionable ways exist or to directly prohibit those objectionable ways themselves. Yet, no such prohibitions exist.

Although some standards of treatment have been imposed regarding soldiers and conscripts,157Chmykh et al., supra note 73, at 14–15. there is no restriction on states sending out their own citizens into difficult or impossible missions, nor any restrictions on using them as cannon fodder.158See Saira Mohamed, Cannon Fodder, or a Soldier’s Right to Life, 95 S. Cal. L. Rev. 1037, 1080–82 (2022) (noting the neglect of the human rights community regarding service members’ rights and the need to “humaniz[e] actors too often deemed instruments”). The standard of treatment recognized by the European Court of Human Rights, for example, demands that military service is performed in “conditions compatible with respect for human dignity” and that do “not impose distress or suffering of an intensity exceeding the unavoidable level of hardship inherent in military discipline” and service,159Chmykh et al., supra note 73, at 14–15. thus leaving wide discretion to states in terms of military strategies and planning. In the context of IHL, the debate concerning states’ obligations to their own soldiers seems also currently limited to the state’s obligation to tend to the wounded, which extends to the state’s own soldiers.160Saira Mohamed, Abuse by Authority: The Hidden Harm of Illegal Orders, 107 Iowa L. Rev. 2183, 2189 (2022). But see generally Cóman Kenny & Yvonne McDermott, The Expanding Protection of Members of a Party’s Own Armed Forces Under International Criminal Law, 68 Int’l & Comp. L.Q. 943 (2019) (discussing recent developments in international criminal law).

The argument so far, although insufficient to render the regime coherent, does give a reason in favor of the CSHF prohibition: compelled service in hostile forces might be one instance in which incentives to treat individuals in certain objectionable ways are very often present. Here is where the second dimension of the argument enters the picture. I have just pointed out that states are likely to have incentives to treat foreigners in objectionable ways when they send them into combat. The second dimension of the argument explains, precisely, why it is worse to treat POWs and persons in occupied territory in certain ways, and it relies on the notion of vulnerability, as understood by Larry May and Seth Lazar.

Larry May has developed an account of war crimes that relies on duties of humane treatment, in which war crimes are understood as “crimes against humaneness.”161May, supra note 117, at 1–2. They are a violation of the principle that requires soldiers to act humanely, with mercy and compassion.162Id. This is a difficult task, May acknowledges, because while soldiers might be required to act humanely toward some, they are still allowed to—or are not prohibited from—killing enemy combatants.163Id. Ultimately, for May, the rules of war are grounded in notions of honor and mercy, as well as the protection of the vulnerable.164Id. at 6.

Regarding confined soldiers (POWs and hors de combat) and prohibitions on their mistreatment, May argues that humane treatment becomes paramount: POWs are confined by one party, and that party has “every reason to want to exert vengeance or retribution on those who have been killing members of one’s armed forces.”165Id. at 142–43.

It is the asymmetry in power between confined prisoners and the party confining them that partly motivates May’s account. The laws of war, he argues, should counteract the strong possibility of abuse perpetrated by those who have weapons against those who do not.166Id. at 145. POWs are in a “special moral situation because they are utterly dependent on their captors and are vulnerable in ways that soldiers on the battlefield are not.”167Id. Walzer makes a similar point: “Just beyond the state there is a kind of limbo, a strange world this side of the hell of war, whose members are deprived of the relative security of political or social membership.”168Michael Walzer, Prisoners of War: Does the Fight Continue After the Battle?, 63 Am. Pol. Sci. Rev. 777, 777 (1969).

This concern about vulnerability, understood as defenselessness or the inability to diminish one’s vulnerability to a threat, is echoed by Seth Lazar in defending the prohibition against deliberately targeting civilians.169Seth Lazar, Sparing Civilians 102–22 (2015). Lazar argues that harming those who are vulnerable or defenseless is particularly bad or worse than harming the nonvulnerable because doing so is exploitative, risky, breaches a duty to protect the especially vulnerable, dominates and disempowers them, and generates unfair distributions of risk on the innocent.170Id. at 113.

Vulnerability is also the object of special protection in domestic criminal codes, in which it can operate as an aggravating circumstance in certain crimes such as homicide, or as giving rise to certain duties of protection toward, say, children.171See, e.g., Código Penal [Cód. Pen.] [Criminal Code] art. 12, nos. 2, 6, 12 (Chile); Strafgesetzbuch [StGB] [Penal Code], § 174 (Ger.), https://www.gesetze-im-internet.de/englisch_stgb/englisch_stgb.html [https://perma.cc/RL2M-MXG5]; Código Penal [C.P.] [Criminal Code] art. 22, nos. 1–2 (Spain); Cal. Penal Code § 273a (West 2024).

Following this line of argument, one might then reason that the vulnerability of POWs and those in occupied territories is what explains the CSHF prohibition. It does so because prisoners and persons in occupied territories are in vulnerable positions such that states have incentives to treat them in objectionable ways, and harming those who are vulnerable is morally wrong and morally worse than harming the nonvulnerable.

It is certainly true that both occupation and custody create a situation of vulnerability. In that sense, it is understandable why states might have duties to provide and care for individuals in custody and under occupation. And it is quite plausible that it is morally worse to harm the vulnerable than the nonvulnerable. If so, then we can explain why compelled service in hostile forces is morally worse than the state’s conscription of its own citizens: to the extent that POWs, hors de combat, and those in occupied territories are vulnerable and defenseless relative to citizens of the state, and to the extent that compelling them into service will entail treating them in objectionable ways, it is worse to compel the former into service than to compel citizens.

This argument, however, faces some difficulties. Recall that we are trying to answer why compelled service in hostile forces to fight a legal war is morally worse than compelled service by one’s own state to do the same. So far, the argument has established that harming those who are vulnerable is worse than harming those who are not, and that states might have incentives to treat POWs and those in occupied territories in objectionable ways (that is, states might have incentives to harm them). There are several problems with this.

First, as mentioned before, states can have similar incentives regarding their own citizens, and yet they are not prohibited from treating their own citizens in such ways. The argument is, in that sense, overinclusive.

Second, the argument on its own says very little about whether conscription is morally prohibited or permissible. This is so because the argument explains why compelled service in hostile forces is worse than the state’s conscription of its own citizens. Whether compelled service in hostile forces is morally wrong depends on whether it harms those compelled to fight. I have suggested that it does so when they are used in certain objectionable ways, or that it might do so when it forces them to breach associative duties toward their loved ones (under certain circumstances), but I have not argued that it does so in every instance. That is, I have not yet provided an argument as to whether coercion to fight legal wars on behalf of hostile forces always harms those forced to fight.

Put simply, the fact that harming someone who is vulnerable is worse than harming someone who is not does not say anything about whether the latter is morally prohibited or permitted. It only implies that one is worse than the other. That is, the fact that harming a defenseless child is worse than harming an adult does not prove, on its own, that harming the adult is morally permissible. That depends on whether the harm itself is justified.

Applied to the CSHF prohibition, one can argue that it is worse to compel protected persons into service than it is to compel one’s own citizens. That might be the case. But alone, this cannot answer whether compelled service in military forces is allowed in the first place.

Third, the argument that relies on vulnerability also requires that vulnerability is applied in a particular way, so that only those who have fallen under the power of an adverse party to the conflict and those that are in occupied territory are understood as vulnerable. However, not all POWs remain in physical custody during war, and citizens in their own states are also quite vulnerable to coercion on the state’s part (think, for example, of those who are serving criminal sentences). Further, refusing conscription might lead to criminal punishment, and the state has a wide arsenal of enforcement mechanisms at its disposal. If it is vulnerability that is driving the CSHF prohibition, then one might also find vulnerability in the state’s context when compelling individuals to serve in its own forces.

Still, we can argue that it will often be the case that POWs and those in occupied territories will be in a highly vulnerable position, and that states are likely to have incentives to use them in objectionable ways during combat, and so the CSHF prohibition is predicated on that likelihood. In those cases, we can explain why compelled service in hostile forces is wrong, and why it might be morally worse than the state’s conscription of its own citizens. But vulnerability itself might be present in the state’s own citizens. More importantly, this vulnerability-based argument does not say why compulsion to fight a legal war is always equivalent to unjustifiably harming individuals.

E.  Citizens and Duties Toward the State

Is coercion to fight a legal war equivalent to unjustified harm? Or is it coercion to do what one already has a moral duty to do? Does it matter whether it is one’s state who coerces one to fight?

What is wrong (or right) about compelled service in legal wars cannot be coercion itself. It would prove too much: it would immediately make the state’s conscription of its own citizens morally suspect. It also makes coercion—which is a key feature of the state—generally morally wrong. This argument should thus be discarded. Coercion might always require justification, but what makes coercion justified responds to other facts, such as what one is coerced to do, or who has authority to coerce other individuals.

Perhaps the difference between compelled service in hostile forces and the state’s conscription of its own citizens is that citizens, unlike noncitizens, owe certain duties to their own states, and among those duties, there is the duty to fight on behalf of one’s state. If successful, this argument can explain why international law permits states to conscript their own citizens and why compelled service in hostile forces is a war crime. The first is permitted because citizens already have enforceable duties to fight that are owed to the state. The second is prohibited because noncitizens do not owe such duties to other states, and it is wrong for those states (or hostile forces) to coerce individuals to fight when those individuals lack any relevant duties to do so. Forcing them to fight would constitute unjustified harm, in a sense, and it is even worse to harm those who are vulnerable or defenseless.

At this stage, one might argue that, in the case of compelled service in hostile forces, the issue of citizens’ duties toward their own states arises twice. First, because those who are compelled to serve in hostile forces have a duty toward their own states that compelled service in hostile forces causes them to breach. And second, because those who are compelled to serve in hostile forces lack duties toward hostile forces, and compelling them to fight as if they had such duties is morally wrong. However, recall that we are now concerned with the question of why compelled service in hostile forces to fight legal wars is worse than the state’s conscription of its own citizens to fight legal wars. In this case, when states breach the CSHF prohibition, they are doing so in relation to individuals who, if fighting for their own states, would be fighting a war of aggression, which is also an unjust war. When states are engaged in unjust wars, the question of whether their citizens still retain a duty to fight on behalf of their states is highly controversial, and the most plausible answer is that they almost always lack such duties toward the state, at least in cases of egregiously unjust wars.172See Prieto Rudolphy, supra note 10, at 94–110, 241–67; McMahan, supra note 89, at 66–78; Susanne Burri, If You Care About a Rule, Why Weaken Its Enforcement Dimension? On a Tension in the War Convention, 41 L. & Phil. 671, 671 (2022). But see generally David Estlund, On Following Orders in an Unjust War, 15 J. Pol. Phil. 213 (2007) (arguing that under certain limited circumstances, soldiers might be morally obligated to follow orders to fight in an unjust war); Yitzhak Benbaji, A Defense of the Traditional War Convention, 118 Ethics 464 (2008) (arguing that the moral equality of combatants is a fair and mutually beneficial norm that can explain why soldiers can follow orders to fight in unjust wars). Thus, I will not address whether compelled service in hostile forces also involves making individuals breach duties toward their own states.

The argument based on duties owed to the state thus requires showing that citizens have certain duties toward their own states (or their political communities) that noncitizens lack, and, in particular, that they have duties to fight on behalf of their state that noncitizens lack.173I thus leave aside whether citizens might have enforceable duties owed to the state to participate in the war effort in other ways. Some of the arguments I develop here will be applicable in this context too, but there are some important differences as well. This argument is obviously related to the question of political obligation, that is, the question of whether individuals have a prima facie, context- and content-independent moral duty to obey the law of the jurisdiction they are in.174Samuel Scheffler, Membership and Political Obligation, 26 J. Pol. Phil. 3, 3 (2018). Because conscription is often implemented through the legal regime, refusal to accept conscription will often involve disobeying the relevantly applicable laws.

However, the question of political obligation is somewhat different in character to the question of whether citizens have a duty to fight—to die and kill—for their states. The first difference is that conscription in times of war, unlike other instances of duties imposed by law, is extremely demanding: individuals are likely to face mortal and moral peril when they are called to kill and die on behalf of the state.175See generally Moshe Halbertal, On Sacrifice (2012) (discussing sacrifice in the context of war and violence). Fighting in war, unlike complying with most legal rules, is a significant burden. War is risky, both in physical and moral terms,176See Prieto Rudolphy, supra note 10, at 195, 248–49. and individuals only have one life to live—their own.

As a result, general arguments as to why individuals have moral duties to obey the law of their own states might not be sufficiently powerful or weighty to explain why individuals might have moral duties to die and kill for their states. This is not implausible: no account of political obligation defends an absolute duty to obey the law, independent of its content, and most accounts are qualified in several respects.177Liam Murphy, What Makes Law: An Introduction to the Philosophy of Law 120 (2014); Scheffler, supra note 174, at 3. Further, some political theorists, like Hobbes and Rousseau, have treated the question of an obligation to kill and die for the state as a separate, distinct issue.178Walzer, supra note 17, at 77–98. Rousseau, for example, held the view that to bear arms on behalf of the state was the ultimate public duty because every individual shares in the moral goods of the community.179Id. Hobbes contended that individuals do not have a duty to fight on behalf of the state because once the state demands citizens to die on its behalf, the social contract breaks down: the state and the citizen are at war with each other and they are hence returned to the state of nature.180Id. at 81–84. There are some passages where Hobbes seems more ambivalent. See id. at 85–86. This is so because the end of the state, for Hobbes, is individual life.181Id. at 82. An individual who dies for the state defeats the very purpose of forming that state: the preservation of life.182Id. Others interpret this aspect of Hobbes’s theory as a matter of prudence, in the context of which self-preservation was paramount.183Cheyney Ryan, The Dilemma of Cosmopolitan Soldiering, in Heroism and the Changing Character of War 120, 128 (Sibylle Scheipers ed., 2014).

The second reason why the question of political obligation is different from the question of conscription is that the first one is a question about whether there is a content- and context-independent duty to obey the law. By contrast, the question regarding conscription is not content-independent: it is a question of whether individuals have duties, owed to their state, to fight in legal wars.

Despite these differences, conscription has received little theoretical attention from political and moral philosophers184See, e.g., Mathias S. Sagdahl, Conscription as a Morally Preferable Form of Military Recruitment, 17 J. Mil. Ethics 224, 225 (2018); Ryan, supra note 99, at 143. besides focused attention on the legitimacy of the draft in the 1960s and early 70s.185Cheyney Ryan, The Chickenhawk Syndrome: War, Sacrifice, and Personal Responsibility 20 (2009). In fact, in the contemporary ethics of war, the question about conscription has been posed as a question of whether conscription might have an impact on one’s liability to defensive force. In other words, questions about conscription have been reduced so far to the question of whether coercion would make conscripted unjust combatants morally impermissible targets.186See, e.g., Michael Walzer, Just and Unjust Wars: A Moral Argument with Historical Illustrations 151 (4th ed. 2006); McMahan, supra note 89, at 50; Helen Frowe, Defensive Killing 21 (1st ed. 2014); Victor Tadros, To Do, to Die, to Reason Why: Individual Ethics in War 237 (2020). But see Ryan, supra note 99, at 133. But the legitimacy of conscription itself has been hardly discussed.187But see Ryan, supra note 99, at 133.

This omission is quite puzzling. As things are, states have the capacity of generating “unlimited numbers of soldiers by their coercive practices.”188Id. at 131. Indeed, during the 19th century, pacifism about war included an argument against conscription, which was considered to be a “type of enslavement at the heart of war” that led to a system of inhumanity; that is, a system where war was seen as taking a life of its own.189Cheyney Ryan, Bearers of Hope. On the Paradox of Nonviolent Action, in Soft War: The Ethics of Unarmed Conflict 184 (Michael L. Gross & Tamar Meisels eds., 2017). Although liberal thinkers like Hobbes, Locke, and others during the 18th and 19th centuries have remarked on the incompatibility of the rights of individuals with the power that the military possesses over its soldiers, very few thinkers have seriously questioned the permissibility of the state’s conscription practices.190Ryan, supra note 99, at 142.

Because the philosophical treatment of conscription is sparse, the literature on political obligation is a good starting point for assessing arguments in favor of a duty to fight for one’s state, even if there are relevant differences between the two.

Insofar as we can establish that a duty to accept conscription in times of war exists and is owed to one’s state or to one’s political community, we can explain the distinction that exists in international law between compelled service in hostile forces and the state’s conscription of its own citizens. This thought—that the justification of conscription relies on the existence of a duty toward one’s community or one’s state—is familiar. Conscription, as Ryan notes, is often experienced both as naked coercion and as embodying a legitimate social obligation.191Id. at 139.

Let us start by discussing how different theories of political obligation have justified the existence of a duty to obey the law.

1.  Theories of Political Obligation

In the literature on political obligation, context- and content-independent duties to obey the law have been grounded on consent,192John Locke, Two Treatises of Government and a Letter Concerning Toleration 3 (Ian Shapiro ed., 2003). fairness,193See, e.g., H.L.A. Hart, Are There Any Natural Rights?, 64 Phil. Rev. 175, 178 (1955); Russell Hardin, The Free Rider Problem, Stan. Encyc. Phil. (2003), https://plato.stanford.edu/archives/spr2013/entries/free-rider [https://perma.cc/SH9V-C9UZ]. natural duties of justice,194Jeremy Waldron, Special Ties and Natural Duties, 22 Phil. & Pub. Affs. 3, 3 (1993). associative duties,195Scheffler, supra note 174, at 4; Ronald Dworkin, Law’s Empire 195–216 (1986). on the basis of democratic authority,196Thomas Christiano, The Authority of Democracy, 11 J. Pol. Phil. 266, 268 (2004); Daniel Viehoff, Democratic Equality and Political Authority, 42 Phil. & Pub. Affs. 337, 338 (2014). and a commitment to law.197See generally Jiménez, supra note 151 (arguing that the duty to obey the law can be explained on the basis of a commitment). However, some authors remain skeptical that any such duties exist.198Murphy, supra note 177, at 110–43; A. John Simmons, Moral Principles and Political Obligations 189 (1979).

Theories based on consent199See generally, e.g., Locke, supra note 192. or commitment200See generally, e.g., Jiménez, supra note 151. regard political obligations as obligations of commitment.201Simmons, supra note 198, at 58. The idea is that the community has granted authority to the government and chose to undertake political obligations,202Id. at 58–59. or that individuals have adopted a commitment to law.203See generally Jiménez, supra note 151 (arguing that reasons to act in conformity with law depend on agents’ commitments). These theories tend to be voluntaristic (what matters for establishing a duty to obey the law is whether individuals have, in fact, consented), in which case the arguments discussed pertaining to loyalty will apply. When they are less voluntaristic (what matters is whether people should consent, or whether people in certain idealized circumstances would have consented), the arguments that apply to fairness, associative duties, and natural duties–based theories will often apply to them as well.

Fairness-based theories argue that states provide their citizens with significant benefits that they would otherwise not obtain. In a Hobbesian account of the state, for example, we would argue that citizens benefit from membership in their state because the latter’s existence allows them to exit the state of nature.204See generally Thomas Hobbes, Leviathan (Edwin Curley ed., 1994). In broader accounts, we might posit that the state allows individuals to access a number of goods that they could not obtain otherwise, such as security, the existence of a legal system, the solution of coordinative problems, and so on. Plausibly, noncitizens do not benefit as much as citizens from these arrangements. A fairness-based account of political obligation, then, holds that when a number of individuals “engage in a just, mutually advantageous, cooperative venture according to rules and thus restrain their liberty in ways necessary to yield advantages for all, those who have submitted to these restrictions have a right to similar acquiescence on the part of those who have benefited from their submission.”205Robert Nozick, Anarchy, State, and Utopia 90 (1974). Acceptance of the relevant benefits, even in the absence of express or tacit consent to cooperate, are enough to bind individuals to do their fair share in the group.206Id. Fairness-based accounts of political obligation explain the existence of the latter on the idea that individuals benefit from certain arrangements, and those benefits make it the case that they are obligated to participate in their production by following the rules of the scheme of cooperation.

In the context of conscription, Rawls, for example, suggested that the draft could be defended as a fair way of sharing in the burdens of national defense.207John Rawls, A Theory of Justice 380–81 (rev. ed. 1999). Given that even just and well-ordered societies cannot entirely eliminate the possibility of aggression by another state, they should make sure that the burden to defend one’s country should be evenly shared by all members of society over the course of their lives and that there is no avoidable class bias in selection.208Id. Further, given that conscription is “a drastic interference with basic liberties,” Rawls argued that conscription would be justified only if it is demanded for the defense of liberty itself.209Id. Rawls did not think this duty was content-independent: he suggested there might be a right or a duty to disobey if the war is unjust or is being fought in an unjust manner.210Id. at 381.

An alternative way to ground duties of obedience is to argue that citizens have associative duties or membership-dependent reasons to do their share, “as defined by the norms and ideals of the group itself, to help sustain it and contribute to its purposes,” provided that the norms are neither gravely unjust nor irrational, and the group is not corrupt.211Scheffler, supra note 174, at 6. Scheffler argues that this might, on certain assumptions, provide the basis of an argument in favor of the existence of a duty to obey the law.212Id. at 9. Those assumptions are, first, that membership in a political society can be noninstrumentally valuable; second, that the laws of a society are among its norms of individual conduct; and third, that these reasons amount to duties.213Id. Note that whether membership in the state (or any group) is noninstrumentally valuable depends on the justice (or injustice) of the group in question: at a certain point, the injustice of any given society will erode the value of membership in that society for, as Scheffler observes, part of the value of membership in a political society such as the state is that “it makes it possible to live on just terms with others.”214Id. at 14.

Accounts based on natural duties provide a different alternative.215Rawls, supra note 207, at 114–17, 333–37. See generally Waldron, supra note 194. They contend that there is a natural duty of justice which requires individuals to support and to comply with just institutions that exist and apply to them, as well as to further just arrangements not yet established (at least when it is not too costly for individuals).216Rawls, supra note 207, at 115.

Finally, duties to obey the law might also be based on the law’s democratic provenance.217See, e.g., Estlund, supra note 172, at 213. The democratic authority account obviously has a problem regarding scope: it can only justify conscription in democratic states. Thus, it could not explain why compelled service in hostile forces is worse than the state’s conscription of its own citizens. If successful, it can only explain why compelled service in hostile forces is worse than democratic states’ conscription of their own citizens and why conscription by democratic states is permissible.

All the arguments just discussed aim to explain why individuals might have content- and context-independent duties to obey the law, within certain limits. But duties of obedience can also be grounded instrumentally, that is, on the basis of the ability of the state, the legal system, or the practice of widespread obedience to the law of securing certain good outcomes.218Murphy, supra note 177, at 125. Of course, instrumentalist accounts of duties to obey the law fail to support a context- and content-independent duty to obey the law—that is not what instrumentalist accounts are trying to do.219Id. at 129. At most, an instrumentalist account will be able to support duties to obey certain laws, insofar as obeying those laws is a way of securing the outcomes we want to secure.220Id. In the case of conscription, the argument would be able to support the duty to accept conscription if doing so was the kind of thing that helped secure certain outcomes or if not doing so would risk the collapse of, say, one’s state (provided that the state is a valuable institution).

Historically, conscription, both in times of peace and in times of war, has been justified instrumentally on the basis of different outcomes or goals,221See Matthew Kosnik, Conscription in the Twenty-First Century: Do Reinforcements Equal Security?, 36 Compar. Strategy 457, 457 (2017). some of which Leander refers to as “myths” regarding conscription.222Leander, supra note 67, at 573–74. Some of these myths have been empirically debunked, but they tend to fail on their own terms anyway.

One argument in favor of conscription, and against all-volunteer armed forces (“AVF”), is that the latter pose a danger to democracy, while conscription is central for controlling the use of force in society.223See, e.g., id. at 580–82; Milton Friedman, Why Not a Voluntary Army?, 4 New Individualist Rev. 3, 4–7 (1967). Neither Leander nor Friedman think this argument is sufficient to defend conscription. There is thus a supposed link between the preservation of democracy and conscription, or, alternatively, all-volunteer armed forces can pose a threat to the political community, democracy, and freedom which conscription-based armies are less likely to pose.

However, historical examples do not support the notion that conscription-based armies are particularly effective in controlling or constraining the use of force by their own state leaders. The general draft allowed Hitler to form a powerful army and conscripts did not stop his plans from within, and both Stalin’s and Mussolini’s armies were composed of conscripts, who also posed no considerable internal resistance.224Carsten Keil, On Conscription 6 (Nov. 19, 2001) (unpublished manuscript), https://citeseerx.ist.psu.edu/document?repid=rep1&type=pdf&doi=60044ff7f202faa491c4aa8b5fdddf81e63c4d01 [https://perma.cc/A6FW-TXE8]. Conscript-based militaries in Chile, Argentina, and Turkey also offered no resistance to military takeovers.225Kosnik, supra note 221, at 458. It is thus not true that conscription can be a cure to the threat that all voluntary armies pose, and as Keil notes, the “equation ‘conscription equals democracy’ is badly flawed.”226Keil, supra note 224, at 6. Interestingly, a study conducted in thirty-four European states in 1997–2017 found that in countries with conscription-based recruitment, there are higher levels of support for the military.227Ioannis Choulis, Zorzeta Bakaki & Tobias Böhmelt, Public Support for the Armed Forces: The Role of Conscription, 32 Defence & Peace Econ. 240, 243 (2021).

Further, even if it were true that AVFs pose graver threats to democracy than conscript-based forces, this would not, on its own, provide an argument in favor of the permissibility of conscription. It only highlights an aspect that makes conscription better than AVFs. Pattison, for example, has argued that the AVF is the most legitimate way of organizing the military, partly because it does not severely infringe on individuals’ liberty.228Pattison, supra note 101, at 149.

A second myth is that conscription works to construct a more tightly knit society, both as a source of social mobility and as a source of social integration.229Leander, supra note 67, at 573–74. This logic does not resist analysis: in most places, women are not subject to conscription, so whatever social integration or mobility is created clearly excludes roughly half the population.230Id. at 574. It is also unclear why social integration should stop at the border of one’s own country.231Id. Further, this argument might work for conscription as a state’s general practice during times of peace, but it does not really support conscription or the draft in times of war. There is no social integration when people are dead.

A third argument usually employed to support conscription is that it helps to form loyal and virtuous citizens because conscription itself works as the “school of the nation.”232Id. at 576–77. However, as Leander points out, the idea that the military could and should play a role in forming virtuous citizens is in tension with democratic understandings of what makes a virtuous citizen in most contemporary political thinking.233Id. at 578. Even if that were the case234A study by Choi et al. found that states with conscription policies have higher levels of electoral participation than states without conscription policies. Min Jae Choi, Seung Wook Yoo & Zack Bowersox, Conscription and Political Participation: How Conscription Policies Affect Voter Turnout, 50 Armed Forces & Soc’y 315, 316 (2024).—that is, even if conscription was an effective tool in creating “virtuous citizens”—it cannot be plausibly sustained that as a result the state can demand people to kill and die on its behalf. The intrusion on personal freedom and the sacrifice demanded of individuals is far too great in comparison to the pursued goal, which is relatively insignificant. It is also far from clear that the state, at least one committed to some version of political liberalism, can coercively make individuals believe and endorse its own conception of virtuousness.

Thus, an instrumentally justified duty to accept conscription must go beyond these arguments. It must rely on the benefits that the state provides for its citizens, which might be diminished if the war is lost or eliminated entirely if losing the war entails the destruction of the state. In the latter case, one might be facing something like a “supreme emergency.”235Walzer, supra note 186, at 250–67. I will come back to this point later.236Of course, instrumentally justified duties face a number of objections. In this context, the difficulty is that one person’s refusal to fight does not, in fact, have a great effect on the overall outcome of war, but, at the same time, it imposes a significant burden on the individual. On this, see Jonathan Glover & M.J. Scott-Taggart, It Makes No Difference Whether or Not I Do It, 49 Proc. Aristotelian Soc’y, Supplementary Volumes 171, 171 (1975). In the first case, the argument relies on the notion that states fulfill valuable ends and thus have a right to their own survival. Citizens, then, must contribute to the survival of their own state by fighting.

2.  The Shortcomings of Theories of Political Obligation

Whichever way one grounds these duties, if successful, individuals might have duties to fight on behalf of their state. If so, then we will have an additional reason why compelled service in hostile forces is worse than conscription, and why conscription to fight legal wars is permitted by international law. However, these theories struggle to support this notion.

First, the theories tend to be overinclusive. If we accept that individuals can owe certain duties to groups or that there are instrumentally justified duties to fight legal wars, then there is no reason to think that those duties would be owed exclusively, or even primarily, to one’s state. Any community, political or otherwise, would be able to generate such duties, provided that individuals have consented to them, the community is reasonably just, individuals benefit from the existence of the community, and so on. There is, thus, a difficult question of demarcation: why is the state the right entity to which individuals owe duties to fight legal wars?

We live in an increasingly interconnected society in which noncitizens can, in fact, benefit considerably from the functioning and existence of other states—and sometimes to a greater extent than the state’s own citizens. Further, individuals might also benefit from the existence of the international legal system, in which case they would be obliged to that system or to all states, not just to their own. In the context of war, if pacifism is false,237See Prieto Rudolphy, supra note 10, at 184–85 (pointing out that pacifism is difficult to resist). See generally Cheyney C. Ryan, Self-Defense, Pacifism, and the Possibility of Killing, 93 Ethics 508 (1983) (arguing that pacifism, understood as opposition to killing, is attractive at an emotional and intellectual level, and arguments against it are difficult to make). there is a plausible argument that legal wars, when they are just and aim to uphold the prohibition on aggression, concern and benefit not just citizens of the states involved, but also the international community as a whole. This has implications for the theories of political obligation just discussed.

Take the fairness and natural duty-based accounts. If legal wars generate benefits to everyone, then everyone would have a duty to fight, and that duty would not be owed to one’s state but to the relevant group (those who benefit from legal wars). If everyone has duties to uphold and further just institutions, then everyone would have a duty to fight a legal war, which would not be owed to one’s state, but to all states or the international community.238Waldron, supra note 194, at 9–11. Waldron proceeds to argue for some differences in the extension of those duties, but he does not deny that duties of justice might exist toward other societies. Similar objections are also developed by Simmons. Simmons, supra note 198, at 146–52. This is known as the particularity objection in the literature on political obligation. If we take an instrumental account of political obligation, then one would be required to fight anytime it would help uphold the legal regime or the existence of sufficiently valuable states. And, again, it is unclear why that duty would be owed to any particular state or to one’s own.

The problem, then, is to draw a significant line between citizens and noncitizens. If this is right, accounts of political obligation can explain why the state’s conscription of its own citizens is permitted. However, they will struggle to explain why compelled service in hostile forces to fight legal wars is a war crime, or why non-state groups are prohibited from conscripting individuals to fight legal wars. They provide a reason to the contrary. That is, they provide a reason in favor of everyone having duties to fight legal wars: duties that are owed not to one’s own state, but to the international community as a whole or to other relevant groups. If so, compelled service in hostile forces of the kind that does not involve objectionable forms of treatment or the breach of duties toward others would not be morally unjustified. Even though it would be worse than the state’s conscription of its own citizens, it would be permissible. And although prisoners and those in occupied territories would remain vulnerable, coercing them to fight would not, in at least some circumstances, constitute an instance of harming them, given that it would, in fact, constitute the enforcement of a moral duty to fight.

The second problem with theories of political obligation is that they cannot support the notion that many, or even most, individuals have duties to fight legal wars that are owed to all states, their own states, or the international community. This is so because theories of political obligation have been developed under certain ideal assumptions, mainly that institutions and states are reasonably just. In these ideal circumstances—that is, when states are just and treat members equally—these accounts might be able to take off, to a greater or lesser extent.239I personally agree with Murphy and Greene that no context- and content-independent duty to obey the law exists. See Abner S. Greene, Against Obligation: The Multiple Sources of Authority in a Liberal Democracy 186 (2012); Murphy, supra note 177, at 125. But they struggle greatly in nonideal conditions, which are the present conditions of all states (and of the international community).

Take the fairness-based theory. In most states, individuals do not equally benefit from the state’s existence and the prevailing social arrangements. In fact, many states not only fail to benefit certain sectors of the population, but also actively contribute to their oppression and marginalization. That is, some individuals are not only not benefitted by the state but also are harmed by it. For example, Raff Donelson has argued that Black people and police in the United States are locked in a Hobbesian state of nature; that is, in this respect, the state fails to secure even the most basic conditions of personal security.240Raff Donelson, Blacks, Cops, and the State of Nature, 15 Ohio St. J. Crim. L. 183, 183 (2017).

A fairness-based account already struggles in ideal circumstances. It is hard to accept that the mere fact of benefiting from a social practice or institution provides a good argument as to why those who benefit—and who might not have accepted nor wanted the practice, the institution, or the benefit—can be burdened to obey.241Nozick, supra note 205, at 90–95. This is particularly the case when the benefits are not equally distributed among members or the costs of complying with the obligations are higher than the benefits the person obtains from the social practice.242Id. The latter is especially relevant in the context of conscription, in which the costs of compliance are very high (risking severe injury and death in addition to putting oneself at risk of committing morally wrongful acts), but the benefits any given person obtains from the state are likely to be significantly smaller in comparison. In nonideal circumstances this is an even more pressing issue, and the argument simply cannot take off: some individuals (often those who are most likely to be conscripted or drafted) do not benefit from the existence of the state, or benefit very little, and thus cannot be obliged to take on the higher burden of fighting to defend it.243See Jeffrie G. Murphy, Marxism and Retribution, 2 Phil. & Pub. Affs. 217, 240 (1973) (making similar arguments regarding punishment).

The same is true in an associative duties-based account: some individuals simply lack any reason to place noninstrumental value on the existence of their state. The state actively makes their lives worse or prevents them from living justly with others. And in a consequentialist or instrumentalist account, there is no reason for individuals to fight to defend a state (or an international legal order) that they might be better off without, or from which they might benefit little, when the burden of fighting is so high. Further, a consequentialist account could not establish that individuals have general duties to fight; it can only establish that, depending on the circumstances, sometimes some individuals will have duties to fight.244On the inability of act- and rule-consequentialist arguments to support a duty to obey the law, see Greene, supra note 239, at 96–108. These might seem, as Murphy writes, “banal empirical observation[s],” but “it is through ignoring such banalities that philosophers generate theories which allow them to spread iniquity in the ignorant belief that they are spreading righteousness.”245Murphy, supra note 243, at 241.

Theories of political obligation then struggle in two dimensions. First, in their idealized versions, they might actually ground duties to fight legal wars that are owed to the community of states. If this is true, then there is a pro tanto reason against the CSHF prohibition when protected individuals are compelled to fight in hostile forces engaged in legal wars. This, of course, cannot provide a complete argument against the CSHF prohibition. Compelled service in hostile forces would remain both morally worse than the state’s conscription of its own citizens and morally wrong when involving objectionable treatment. Further, incentives to treat foreigners poorly would still exist and provide a reason in favor of the prohibition.

Second, in nonideal circumstances (which are the circumstances of all present states), theories of political obligation struggle to ground duties to obey the law. If they struggle to even do this, it is even harder to argue that they could support a duty to fight and kill on behalf of the state (or the community of states), given how demanding the duty is. Theories of political obligation are not unresponsive to the demandingness of the burdens associated with compliance with the law. In the case of conscription, the obligation to fight is extremely demanding, which makes it harder to defend. It is also a grave imposition on individuals’ freedom, which also makes it harder to justify.

If this is true, then both compelled service in hostile forces and the state’s conscription of its own citizens are unjust practices. Although the first is often worse than the second, both are morally wrong. This is so because, in nonideal circumstances, individuals often do not have duties toward the international community nor to their own state to fight and kill on its behalf. And because many individuals lack those duties, it is wrong for states to force them to fight, even in legal wars.

3.  The Authority of the State to Enforce Duties to Fight Legal Wars

Note that the argument so far does not establish that no one has duties to fight legal wars that are owed to the state or the international community. The argument only establishes that in nonideal circumstances, many individuals lack such duties. However, at least some individuals might have duties to fight legal wars because, for example, they benefit considerably from the existence of their own state, or they have consented to have such duties by, say, joining the army. Others might have duties to fight a legal war based on instrumental reasons, and some individuals might have moral duties to fight that are owed to their own community or group.

Of course, it can be hard for the state to identify who these people are. But if it could do so, and these individuals refused to fight, can the state enforce their duties to fight?

This is a question about the political legitimacy or authority of the state.246Murphy, supra note 177, at 116. Although related to the question of political obligation, it is not the same. The question of political obligation pertains to whether individuals have a prima facie moral duty to obey the law. The question about political authority pertains to whether the state is justified to issue and enforce binding directives, sometimes referred to as the question of political legitimacy.247Id. Some think that the two questions can come apart: it is possible that states are justified in issuing and enforcing directives while, at the same time, individuals lack a moral duty to obey them.248Id.

In the context of conscription, this would mean that states might be justified in demanding conscription of citizens, even in cases in which citizens might lack any duties to accept conscription that are owed to the state. If so, one could argue that compelled service in hostile forces is wrong because hostile forces lack authority over individuals to coerce them to fight on their behalf; that is, they lack political authority regarding foreigners, but they do have authority over their own citizens. Coercion over their own citizens is thus legitimate, whether it involves coercion to pay taxes or to fight wars.

The first problem with relying on the legitimacy of the state is that although most political philosophers agree that legitimacy does not require that the state is perfectly just, it does require that the state is reasonably just.249C.H. Wellman, The Space Between Justice and Legitimacy, 31 J. Pol. Phil. 3, 9 (2023). However, a significant number of states are not even reasonably just. Only 57% of states in 2017 were “democracies of some kind,”250Drew Desilver, Despite Global Concerns About Democracy, More Than Half of Countries Are Democratic, Pew Rsch. Ctr. (May 14, 2019), https://www.pewresearch.org/short-reads/2019/05/14/more-than-half-of-countries-are-democratic [https://perma.cc/DS8D-625S]. and there is disagreement about whether certain states that are “democracies of some kind” are sufficiently just to be legitimate.251Wellman, supra note 249, at 3. Thus, this reliance on political legitimacy cannot explain why conscription is allowed under international law. If anything, it should be severely restricted.

Second, political legitimacy aims to justify state coercion generally. But it cannot justify every instance of coercion: in fact, all theories of political legitimacy recognize limits. Conscription to fight in war is exceedingly burdensome. If, as I have argued, some citizens lack duties to accept conscription precisely due to the state’s failure to protect them, benefit them, or make them better off, then surely the state lacks any sort of prerogative, for the same reasons, to enforce directives to kill and die on its behalf. The state’s authority would only be plausible in the case of those very few citizens who have duties toward the state and, perhaps, in the case of those who have duties to fight owed to other members of the community (but not to the state). For example, in other work I have argued that when burdens are imposed unfairly, the group to which the individual belongs to is corrupt or unjust, and refusal to fight is costly, individuals can have a pro tanto reason to accept conscription because refusing it would entail deflecting harm on other, innocent individuals.252Prieto Rudolphy, supra note 10, at 240. In this case, however, it is obvious that the state cannot enforce that obligation: that obligation is not, in fact, owed to the state, and the state is acting wrongfully when demanding conscription.253Id.

In the first case, when duties to fight are owed to the state, we might still put into question the state’s legitimacy to enforce those duties, particularly in severely unequal and individualistic (i.e., capitalist) societies. Such societies, Murphy argues, incentivize individualism. In that context, there is something perverse about the state’s enforcement of obligations to fight and die on its behalf when doing so “presuppose[s] a sense of community in a society which is structured to destroy genuine community.”254Murphy, supra note 243, at 239.

Finally, wars, even legal wars, involve causing harm against many innocent people, including civilians. Legal wars are also often fought unjustly, and war is a particularly morally risky enterprise. If, as Parry and Easton have argued, individuals have a presumptive claim against exposure to moral risk—which grounds duties in others (such as the state) not to expose them to moral risk—then states can hardly demand individuals to fight on their behalf.255See Jonathan Parry & Christina Eaton, “Filling the Ranks”: Moral Risk and the Ethics of Military Recruitment, Am. Pol. Sci. Rev., 2023, at 1, 3, https://www.cambridge.org/core/services/aop-cambridge-core/content/view/AECBBE88736D2AB01470BDB404E537BF/S0003055423001247a.pdf/filling-the-ranks-moral-risk-and-the-ethics-of-military-recruitment.pdf [https://perma.cc/855E-Q9LU]. And if, as I have argued, the moral nature of what one is coerced to do matters, individuals facing conscription in conflicts that are fought in breach of jus in bello norms could also not be coerced to fight.

Ultimately, even if some citizens have duties to fight, it will often be the case that citizens are similarly placed to noncitizens: both will lack duties to fight that are owed to the state, and the state will lack authority to enforce duties to fight on its behalf. If that is true, then the state’s conscription of its own citizens is not generally permissible. Conscription to fight in wars cannot be justified as a legitimate practice deserving of international protection—at least not until states become more just.

IV.  A MORALLY COHERENT INTERNATIONAL REGIME ON CONSCRIPTION

I have argued that the international legal regime on conscription is morally incoherent. In order to justify the regime, two arguments needed to succeed, yet both have failed.

First, the regime’s failure to distinguish between legal and illegal wars cannot be justified. There is no argument why compelled service in hostile forces to fight a legal war is equally bad or equally wrong as compelled service in hostiles forces to fight a war of aggression. The latter seems significantly worse. Although both cases involve coercion, the latter involves coercion to contribute to an international crime. As a result, the nature of the war one is coerced to fight should be an additional aspect of the crime.

Further, the distinction between legal and illegal wars is also relevant pertaining to the state’s conscription of its own citizens. States should be prohibited from conscripting their own citizens to fight wars of aggression. There are also powerful reasons for thinking it should be a war crime; severely infringing individuals’ liberty to coerce them to participate in deeply wrongful acts does seem to “shock the conscience” of humankind. And it certainly seems sufficiently serious for international law to be concerned with it.

Second, although, all else equal, compelled service in hostile forces of protected persons is often morally worse than the state’s conscription of its own citizens, the argument that supports this claim—that it is worse to harm those who are vulnerable and to be forced to fight against those we care about—cannot render the state’s conscription of its own citizens permissible. It can only show that the state’s conscription of its own citizens might be morally better (or less bad) than compelled service in hostile forces. But the fact that the state’s conscription of its own citizens is morally better (or less bad) than compelled service in hostile forces cannot show, in and of itself, that conscription by the state is permissible or justified. In fact, in current, nonideal conditions, citizens will often lack duties to fight that are owed to their own states, and the state will also lack authority to compel its own citizens to fight, even in legal wars. This is what makes both compelled service in hostile forces and conscription to fight wars morally wrong.

There are thus powerful pro tanto reasons for international law to prohibit states from conscripting or drafting individuals to fight wars. There might also be reasons why it should be an international crime, for the same reasons why conscription in the context of illegal wars should be one. Conscription has been described as “tyranny,” and the peculiar horror of modern warfare as a social practice is that states can exercise “ ‘tyrannical power’ ” against their own loyal people as well as their enemies.256Ryan, supra note 99, at 131 (citation omitted). I will not argue in favor of this, but, if that were the case, the war’s illegality should be an additional aspect of the crime of conscription.

In sum, to render the regime morally coherent, the state’s conscription of its own citizens to fight an aggressive war should be a war crime; the illegality of the war should be an additional aspect of the crime of compelled service in hostile forces; and the state’s conscription of its own citizens should be generally prohibited.

However, I have not made an all-things-considered case in favor of the prohibition of conscription at the international level. I have said that the regime is morally incoherent and there are powerful reasons to render it coherent. But perhaps there are other, more powerful reasons why the state’s conscription of its own citizens to fight legal wars should be permitted by international law.

First, one might argue that prohibiting conscription would make it harder to fight both legal and illegal wars. If anything, it would make the former harder than the latter. This is so because if international law prohibited conscription, the states more inclined to comply with that prohibition would be precisely those states more likely to be engaged in legal wars. On the contrary, states likely to be engaged in illegal wars would be more likely to breach that prohibition. If that were the case, then international law would create a perverse system in which rogue states would breach the prohibition on conscription while law-abiding states would not, thus allowing the first to gain a significant military advantage over the latter.

I think this worry is overblown. It might be that legal wars would become more expensive to fight than illegal wars if the first relied on professional armies and the latter relied on conscription. However, I do not think the costs of war are reasons weighty enough to permit conscription, which severely infringes on individuals’ liberty. States would remain free to have AVFs, and they would also remain free to ask individuals to volunteer to fight in legal wars in certain circumstances.

Second, some might argue that conscription would be justified in the case of what Walzer calls a “supreme emergency,”257Walzer, supra note 186, at 323–27. in which the very existence of the state (or the international community) is at stake. Friedman, who was generally opposed to conscription, suggested that for a major war or “in times of the greatest national emergency,” a strong case in favor of compulsory service can be made.258Friedman, supra note 223, at 5–6. This would be the case if winning the war required conscription because, say, not enough individuals would volunteer to fight otherwise. But whatever one thinks about this case, it is not enough, on its own, to show that conscription should thus be generally permitted by international law. It only shows that there might be an exception regarding the (moral) prohibition on conscription. But laws should not be made thinking solely about the exception, especially when they are likely to be abused to commit wrongdoing.

Third, suppose that international law is committed to peace, as discussed before. And suppose, further, that lack of conscription in times of war makes it more likely for state leaders to go to war. This is known as the “chickenhawk syndrome.”259Ryan, supra note 185, at 12. Basically, the existence of an all-volunteer army, as opposed to one based on the draft or conscription, severs the connection between citizens and the wars that are fought on their behalf.260Id. Doing so makes war much easier: political leaders are more likely to go to war when war requires no personal sacrifice of their own or of their loved ones.261Id. at 4.

One might then posit that the prohibition on conscription would be self-defeating: instead of resulting in fewer wars, it would result in more wars, thus making peace more difficult to achieve. It is hard to know what to make of this objection. Conscription of citizens who lack duties to fight involves the violation of individuals’ most important rights at the hands of the state. It is at the very least controversial that we can make trade-offs regarding such rights in order to achieve certain desired outcomes (in this case, fewer wars).262See, e.g., Jonathan Quong & Rebecca Stone, Rules and Rights, in 1 Oxford Studies in Political Philosophy 222 (David Sobel et al. eds., 2015). If anything, the solution to this problem is to introduce other modifications, at the domestic or international level, to make wars harder to fight and that do not involve the violation of individuals’ rights.

Further, it is not clear at all that the inability of states to conscript their own citizens to fight would result in more wars. States are much more likely to fight illegal (and unjust) wars than to fight just ones, and allowing them to conscript individuals only facilitates their wrongdoing.

Finally, one might argue that although it is true that many individuals do not have duties to fight on behalf of the state, at least some individuals do. Thus, because at least some of the time conscription involves forcing people to do what they already have a duty to do, it should not be prohibited by international law. However, it would be impossible for states to determine who has moral duties to fight on their behalf (and in some cases, the state would still lack legitimacy to enforce such duties). As a result, a regime of conscription that could distinguish between those who have duties to fight and those who do not is not only unfeasible, but also very likely to get it wrong and thus violate individuals’ freedom by forcing them to fight. This seems to me a sufficient reason to generally forbid conscription to fight wars: if when states conscript individuals, they are likely to violate their freedom, then we should generally forbid conscription.

Again, the argument so far does not establish that all individuals in all circumstances lack duties to fight legal wars. Some individuals will have such duties in certain circumstances—for example, due to instrumental reasons, or because the state or the international community has benefitted them considerably. The point is only that the state should not be generally allowed to conscript individuals because, in present circumstances, many individuals lack those duties and conscripting them constitutes a grievous violation of their freedom. There are powerful reasons to generally prohibit state practices that are likely to violate the rights of many individuals, particularly when we cannot ensure that the practice is conducted in a way that can appropriately distinguish between individuals who have duties to fight and individuals who do not.

It is also important to note that the fact that there might be a legal prohibition on the state’s conscription of its own citizens does not present an insurmountable obstacle in enforcing moral duties to fight regarding those individuals who have them. In those cases, social pressure to comply with duties to fight might be justified, and states might be able to avail themselves of other mechanisms, short of coercion, to persuade individuals to fight on their behalf.

Thus, I think that the reasons in favor of rendering the international legal regime morally coherent are not just pro tanto reasons, but also conclusive ones, at least if one thinks that individuals’ rights should operate as a constraint on state action. International law should prohibit conscription to fight in war. The state’s conscription of its own citizens to fight aggressive wars should be a war crime. And the illegality of the wars individuals are compelled to fight should be an additional aspect of the crime of conscription.

There are different ways in which these changes could be achieved. For example, they might involve making conscription generally prohibited under international human rights law, with conscription to fight aggressive wars qualifying as a war crime under the jurisdiction of the ICC. It would also be necessary to grant extensive rights to individuals to engage in conscientious objection against military service in war and expand refugee protections to those who are forced to fight wars of aggression. The latter mechanism has the advantage of leaving the assessment of the war’s legality to domestic tribunals, thus bypassing the usual deadlock in the UNSC regarding these matters. I leave somewhat open the details of what a coherent regime would look like in practice, how it should be effectively enforced, and so on.

The idea that international law should generally prohibit conscription in times of war is, perhaps, completely utopian. It is also likely to destabilize other areas of international and domestic law, given how pervasive the idea that we owe special duties to our states is.263For example, it might put into question norms that criminalize treason, although not necessarily, as instrumentalist arguments in their favor can remain available. See Lee, supra note 146, at 333; Fabre, supra note 146, at 432. We might thus think that it is very unlikely that states would ever agree to any such norm prohibiting conscription during times of war, and that we should focus on making it a crime for states to conscript individuals to fight in wars of aggression. This might be true as a matter of what is presently feasible to achieve and how we should set our priorities.

However, the fact that something is utopian does not alter the moral demands.264See Estlund, supra note 15, at 116. If conscription to fight legal wars is often wrong, then it remains wrong—even if we are unable to prohibit it—at least until states or the international community are more successful in achieving justice.

CONCLUSION

This Article started by pointing out an asymmetry in the international legal regime on conscription: while compelled service in hostile armed forces is a war crime, a state’s conscription of its own citizens is the state’s prerogative. In order to defend the content of this regime, two arguments needed to succeed: first, that the distinction between legal and illegal wars is normatively insignificant; second, that compelled service in hostile forces is significantly different from conscription.

Both arguments have failed. The distinction between legal and illegal wars is morally significant. And although compelled service in hostile forces is wrong and is often morally worse than the state’s conscription of its own citizens, the latter is often not morally permissible.

As a result, the international legal regime on conscription is morally incoherent. It is morally incoherent because it is incomplete and cannot be made sense of.265Wasserstrom, supra note 12, at 7–8. It is incomplete because it fails to criminalize or prohibit conduct (the state’s conscription of its own citizens) that is similarly wrong than what it already criminalizes (compelled service in hostile forces). Furthermore, it cannot be rendered morally intelligible in two ways. First, what makes compelled service in hostile forces wrong also makes the state’s conscription of its own citizens wrong, yet the latter is permitted. Second, international law fails to distinguish between legal and illegal wars.

None of these implications mean that the CSHF prohibition lacks value or is morally misguided. On the contrary, the CSHF rule prohibits something that is morally wrong in a context in which states are likely to have perverse incentives to use individuals in objectionable ways, and as such, it is a valuable rule. Nonetheless, the international legal regime on conscription allows states to use the lives and bodies of millions of people to fight wars, both lawful and unlawful. Furthermore, it allows them to do so through coercive means. In order to render this regime morally coherent, conscription should be prohibited, and the unlawfulness of the war individuals are coerced to fight should be an additional aspect of the crime of conscription.

After all, individuals have only one life to live, and wars sow destruction, death, and suffering on a massive scale. As Walzer notes, “there has never been a more successful claimant of human life than the state.”266Walzer, supra note 17, at 77.

97 S. Cal. L. Rev. 1289

Download

* Associate Professor of Law and Philosophy, University of Southern California Gould School of Law, Los Angeles, California, United States of America, mprieto@law.usc.edu. Profesora adjunta extraordinaria, Universidad Adolfo Ibáñez, Escuela de Derecho, Santiago, Chile. I am indebted to Scott Altman, Ángel Díaz, Felipe Jiménez, Mugambi Jouet, Erin Miller, Jeesoo Nam, Jessica Peake, and the attendees of the Southern California International Law Scholars Workshop; the attendees of the Diálogos of Derecho Internacional Conference; Norrinda Brown, Benjamin Zipursky, and the attendees of the Fordham Legal Theory Faculty Workshop; Jaya Ramji-Nogales and the attendees of the ASIL Mid-year Research Forum Meeting; Seana Shiffrin, Lawrence Sager, and the attendees of the UCLA Legal Theory Workshop; and the attendees of the USC Faculty Workshop for their generous feedback. A special thanks to Jessica Murphy for her assistance with research and the editors of the Southern California Law Review for their excellent work.

Housing Gridlock

The housing crisis dominates much of political and economic life, and it is driven in large part by a lack of housing supply. Recognizing this, many commentators have called for the end of single-family zoning. And some jurisdictions have answered the call. But even if that groundswell grows, the housing shortage likely will persist. One underappreciated reason for that persistence is that a private network of restrictions stands ready to pick up where zoning leaves off. Specifically, restrictive covenants abound nationwide, and, just like zoning, they often cap how much housing can be built without regard to local or regional demand. The aggregate result of these covenants is what this Article calls “housing gridlock,” a side effect of dispersed private ownership rights that prevents property from reaching its full potential in service of both the public good and economic productivity. This Article explores the legal, financial, and, above all, psychological forces that brought about gridlock and further its hold today.

This Article then proposes a solution. Drawing primarily on the psychological phenomena and financial incentives that entrench housing gridlock in the first place, it crafts a two-stage procedural and remedial mechanism to break that gridlock. Critically, that mechanism aims to minimize psychic harm to existing property owners while freeing up additional housing supply. It begins with a lump-sum opt-in mechanism for willing covenant beneficiaries located near a proposed housing project, and an action only for damages within a predefined range for any remaining beneficiaries. This Article describes the finer details of this proposal and explores its potential advantages over other options in the preexisting legal toolkit from economic and (again, especially) psychological perspectives. Accordingly, it provides an asset to the multifront struggle for housing availability.

INTRODUCTION

The affordable housing crisis is well-documented: homelessness abounds;1U.S. Dept. of Housing & Urb. Dev., The 2022 Annual Homelessness Assessment Report (AHAR) to Congress 2 (2022), https://www.huduser.gov/portal/sites/default/files/pdf/2022-AHAR-Part-1.pdf [https://perma.cc/QX64-3LXY]. those who live in homes spend a larger portion of their income on housing than in the past;2Katherine Schaeffer, Key Facts About Housing Affordability in the U.S., Pew Rsch. Ctr. (Mar. 23, 2022), https://www.pewresearch.org/fact-tank/2022/03/23/key-facts-about-housing-affordability-in-the-u-s [https://perma.cc/K3T3-226D]; Housing Affordability Index (Fixed), Fed. Rsrv. Bank St. Louis (2023), https://fred.stlouisfed.org/series/FIXHAI [https://perma.cc/T9NH-JYMY]. the rate of homeownership among young adults over the past several years is lower than it has been in much of recent memory.3Erik L. Hernandez & Christopher Mazur, Homeownership by Young Households Below Pre-Great Recession Levels: Racial and Ethnic Disparity in Homeownership Continues Even Among Highly Educated, U.S. Census Bureau (Nov. 17, 2022), https://www.census.gov/library/stories/2022/11/homeownership-by-young-households-below-pre-great-recession-levels.html [https://perma.cc/NXV7-UV8J]. At a basic level, it is no mystery why this is the case. We have a severe housing shortage. Freddie Mac estimates a shortage of 3.8 million housing units nationwide.4Housing Supply: A Growing Deficit, Freddie Mac (May 7, 2021), https://www.freddiemac.com/research/insight/20210507-housing-supply [https://perma.cc/QA5T-ZAQ2]. Though a simple model of supply and demand cannot tell the entire story, it can go far. More people competing for fewer homes means sellers or landlords can command higher prices.5U.S. Housing Shortage: Everything, Everywhere, All at Once, Fannie Mae (Oct. 31, 2022), https://www.fanniemae.com/research-and-insights/perspectives/us-housing-shortage [https://perma.cc/G9WQ-PC4G]. Thus, many people are without homes, and those who secure housing pay more to do so.

Unsurprisingly, then, the majority of the population agrees that the U.S. needs more housing.6Emily Elkins & Jordan Gygi, Poll: 87% of Americans Worry About the Cost of Housing; 69% Worry Their Kids and Grandkids Won’t Be Able to Buy a Home, Cato Inst. (Dec. 14, 2022), https://www.cato.org/survey-reports/poll-87-americans-worry-about-cost-housing-69-worry-their-kids-grandkids-wont-be#building-more-houses [https://perma.cc/6PAF-56AB]. But that is where the consensus ends. What kind of housing? Where should it be located? How do we get more of it? These questions confound policymakers and prompt fiery reactions from people across the political and socioeconomic spectrums.7Id.; see also Ros Coward, Opinion, Nimbys Are Not Selfish. We’re Just Trying to Stop the Destruction of Nature, The Guardian (July 4, 2021, 10:34 EDT), https://www.theguardian.com/commentisfree/2021/jul/04/nimbys-nature-destruction-wildlife-developers [https://perma.cc/K67H-VZ9Q]; Sarah Holder, NIMBYs Really Hate Developers When They Turn a Profit, Bloomberg (Sept. 14, 2018, 6:42 AM), https://www.bloomberg.com/news/articles/2018-09-14/nimbys-really-hate-developers-when-they-turn-a-profit [https://perma.cc/Y66H-2JLT]; Cadence Quaranta, Developer Has Abandoned Plans to Demolish Former Hampden Bookbindery and Bird Roost, Balt. Banner (Feb. 14, 2023, 8:00 PM), https://www.thebaltimorebanner.com/community/local-news/developer-has-abandoned-plans-to-demolish-a-former-bookbindery-in-hampden-32NVJTNSAZGBRPNDK2HKH3KBQY [https://perma.cc/4BUS-9Y8M]; Jerusalem Demsas, The Next Generation of NIMBYs, The Atlantic (July 20, 2022), https://www.theatlantic.com/newsletters/archive/2022/07/the-next-generation-of-nimbys/670590 [https://perma.cc/LR9G-QJRS].

One flashpoint in this contentious ordeal concerns zoning. Single-family zoning is perhaps “[a]s American as apple pie,”8See Erin Baldassari & Molly Solomon, The Racist History of Single-Family Home Zoning, KQED (Oct. 5, 2020), https://www.kqed.org/news/11840548/the-racist-history-of-single-family-home-zoning [https://perma.cc/5NPK-SRRP]. but it and other restrictions with similar practical effects suppress the housing supply. It inflates prices and drives suburban sprawl, disproportionately harming racial minorities and lower-wealth Americans.9See Jillian McKoy, White Neighborhoods Have More Greenery, Fewer Dilapidated Buildings and Multi-Family Homes, B.U. Sch. Pub. Health (Jan. 19, 2023), https://www.bu.edu/sph/news/articles/2023/across-the-us-white-neighborhoods-have-more-greenery-fewer-dilapidated-buildings-fewer-multi-family-homes [https://perma.cc/LM4T-K9MB]; see also Emily Badger & Quoctrung Bui, Cities Start to Question an American Ideal: A House With a Yard on Every Lot, N.Y. Times (June 18, 2019), https://www.nytimes.com/interactive/2019/06/18/upshot/cities-across-america-question-single-family-zoning.html [https://perma.cc/9393-MEVV]; Cecilia Rouse, Jared Bernstein, Helen Knudsen & Jeffery Zhang, Exclusionary Zoning: Its Effect on Racial Discrimination in the Housing Market, The White House (June 17, 2021), https://www.whitehouse.gov/cea/written-materials/2021/06/17/exclusionary-zoning-its-effect-on-racial-discrimination-in-the-housing-market [https://perma.cc/QHX2-X9LH]. Indeed, racial animus likely motivated much of the initial push for single-family restrictions.10Katherine Levine Einstein, David M. Glick & Maxwell Palmer, Neighborhood Defenders 95, 137 (2020); see also Jonathan Rothwell & Douglas S. Massey, The Effect of Density Zoning on Racial Segregation in U.S. Urban Areas, 44 Urb. Affs. Rev. 779, 779–806 (2009), https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4083588/pdf/nihms453809.pdf [https://perma.cc/9CHU-3RCJ]. Today, then, single-family restrictions hoard opportunity on behalf of existing homeowners.

Anyone who keeps their ear to the ground in the world of housing policy, urban governance, or even national politics has heard calls to abolish single-family zoning.11Tom Coale, Opinion, Opinion: Ending Single-Family Detached Zoning Benefits Everyone, Md. Matters (May 26, 2021), https://www.marylandmatters.org/2021/05/26/opinion-ending-single-family-detached-zoning-benefits-everyone [https://perma.cc/3TUN-26TY]; Michael Manville, Paavo Monkkonen & Michael Lens, It’s Time to End Single-Family Zoning, 86 J. Am. Planning Ass’n 106, 106–12 (2020), https://www.tandfonline.com/doi/full/10.1080/01944363.2019.1651216 [https://perma.cc/B2LA-3HN8]; Badger & Bui, supra note 9. It’s not just chatter: multiple states and cities across the country, including Oregon,12H.B. 2001, 80th Leg. Assemb., Reg. Sess. (Or. 2019), https://olis.oregonlegislature.gov/liz/2019R1/Downloads/MeasureDocument/HB2001/Enrolled [https://perma.cc/LV36-MFWV]. California,13S.B. 9, 2021–22 Leg. Sess. (Cal. 2021), https://leginfo.legislature.ca.gov/faces/billNavClient.xhtml?bill_id=202120220SB9 [https://perma.cc/K2QM-4VHQ]. Maine,14H.R. 1489, 130th Leg., 2d Reg. Sess. (Me. 2022), http://www.mainelegislature.org/legis/bills/getPDF.asp?paper=HP1489&item=1&snum=130 [https://perma.cc/7AG8-WSCK]. and Minneapolis, Minnesota,15Dep’t of Cmty. Plan. & Econ. Dev., City of Minneapolis, Minneapolis 2040 Comprehensive Plan 105–07 (2019) https://lims.minneapolismn.gov/File/2018-00770 [https://perma.cc/AWG6-62DJ] (adopted by Minneapolis City Council Resolution No. 2019R-308 on Oct. 25, 2019). have passed laws to break single-family zoning’s hold on housing development. These are welcome developments, but, of course, they alone are insufficient to solve the housing shortage.16A slowing housing market, high building costs, and many other economic influences play a major role too. C. Tsuriel Somerville, Residential Construction Costs and the Supply of New Housing: Endogeneity and Bias in Construction Cost Indexes, 18 J. Real Estate Fin. & Econ. 43, 43–62 (1999), https://link.springer.com/article/10.1023/A:1007785312398 [https://perma.cc/BGG5-8793]; Kristian Hernández, Rising Construction Costs Stall Affordable Housing Projects, Stateline (Apr. 25, 2022, 12:00 AM), https://stateline.org/2022/04/25/rising-construction-costs-stall-affordable-housing-projects [https://perma.cc/8VXD-4YAX]; Chris Arnold, There’s Never Been Such a Severe Shortage of Homes in the U.S. Here’s Why, NPR (Mar. 29, 2022, 7:00 AM), https://www.npr.org/2022/03/29/1089174630/housing-shortage-new-home-construction-supply-chain [https://perma.cc/3ZEY-P3BA].

This Article targets another legal factor that blocks many cities and regions from meeting housing demand: single-family restrictive covenants. Restrictive covenants are private arrangements attached to property deeds limiting how property can be used, for the benefit of certain surrounding properties.17Cunningham v. City of Greensboro, 711 S.E.2d 477, 485 (N.C. Ct. App. 2011). The terms often apply for many decades to the burdened properties and automatically renew afterwards.18See id.; see also, e.g., Gardner v. Jefferys, 878 A.2d 259, 265 (Vt. 2005); Covenants, Conditions, and Restrictions for Meadowhaven Heights Phase II, Dallas, Or. ¶ 18 (Aug. 20, 1999) (on file with author); Declaration of Conditions, Covenants and Restrictions for Shadow Creek Estates Subdivision Phase II, Polk Cnty., Or. ¶ 15 (Sept. 13, 1997) (on file with author); Robert Ellickson, Stale Real Estate Covenants, 63 Wm. & Mary L. Rev. 1831, 1840 (2022). Covenants thereby can “run with the land” and do not merely bind specific parties to an agreement.19Maureen E. Brady, Turning Neighbors into Nuisances, 134 Harv. L. Rev. 1609, 1614 (2021) (citing Covenant, Black’s Law Dictionary (11th ed. 2019)). On the substance, restrictive covenants can limit property use in various ways.20Historically, the most infamous form was a racially restrictive covenant, which would restrict the property owner subject to the covenant from selling the property (usually a house) to a non-White purchaser. See Shelley v. Kraemer, 334 U.S. 1, 4, 18–19 (1948). The Supreme Court held all such covenants unconstitutional. Id. at 20–23. So, even though many remain on the books and attached to property deeds, they cannot be enforced through any formal legal means. Other sorts of restrictive covenants, however, more or less remain fair game. Cheryl W. Thompson, Cristina Kim, Natalie Moore, Roxana Popescu & Corinne Ruff, Racial Covenants, a Relic of the Past, Are Still On the Books Across the Country, NPR (Nov. 17, 2021), https://www.npr.org/2021/11/17/1049052531/racial-covenants-housing-discrimination [https://perma.cc/E7MF-SEKP]. One of the most prevalent covenant terms is one that restricts the burdened property to hosting one single-family home.21Gerald Korngold, Single Family Use Covenants: For Achieving a Balance Between Traditional Family Life and Individual Autonomy, 22 U.C. Davis. L. Rev. 951, 951 (1989); see also, e.g., Jackson v. Williams, 714 P.2d 1017, 1018 (Okla. 1985); Double D Manor, Inc. v. Evergreen Meadows Homeowners’ Ass’n, 773 P.2d 1046, 1048 (Colo. 1989); Jayno Heights Landowners’ Ass’n v. Preston, 271 N.W.2d 268, 445–46 (Mich. Ct. App. 1978). Other common restrictions include minimum square footage, minimum setbacks from lot lines, and height limits.22See, e.g., Friends of Lubavitch, Inc. v. Zoll, No. 03-C-16-008420, 2018 Md. App. LEXIS 972, at *1–2 (Md. Ct. Spec. App. 2018); Richard R.W. Brooks & Carol M. Rose, Racial Covenants and Segregation, Yesterday and Today 4, 7 (Straus Inst., Working Paper No. 08/10, 2010), https://www.law.nyu.edu/sites/default/files/siwp/Rose.pdf [https://perma.cc/2ZQP-7XZY]. In other words, restrictive covenants, where they exist, often place the same lid on housing development through private rights as single-family zoning does through public regulation. And they often contribute de facto to the same racial segregation and wealth inequality that the now unenforceable racially restrictive covenants once pursued de jure.23John Infranca, Singling Out Single-Family Zoning, 111 Geo. L.J. 659, 661 (2023). Indeed, there is evidence that their ancestors, nuisance covenants, were intentionally used toward such ends, albeit often unsuccessfully. See generally Brady, supra note 19 (discussing the history of attempts by wealthier property owners to rely on nuisance covenants to exclude apartment buildings and those who would dwell in them from locating nearby).

These covenants are incredibly and increasingly common.24See infra Section II.A. So, even if zoning restrictions were lifted across the entire nation, a sizeable portion of single-family properties would still be frozen indefinitely, regardless of the demand for housing in that region. And not only that, but there is also reason to suspect that even where such restrictive covenants do not yet exist, homeowners may seek to implement them when faced with the fear, post “upzoning,” that they will lose the “neighborhood character” they have grown accustomed to.25Stephen R. Miller, Opinion, Ending the Single-Family District Isn’t So Simple, Star Tribune (Jan. 2, 2019, 5:53 PM), https://www.startribune.com/ending-the-single-family-district-isn-t-so-simple/503820202 [https://perma.cc/9F2V-XNFA]; Betsy McCaughey, Opinion, McCaughey: Dems Targeting Suburban Homeowners, Bos. Herald (Feb. 13, 2023, 12:07 AM), https://www.bostonherald.com/2023/02/13/mccaughey-dems-targeting-suburban-homeowners [https://perma.cc/G5M2-5YCN]; Mark Weinter, New York’s Affordable Housing Plan Bypasses Local Zoning, Governing (Feb. 2, 2023), https://www.governing.com/community/new-yorks-affordable-housing-plan-bypasses-local-zoning [https://perma.cc/XFL4-VJSH]; Nordea Lewis, ‘I Don’t Want to See My Neighbors Pushed Out’: Hampden Residents Voice Concerns on New Development, WMAR2 Balt. (Oct. 27, 2022, 10:27 PM), https://www.wmar2news.com/news/local-news/i-dont-want-to-see-my-neighbors-pushed-out-hampden-residents-voice-concerns-on-new-development [https://perma.cc/QV9Y-AUTZ]. At the end of the day, policymakers could all get on board with unlocking more housing, but an extensive network of private legal apparatuses can still say no. If we care about finding real and lasting solutions to the housing crisis, we must explore freeing up property from longstanding private rights that often entrench inefficiencies and stand in opposition to the public good. At the same time, to avoid significant demoralization costs and psychic harm to property owners resulting from upended expectations in the short term (and to avoid certain political failure), we must find a path forward that seeks to honor owners’ reasonable expectations.

The legal academy has given this problem limited attention. At a high level of generality, many scholars have commented on some of the problems associated with restrictive covenants and other forms of deadhand control over real property.26Julia Mahoney, Perpetual Restrictions on Land and the Problem of the Future, 88 Va. L. Rev. 739, 770, 778 (2002); Gerald Korngold, Resolving the Intergenerational Conflicts of Real Property Law: Preserving Free Markets and Personal Autonomy for Future Generations, 56 Am. U. L. Rev. 1525, 1528–29 (2007). See generally James L. Winokur, The Mixed Blessings of Promissory Servitudes: Toward Optimizing Economic Utility, Individual Liberty, and Personal Identity, 1989 Wis. L. Rev. 1 (1989). Additionally, a few scholars have noted that restrictive covenants could swoop in where single-family zoning leaves off.27Brady, supra note 19, at 1681–82; Miller, supra note 25; John Infranca, The New State Zoning: Land Use Preemption Amid a Housing Crisis, 60 B.C. L. Rev. 824, 874–75 (2019). More specifically, Robert Ellickson recently provided a helpful synopsis of the problem of “stale” restrictive covenants of all forms and of some potential paths forward for those burdened by them to free themselves.28See generally Ellickson, supra note 18.

And, just this year, Gerald Korngold explored the likely prominence and harms of single-family covenants in the wake of zoning reform.29See generally Gerald Korngold, Repealing Single-Family Zoning Is Not Enough: A Proposal for Removing Existing Parallel Private Covenants for Violating Public Policy, 89 Mo. L. Rev. 1 (2024). He evaluates whether invalidating such covenants might run afoul of the Takings Clause, and then urges courts to consider such covenants void as contrary to public policy under certain circumstances.30See generally id.

But the literature has only just begun to explore the nature and the depth of the problem as it relates to the housing crisis. This Article attempts to do so. It makes multiple new contributions to the discussion on the housing crisis and private law. First, it explains how the abundance of restrictive covenants contributes to housing “gridlock,” a scenario in which fragmented and dispersed ownership of private rights prevent productive and socially beneficial land use. Second, this Article employs an underutilized approach to both describe that problem and explore solutions. Building on the work of Stephanie M. Stern and Daphna Lewinsohn-Zamir, who have provided a general primer on the relationship between psychology and property law,31See generally Stephanie M. Stern & Daphna Lewinsohn-Zamir, The Psychology of Property Law (2020). this Article reviews contemporary psychological research on ownership, possession, and subjective attachment. It then applies that research to, first, suggest why restrictive covenants’ hold may be stronger and more damaging than some commentators have suggested and, second, to evaluate strengths and weaknesses of legal and policy tools to escape that hold. One of the primary insights this Article hopes to convey is that housing gridlock from restrictive covenants is a deeply psychological problem, and that any solution must account for the driving psychological phenomena.

This Article proceeds in two parts. Part I explains how restrictive covenants contribute to gridlock. It evaluates housing gridlock—how we got here—from legal, financial, and especially, psychological perspectives. Part II explores how to break out of that gridlock. It surveys potential paths forward from within the common law of property as well as certain regulatory or legislative approaches. Then, relying on common law, economic, and psychological principles, it constructs a new solution. That solution involves a statutory mechanism, with procedural and remedial components, that facilitates the bypassing of covenants when it would be efficient to do so. Finally, it aims to respect the preexisting interests of covenant beneficiaries along the way. In the process of explaining this proposed solution to housing gridlock, this Article explores the advantages—especially psychological—that the solution presents compared to other options.

I.  THE TRAGEDY OF THE HOUSING ANTICOMMONS

This Section describes the housing gridlock that restrictive covenants have helped usher in. Section I.A identifies the concept of the tragedy of the “anticommons” and explains why it applies to the modern housing market generally and residential restrictive covenants specifically. Section I.B explores three interrelated social-scientific explanations of how we arrived at housing gridlock—a legal one, a financial one, and a psychological one—and it explains why the psychological explanation is particularly important. This Article thereby draws in part, but not exclusively, on law & behavioral economics to describe the problem, with a special emphasis on underappreciated psychological forces at play.

A.  Gridlock as a Feature of the Modern Housing Market

Few concepts are more influential to the foundations of private property law than that of the “tragedy of the commons.”32Numerous cases cite the concept. See, e.g. NRDC v. Costle, 568 F.2d 1369, 1378 (D.C. Cir. 1977); New York v. Evans, 162 F. Supp. 2d 161, 167 (E.D.N.Y. 2001); Mojito Splash, LLC v. City of Holmes Beach, 326 So. 3d 137, 143 (Fla. Dist. Ct. App. 2021); Verizon v. FCC, 740 F.3d 623, 666–67 (D.C. Cir. 2014) (Silberman, J., concurring in part). Numerous law journal articles do as well. See, e.g., Shi-Ling Hsu, What Is a Tragedy of the Commons? Overfishing and the Campaign Spending Problem, 69 Alb. L. Rev. 75, 76 (2005); Nathaniel Wolloch, Before the Tragedy of the Commons: Early Modern Economic Considerations of the Public Use of Natural Resources, 19 Theoretical Inquiries L. 409, 409–10 (2018); Amy Sinden, The Tragedy of the Commons and the Myth of a Private Property Solution, 78 U. Colo. L. Rev. 533, 533–34 (2007). The basic idea is that when a resource (such as a freshwater lake that is useful for drinking and recreation) is unowned and subject to open access, each individual with access to it has a personal incentive to use it heavily and not limit themselves.33Garrett Hardin, The Tragedy of the Commons, 162 Science 1243, 1243–46 (1968). “Every other person might use it up first,” the thinking goes, “so if I don’t use it now I might never get any.” So, even though it would be in everyone’s interests to preserve the common resource for years and generations to come, the resource is instead spent or spoiled rapidly.34Id.

Theorists traditionally have proposed two main solutions to avoid the tragedy of the commons.35See Sinden, supra note 32, at 533–34. The first is centralized control by a public or governing body, giving authority and power to a single entity to perhaps allow public access to the resource, but limiting such access as necessary to preserve it.36Id. See generally Elinor Ostrom, Governing the Commons (1990). The second is privatization, allowing private individuals or entities to own all or portions of the resource.37See Sinden, supra note 32, at 533–34. See generally Harold Demsetz, Toward a Theory of Property Rights, 57 Am. Econ. Rev. 347 (1967) (arguing that private property comes to exist so that individuals can internalize the externalities of their resource use). Under such an arrangement, the owners theoretically have an incentive to use the resource productively without spoiling it.38See sources cited supra note 37.

But there’s another tragedy that is far less famous. Michael Heller called it “the tragedy of the anticommons.”39Michael Heller, The Gridlock Economy 1 (2008). While the tragedy of the commons describes an undesirable consequence of too much open access, the tragedy of the anticommons sits at the other end of spectrum as an undesirable consequence of, in a sense, too much private ownership.40Id. at 1–9. Here’s the idea: Certain public goods rely on the use of property. And, indeed, some of those goods cannot manifest unless either enough property is pooled together, or a smaller unit of property is utilized or developed to a substantial degree.41Id. Think of public transportation corridors, large parks, or stadiums. But if property is divided among too many private owners, or if the rights concerning one unit of property are spread among too many owners, then for those greater public goods to manifest, more people must be willing to relinquish their right to exclude others from using their small slice of the pie.42Id.

A fictional example can make this more concrete. Imagine a swimming pool that is not owned by one person or entity, but instead is owned by hundreds of different people in individual chunks. Each person owns around one square meter and has complete and total rights over their small space. If even a handful of the individual owners decide to fence off their small portion of the pool from anyone else, they could prevent all other people from swimming laps. Because ownership of the pool is fragmented across many parties, the resource—the pool itself—cannot reach its full productive potential.

Although the swimming pool example is farfetched, anticommons and gridlock can calcify under the radar in ways that shape the real world. Michael Heller points to biomedical patents.43Id. at 4–5. Years ago, a large pharmaceutical company developed what looked to be a promising treatment for Alzheimer’s Disease.44Id. But the treatment never made it out of the lab because it wasn’t financially feasible for the company to pay off all the holders of patents of necessary component technologies.45Id. The U.S. government developed the patent system to incentivize innovation in the world of biomedical technology. And it did. Driven by patents as rewards, we underwent a revolution in biomedical technology in the late 20th century.46Id. But sooner or later it rings true that “there is nothing new under the sun”47Ecclesiastes 1:9. and one must build with the materials that already exist. When too many people have the right to exclude others from using those building blocks, building a wall from the bricks can approach impossibility.

Hence the term “gridlock.”48See generally Heller, supra note 39. We might have the building blocks to assemble resources that would massively improve public welfare, but whether we reach our goal depends on whether assembling those building blocks is feasible or whether they will instead stick in place, widely dispersed. I argue that we are witnessing a tragedy of the anticommons in housing too. Dispersed ownership rights stifle the production of the housing necessary to meet demand and ensure greater affordability and stability for all.

The most well-documented source of housing gridlock is regulatory: specifically, zoning and related regulations.49See Infranca, supra note 23, passim. Most cities are governed by a zoning code, which limits land use in different geographic areas to specific uses—such as single-family housing only.50Scott Beyer, Modern Zoning Would Have Killed Off America’s Dense Cities, Forbes (May 25, 2016, 4:05 PM), https://www.forbes.com/sites/scottbeyer/2016/05/25/modern-zoning-would-have-killed-off-americas-dense-cities [https://perma.cc/Z5JY-3CX9]. And in most cities, a property owner who seeks to use their property in violation of the zoning restrictions may request a “variance”: permission from the local government to, say, build a quadplex or small apartment building in an area otherwise restricted to single-family houses only.51Einstein et al., supra note 10, at 15. Often, a variance request triggers an elaborate comment and review process;52Id. even for the smallest proposed projects, house owners can voice their opposition and demand studies on traffic, environmental impact, water runoff or burden on public utilities.53Id. at 17. Because this process is expensive and time-consuming for the property developer, often the result is either that housing is never built, or the end product contains fewer units at a higher selling price than originally planned.54See, e.g., id. at 3, 24. That being so, gridlock and anticommons can follow from not only too many people owning pieces of what we’d typically think of as “property”—a piece of land, a patent or copyright, for example—but also from too many people or entities having the right and power to stop others’ use of their property, demonstrating the fragmentation of property rights.55See, e.g., Heller, supra note 39, at 145–48 (discussing closed storefronts in post-Soviet Russia).

However, loosening zoning restrictions will not necessarily suffice to escape gridlock. In many regions in and around metropolitan centers, restrictive covenants could pick up much of the slack. Imagine a developer buys a property and wants to build multifamily housing—a condo building, four townhouses, or even just a duplex. The property isn’t subject to any notable zoning restrictions. But it is within the bounds of a homeowners association (“HOA”). And within the HOA’s Covenants, Conditions, and Restrictions (“CC&Rs”) is a provision stating that all properties may only be used for the operation of one residence.

In this case, however, there is no variance process. The developer could petition the HOA board to allow multi-family housing, but (1) the board, made up of other homeowners in the subdivision, has no interest in losing the single-family, detached-home character of their neighborhood;56Einstein et al., supra note 10, at 13, 34. and (2) in any event, CC&Rs often can be amended only by a supermajority vote of all property owners within the association,57See, e.g., Declaration of Covenants, Conditions and Restrictions for Bradford Manor Subdivision, Walton Cnty., Ga. § 9.02(b) (Apr. 5, 2001) (on file with author) [hereinafter Bradford Manor CC&Rs]; Amended and Restated Declaration of Covenants Conditions and Restrictions for Westview Estates, Polk Cnty., Or. § 5.3 (Oct. 16, 2022) (on file with author) [hereinafter Westview Estates CC&Rs]. so if the developer wanted the legal right to build even a duplex, numerous owners in the vicinity would have to agree to the change. The chances of succeeding are slim.

Perhaps that would strike some people as just fine. We are talking about a single-family neighborhood, so maybe the property owner should simply find somewhere else to build the condos, townhouses, or duplexes. And anyway, the restrictive covenants should come as no surprise, right? The developer should have known what it was getting into. But for at least three reasons, restrictive covenants have contributed to and will continue to produce housing gridlock—not only as to an individual piece of property here and there, but also for residential properties and housing supply at local, regional, and national levels.

First, HOAs and single-family covenants are common and even rising in popularity. Single-family homes are the most prevalent form in the United States housing stock, representing around two-thirds of all units.58American Community Survey, Table DP04: Selected Housing Characteristics, U.S. Census Bureau (2021), https://data.census.gov/table?q=DP04 [https://perma.cc/PZ27-S2GG]. And around a quarter of all single-family homes in the United States are subject to HOA governance.59Found. for Cmty. Ass’n Rsch., 2021–2022 U.S. National and State Statistical Review (2022), https://foundation.caionline.org/wp-content/uploads/2022/09/2021-2CAIStatsReviewWeb.pdf [https://perma.cc/FT65-6C4J]. That share is poised to increase in the coming years—over 60% of new, single-family homes are subject to HOA governance, and the number is over 80% for new homes constructed in subdivisions.60Wyatt Clarke & Matthew Freedman, The Rise and Effects of Homeowners Associations, 112 J. Urb. Econ. 1, 1, 7 (2019). Moreover, because a single-family restriction is one of the most common covenant terms for HOAs,61Stephen R. Miller, Dangerous Ideas for Land Use Laboratories #1: Preempt the Single-Family Residence Restrictive Covenant, LPB Network (Jan. 27, 2020), https://lawprofessors.typepad.com/land_use/2020/01/dangerous-ideas-for-land-use-laboratories-1-preempt-the-single-family-residence-restrictive-covenant.html [https://perma.cc/3BZK-NV9P]; Ellickson, supra note 18, at 1841 n.40. it is reasonable to assume that the large majority of single-family homes under HOA governance are bound by a restrictive covenant to stay that way. The practical effect of HOAs’ increasing popularity for single-family home developments, and the prevalence of single-family use restrictive covenants, is that it is increasingly difficult to purchase a home that could one day be expanded into multiple units.62Winokur, supra note 26, at 33 (“Further, as increasingly standardized servitude regimes proliferate, a marginal consumer will confront uniformity not only within developments, but among competing developments. Thus, the alternative sources of supply contemplated in the economic defense of form contracts evaporate for buyers within particular housing markets.”). And that is so even without taking into account the prevalence of other sorts of covenant terms that in practice allow only for single-family development—like minimum square footage requirements for each residence.63Sara C. Bronin, Zoning by a Thousand Cuts, 50 Pepp. L. Rev. 719, 775 (2023). Covenant terms other than those explicitly limiting use to single-family residences can play a significant role in limiting housing production. This Article focuses primarily on explicit single-family restrictions because the connection between them and housing gridlock is more direct. But much of what the Article discusses and suggests could apply to any restrictions that are de facto single-family restrictions. For this reason, I, like Lee Anne Fennell and James Winokur, am skeptical of arguments that decentralized governance by collections of private covenants could meaningfully provide for an open “market” of governance options for homebuyers.64See Lee Anne Fennell, Contracting Communities, 2004 U. Ill. L. Rev. 829, 856–58 (2004); Winokur, supra note 26, at 56–60. But cf. Robert C. Ellickson, Cities and Homeowners Associations, 130 U. Pa. L. Rev. 1519, 1520 (1982) (describing membership in HOAs as “perfectly voluntary”); Clayton P. Gillette, Courts, Covenants, and Communities, 61 U. Chi. L. Rev. 1375, 1379–82 (1994) (suggesting that HOAs could provide communitarian benefits without significantly harming broader public interests). Increasingly CC&Rs converge on this land-use uniformity and gobble up land as they do so.

Second, some people who own single-family homes either purchase them without recognizing that they are bound by a single-family covenant and the significance of that fact, or eventually come to want freedom from such a covenant when they stand to benefit from its removal.65Winokur, supra note 26, at 59–60. Most people, we can assume, buy a home primarily because they like the home and its general location. And their awareness of HOA restrictions and amenities may not extend far past knowledge of the monthly or annual fee, as well as the access owners have to common areas, like a swimming pool.66Id.; see also Clarke & Freedman, supra note 60, at 2 (citing Evan McKenzie, Privatopia: Homeowner Associations and the Rise of Residential Private Government (1994)). Over time, research shows, homeowners often come to disfavor many restrictions governing their private land use.67Winokur, supra note 26, at 59–60; see also Clarke & Freedman, supra note 60, at 2 (noting that some buyers are unaware of extensive restrictive covenants when they purchase a home).

Third, an obvious but important fact: land is a scarce resource.68See, e.g., Eric F. Lambin & Patrick Meyfroidt, Global Land Use Change, Economic Globalization, and the Looming Land Scarcity, 108 P.N.A.S. 3465, 3466 (2011), https://www.pnas.org/doi/10.1073/pnas.1100480108 [https://perma.cc/F3TH-U3LW]. As some land gets developed, the overall share of available land shrinks—especially land well-connected to amenities. Every property and subdivision developed for single-family use accelerates land consumption and constrains the resources available for future housing development.69See Samuel Brody, The Characteristics, Causes, and Consequences of Sprawling Development Patterns in the United States, Nature Educ. Knowledge (2013), https://www.nature.com/scitable/knowledge/library/the-characteristics-causes-and-consequences-of-sprawling-103014747 [https://perma.cc/222C-U8KU]; Winokur, supra note 26, at 33.

Of course, these phenomena conspire to lock individual land parcels into perpetual hosts of one single-family home. But the aggregate effects could be substantial too. We have a housing shortage, but where new housing is being constructed, it’s disproportionately single-family and subject to a covenant keeping it that way.70See, e.g., Clarke & Freedman, supra note 60, at 1, 7. Such development, being less dense and involving excessive land consumption, contributes to suburban sprawl.71See Brody, supra note 69. Radiating out from city centers, more and more large land agglomerations are “filled” with sparse housing; and because of covenants the gaps are not easily filled in later. That means that new housing necessarily stretches away from population centers rich in jobs, recreation, social services, and other amenities.72Infranca, supra note 23, at 661. This phenomenon disproportionately burdens lower-wealth residents for whom greater transportation costs and longer commuting times are especially difficult to manage.73Id.; Einstein et al., supra note 10, at 9.

And it likely will not improve on its own. Although covenant authors can specify that restrictions expire after a certain number of years, they usually do not do so.74Winokur, supra note 26, at 4. Instead, restrictions often apply unless removed through an onerous process, in effect sticking to the corresponding land indefinitely. So, the more HOAs, the more single-family restrictions, and the more land is consumed by indefinite limitations on housing.

Thus, private rights can prohibit cities and surrounding regions from meeting housing demand.75Also, as residents move farther away from city centers and outside of municipal boundaries, cities miss out on part of their tax base. A society filled with single-family land use may function well for a time. But that time is likely to be short if the regional population is to increase. Unused land is depleted, so would-be developers cannot simply buy another lot and build it up. The land that is already occupied is stuck in a web of private promises keeping it from being redeveloped to accommodate more people. Thus, gridlock is inescapable. Properties where the market would offer top dollars to purchase and convert into multiple housing units cannot budge. Too many people—most often HOA boards and members—have the unilateral right to say no.

B.  Why Did We Get Gridlock?

The previous Section explained housing gridlock and how restrictive covenants contribute to it. But there is a more fundamental, or at least chronologically prior, question: Why do we have so many HOAs and restrictive covenants? No one factor or force can explain it all, but this Section discusses three separate, yet related explanations: one legal, another based on financial considerations and incentives, and a third psychological. It explains why the psychological element is particularly critical to grasp.

1.  The Legal Backdrop

Non-possessory rights in land date back several centuries.76Winokur, supra note 26, at 10–12; Korngold, supra note 26, at 1534–35. In fifteenth-century England, much land that up until then had stood open for passage was enclosed, and so free travel became more difficult.77Winokur, supra note 26, at 10–12. In response, the “easement” was born as a legal right of passage through land under private control.78Id. However, for centuries, those non-possessory interests rarely extended to provide rights, allowing people to limit how others used other properties.79Id.

Through the 1800s, the Industrial Revolution and corresponding urbanization brought people closer together and also presented a new array of land uses that could have unwelcome spillover effects on neighboring properties.80Id. Non-possessory property interests that could be enforced to limit neighbors’ land uses proliferated.81See id. at 13. But even then, courts at first would apply such restrictions to the parties to the original agreement only.82Id. That changed in the mid-1800s, when American courts began to enforce these restrictive covenants not only against the parties to the initial agreement, but also to successive owners of the relevant properties.83Id. The thought at the time was to expand property markets, turning contract-created development rights and limitations into transferable commodities.84Id. at 14. Such a framework, proponents thought, would help efficiently allocate land uses to properties where they were most suited.85Id. And in some ways, the commodification and fragmentation of ownership rights did bring about desirable results. For example, they allowed for the birth and proliferation of condominiums, which made home ownership more affordable to many. See Existing-Home Sales, Nat’l Ass’n Realtors, https://www.nar.realtor/research-and-statistics/housing-statistics/existing-home-sales [https://perma.cc/C94M-7E7E] (providing data to show that the price of condos and co-ops tends to be substantially cheaper than the price of single-family homes).

That history is our inheritance, and today the legal rules surrounding restrictive covenants make it quite simple to lock land uses into their present states for long periods of time. For a covenant to run with the land, one traditionally needs four things: (1) intent for a burden on the property to run to future owners; (2) horizontal privity (typically satisfied if the restrictions were placed when the properties were held in common and at an initial land conveyance or sale); (3) vertical privity (meaning a chain in conveyance of the property all the way back to the owners at the time the burden was created); and (4) touch and concern the burdened land (a notoriously fuzzy requirement, which for the purposes of this Article can be understood as a requirement that the burden somehow involves land use as opposed to a non-land-related restriction on owner behavior).86Neponsit Prop. Owners Ass’n v. Emigrant Indus. Sav. Bank, 15 N.E.2d 793, 795, (N.Y. 1938).

To simplify, this standard generally means that if at the initial sale of a piece of property, the right people aim to restrict how the property can be used into the future, they just have to say so, and future owners will be bound in precisely the same way. If someone owns two adjoining lots and wants to sell one, they can simply agree with the first purchaser that the transferred lot can only be used for one single-family home, unless the owner of the other property gives express permission to do otherwise. If that restriction is recorded with the deed to the sold property, then it can potentially stand for generations, even after all involved properties are sold again and again.

The most prominent source of single-family covenants today likely comes from HOA CC&Rs. Such restrictions are quite easy to create even though they can burden large collections of properties in one shot. The standard process is that a developer purchases a large piece of land, then subdivides it in accordance with local legal requirements.87Winokur, supra note 26, at 56–59. In so doing, the developer, as the sole owner, has the right to set conditions on the properties’ future use, and can establish that the properties will all be members of a common-interest community (for example, they may be governed by an HOA and perhaps share in some common amenities).88Id. The developer can then outfit each subdivided piece with a house and all requisite utility infrastructure. The properties at that point are worth far more taken together than the large portion of land the developer initially bought.89See Clarke & Freedman, supra note 60, at 2 (discussing the “HOA premium”); see also Jenny Schuetz, Brookings Inst., To Improve Housing Affordability, We Need Better Alignment of Zoning, Taxes, and Subsidies 2 (2020), https://www.brookings.edu/wp-content/uploads/2019/12/Schuetz_Policy2020_BigIdea_Improving-Housing-Afforability.pdf [https://perma.cc/A6C9-65HP] (providing an example of the relative individual and aggregate costs of different housing structures). So the developer sells each home. If the developer specified in the community’s CC&Rs that each property is limited to single-family use, then each would-be purchaser must simply accept that restriction or look elsewhere.90Winokur, supra note 26, at 56–59. And because the geographic areas where the developer acquires large chunks of land are often sparsely populated at the time and thus better suited to single-family homes, it is quite common for a developer to choose to include single-family covenants in a subdivision’s founding documents. In the short term (which is what matters to the developer), there is a market for it.91See id. at 3.

Once the developer has sold off all the properties, it no longer retains control over how each property is used. That authority, with the pre-drafted restrictions, typically transfers to the HOA or the community members when the last property is sold, if not before.92See, e.g., Bradford Manor CC&Rs, supra note 57, §§ 3.03(b), 8.01, 9.02; Westview Estates CC&Rs, supra note 57, § 1. However, the restrictions tend to stick. There are many reasons for this, some of which will be explored in more detail below. But from a legal standpoint, the most important factor is that under the standard documentation for these common-interest communities, CC&Rs can only be amended through rather difficult means. It is typical for the restrictions to be amendable only by supermajority vote of the homeowners in the community.93See, e.g., Bradford Manor CC&Rs, supra note 57, § 9.02(b); Westview Estates CC&Rs, supra note 57, § 5.3. It is not immediately obvious why in any given case the developer makes it so difficult to change restrictions. As discussed in Section I.B.2, infra, perhaps the developer suspects that buyers will pay more for the assurance that their neighborhood will not change in years to come. Or perhaps it is because such odious provisions are simply features of standard form CC&Rs today.94See Winokur, supra note 26, at 58 (noting the proliferation of reliance on standard forms in servitude regimes). Either way, an entire new community is left with the restrictions, and it often takes nearly the entire community to be of the same mind to change the restrictions to even a single property.

In sum, the American legal system dove headfirst into commodification of fragmented land interests. In doing so, it made it quite simple for owners to restrict use of land for generations to come and to divide the power to restrict among many parties. And in the modern day, large developers use the same basic legal tools to cement indefinitely the single-family use of potentially hundreds of properties at a time—such that even when the developer is long gone, the law ensures that meaningful change is unlikely.

2.  Financial Incentives

The financial considerations surrounding housing gridlock are essential but insufficient to understand the problem. Although they might partially explain why single-family restrictive covenants became so prevalent, they sometimes fail to explain why those covenants persist in certain locations. Basic financial considerations may suggest that restrictive covenants are eventually discarded when they expend their basic utility.95See Ellickson, supra note 18, at 1848–51. But that often does not play out in practice.

As the preceding Section suggests, financial incentives often encourage developers to mandate single-family use at the beginning of a new neighborhood’s life. For developers with massive capital and hopes of substantial profit, the goal is to construct, and then sell, a lot of homes.96As a general rule, the more homes are constructed on a parcel, the lower the per-unit construction cost is, but the greater the total revenue from selling all of the homes. See Schuetz, supra note 89, at 2; see also Alex Baca, Patrick McAnaney & Jenny Schuetz, “Gentle” Density Can Save Our Neighborhoods, Brookings Inst. (Dec. 4, 2019), https://www.brookings.edu/articles/gentle-density-can-save-our-neighborhoods [https://perma.cc/AU7X-AVVE]. One option would be to buy one or multiple smaller plots of land in an urban center where demand for multi-family housing near preexisting amenities is high. Some developers do take that path. But if other developers have already gobbled up city land for that purpose, or if local zoning provisions prevent a developer from building enough housing units to turn a large enough profit, then they might look elsewhere simply as a matter of necessity. In growing regions, that means looking to the open pastures of suburbs and beyond, where demand for housing is beginning to rise but land is still relatively cheap.97William Hawk, Expenditures of Urban and Rural Households in 2011, U.S. Bureau Lab. Stat. (Feb. 25, 2013), https://www.bls.gov/opub/btn/volume-2/expenditures-of-urban-and-rural-households-in-2011.htm [https://perma.cc/BL2Q-HPAZ] (“In many rural areas, land is plentiful, so prices tend to be lower.”); see also Schuetz, supra note 89, at 4 (explaining that “[l]and is most expensive in city centers”). If they cannot build upward, they build outward.

That may partially explain why developers build so many single-family homes in expansive, sprawling subdivisions. But it does not necessarily account for why they restrict all the homes in the subdivision to single-family use. No one forces them to do so.

Although more research could be done as to why developers take that step, there are a couple of related possible explanations. First, and most importantly, there is reason to believe that, at least at the time the developer is first selling off the homes, single-family restrictions do not materially diminish the values of the properties they constrain. Most buyers of newly constructed single-family homes in sparsely populated areas are not interested primarily in an investment opportunity; they are looking for a place to live.98See, e.g., Sophie Kasakove, Why the Road Is Getting Even Rockier for First-Time Home Buyers, N.Y. Times (Apr. 25, 2022), https://www.nytimes.com/2022/04/23/us/corporate-real-estate-investors-housing-market.html [https://perma.cc/8JYG-79LL] (noting the struggle between first-time home buyers who seek to live in the purchased home versus corporate investors seeking to purchase properties to rent). Thus, they perhaps do not care at the outset whether tomorrow or years from now they could be prevented from converting their home into a duplex, or from adding an accessory dwelling unit. Relatedly, buyers are not simply buying a home subject to a restriction; they are buying a home that is surrounded by other homes subject to a restriction. Someone looking for a home in a sprawling development in a sparsely populated region might prefer some assurance that their neighborhood character will remain stagnant.99See sources cited supra note 25 and accompanying text.

But there comes a time when land cannot keep up with demand. As the population increases in a region that was once comparatively empty, the public interest demands more homes. As urban centers fail to accommodate the housing needs of prospective residents, more people seek to move into outer ring suburbs.100Carlos Waters, Suburban Sprawl Is Weighing on the U.S. Economy, CNBC (Feb. 1, 2022), https://www.cnbc.com/2022/02/01/how-suburban-sprawls-single-family-home-zoning-limits-housing-supply.html [https://perma.cc/SG7H-JGYQ] (discussing how limited dense housing supply contributes to sprawl); Schuetz, supra note 89, at 3–5 (noting that housing tends to be more expensive in cities). If the land is already substantially filled in these areas by single-family homes on large lots, housing prices will climb.101Charles Nathanson, Raven Molloy & Andrew Paciorek, Would Housing Cost Less If It Were Easier to Build New Homes? Surprisingly, Not Much, Kellogg Sch. Mgmt. at Nw. Univ.: Insight (Feb. 2, 2022), https://insight.kellogg.northwestern.edu/article/housing-costs-supply-demand-affordability [https://perma.cc/MU26-AL6X] (noting that one driver of high housing cost is low supply). Under such circumstances, developers could easily fill multiple housing units per lot.

Thus, once housing demand increases sufficiently, single-family covenants may come to deflate individual property values. As a general rule, per-unit construction costs decrease as the number of units in the structure increase.102See Baca et al., supra note 96. Further, the more housing units that can be constructed, the more total revenue is up for grabs.103Id. For those reasons, a property that can host twenty, ten, or even two units is typically more valuable than a property that can only host one.104See Schuetz, supra note 89, at 4. That is the basic economic incentive for developers to meet higher housing demand. And although no comprehensive study has been done to measure the effect on a property’s value of lifting a single-family covenant, there is analogous research regarding zoning.

Those numbers seem to point one way: “upzoning” increases property values where housing demand is high. This is especially true for single-family homes. One study found that in Minneapolis, the citywide upzoning initiative increased the value of single-family property by around 3%.105Daniel Kuhlmann, Upzoning and Single-Family Housing Prices: A (Very) Early Analysis of the Minneapolis 2040 Plan, 87 J. Am. Planning Ass’n 383, 391 (2021). Another study found that in Chicago, properties near transit services saw a dramatic increase in value from upzoning—by 15–20%.106Yonah Freemark, Upzoning Chicago: Impacts of a Zoning Reform on Property Values and Housing Construction, 56 Urb. Affs. Rev. 758, 758–89 (2020), https://journals.sagepub.com/doi/10.1177/1078087418824672 [https://perma.cc/G36U-VUAV]. A study focused on Auckland, New Zealand also saw notable value increases.107Ryan Greenaway-McGrevy, Gail Pacheco & Kade Sorensen, The Effect of Upzoning on House Prices and Redevelopment Premiums in Auckland, New Zealand, 58 Urb. Stud. 959 passim (2021), https://workresearch.aut.ac.nz/__data/assets/pdf_file/0010/535096/Effect-of-upzoning.pdf [https://perma.cc/2MGC-DYS7].

Why, then, does the gridlock persist? If properties tend to jump up in value when restrictions limiting them to single-family use are lifted, why do single-family restrictive covenants still dominate so much of our single-family housing stock? There may be several reasons, including the high transactions costs someone who wants to bypass a covenant would incur by seeking waivers from all covenant beneficiaries. But from the perspective of all other parties involved, one of the potential causes is that those who would have to decline to enforce the covenant could in the short term suffer a drop in property value, or at least a drop in the subjective value they place on their property and on living in that neighborhood. In other words, although unlocking growth on a given parcel increases that parcel’s value, surrounding parcels might not similarly benefit, especially if they remain restricted. That is one possible reason why even though upzoning a particular property tends to increase its value, properties that are part of an HOA subject to single-family restrictions are sometimes valued higher than comparable properties not within an HOA.108Clarke & Freedman, supra note 60, at 2. Although the increase in value is perhaps a feature of the additional services as much as the restrictions. See id. Multiple studies have concluded that constructing new multi-family housing on a particular piece of property tends to depress the rent of nearby properties.109See Brian Asquith, Evan Mast & Davin Reed, Local Effects of Large New Apartment Buildings in Low-Income Areas, 105 Rev. Econ. & Stat. 359, 373–74 (2023); Evan Mast, W.E. Upjohn Inst. for Emp. Rsch., The Effect of New Market-Rate Housing Construction on the Low-Income Housing Market 1 (2019) (“New construction opens the housing market in low-income areas by reducing demand.”), https://research.upjohn.org/up_policybriefs/13 [https://perma.cc/AYN7-22GB]; Xiaodi Li, Do New Housing Units in Your Backyard Raise Your Rents?, 22 J. Econ. Geography 1309, 1310 (2021). This result is no surprise, and is in fact a core goal of upzoning’s proponents.110See Nathaniel Meyersohn, The Invisible Laws That Led to America’s Housing Crisis, CNN (Aug. 5, 2023), https://www.cnn.com/2023/08/05/business/single-family-zoning-laws/index.html#:~:text=Strict%20single%2Dfamily%20zoning%20regulations,opportunities%2C%20researchers%20and%20advocates%20say [https://perma.cc/SB74-39SD] (noting that “[s]trict single-family zoning regulations limited housing supply [and] artificially raised prices”). Thus, adding to the housing stock tends to reduce, or at least slow the increase of, the price of housing units that would compete against the new housing for occupants. This is perhaps a laudable result for the public at large, but one that is unwelcome to local homeowners centrally concerned with their own property’s value.

Yet, it is unclear whether such an effect holds true when looking at the value of surrounding homes that are not in the arena to compete for multi-family occupants, but that instead simply remain single-family properties. In theory, if enough potential buyers of a single-family home are dissuaded by the presence of multi-family housing nearby, then a home that remains a single-family property while its neighbors turn into multi-family homes might decrease in value. The evidence, however, struggles to show that such an effect plays out in practice. Again, borrowing from the zoning context, a survey of parts of the greater Raleigh, North Carolina area suggested that the upzoning of property had no significant effect, either positive or negative, on the value of neighboring properties.111Conor Ryan, The Impacts of Upzoning on Property Values in NC, Univ. N.C. Sch. Gov’t: Cmty. & Econ. Dev. (Sept. 1, 2021), https://ced.sog.unc.edu/2021/09/the-impacts-of-upzoning-on-property-values-in-nc [https://perma.cc/JMD3-XSF2]. Admittedly, it can be difficult to isolate the effect of upzoning and determine whether it alone tends to depress the value of neighboring properties. For one, demand for housing is often already quite high in those geographic areas (hence the upzoning decision). Also, upzoning and an increase in multi-family housing units may be accompanied by new amenities like restaurants or stores that can make the neighborhood more attractive to potential buyers.112See generally Henry S. Brown III & Lisa M. Yarnell, The Price of Access: Capitalization of Neighborhood Contextual Factors, 10 Int’l J. Behav. Nutrition & Physical Activity 95 (2013) (finding access to certain food amenities increases property values); Analysis from ATTOM Reveals Fresh Take on Grocery Stores Impacting the U.S. Housing Market, ATTOM (Dec. 23, 2020), https://www.attomdata.com/news/market-trends/home-sales-prices/attom-data-solutions-2020-grocery-store-wars-analysis [https://perma.cc/E64Z-DAY4 ] (finding the same).

On the whole, then, single-family development with single-family restrictive covenants may make some financial sense at the time the covenants are first established. But as a general rule, whenever housing demand increases sufficiently, single-family covenants suppress the values of the properties subject to them. And although homeowners might assume that allowing other nearby properties to host multi-family development could suppress their own home’s value, the actual evidence may not definitively support that theory. What the evidence does support is that if all owners within an HOA could free up their own properties for multi-family use, then they would likely see their property values rise.113See, e.g., Edward L. Glaeser & Joseph Gyourko, The Impact of Building Restrictions on Housing Affordability, Fed. Reserve Bank N.Y. Econ. Pol’y Rev., June 2003, at 21, 35, https://www.newyorkfed.org/medialibrary/media/research/epr/03v09n2/0306glae.pdf [https://perma.cc/8UBC-2ZGT]. But the restrictions persist. Not enough owners within an HOA get on board with the change. Given that this perpetual gridlock is not fully explained by the financial interests of the homeowners, another force must be at play. The next Section turns to it.

3.  Underappreciated Psychological Forces

In general, owners of properties subject to a common single-family covenant stand to financially benefit by the lifting of that restriction. The fact that they do not take the plunge leads to a conclusion that is perhaps intuitive: owners who resist upzoning (and for the purposes of this Article, those who would try to enforce a single-family covenant if given the chance) do so not necessarily because they have calculated a quantifiable financial loss they might suffer, but at least in part because at a psychological level they have an aversion to the change in neighborhood character that the entry or proliferation of multi-family housing might bring about. This perceived change in quality of life can be a driving force even if it does not manifest in quantifiable financial harm. Because human psychology is potentially such a critical impetus behind the prevalence and enforcement of restrictive covenants, any solution to the resulting gridlock would benefit from a review of the relevant psychological principles. This Section serves that purpose.

Property ownership is not only a legal trait; it has deep personal implications to the owner and potential subsequent owners.114See generally Margaret J. Radin, Property and Personhood, 34 Stan. L. Rev. 957 (1982) (exploring connections between property ownership and conceptions of the self and personal identity). That being so, concerns that motivate the retaining, using, and transferring of property are not only financial; they are psychological as well. This Section reviews contemporary psychological research that can help explain why property owners may hold onto possessions and entitlements notwithstanding contrary financial incentives.

People develop psychological attachments to their possessions. From a philosophical standpoint, thinkers from Hegel115M. Blake Wilson, Personhood and Property in Hegel’s Conception of Freedom, 1 Polemos 68, 68–91 (2019), https://philarchive.org/archive/WILPAP-29 [https://perma.cc/CR5X-HJ46]. to Margaret Radin116Radin, supra note 114, at 957–59. have argued that owning property is a prerequisite for human freedom, because at a psychological level people begin to associate what they own with who they are. Objects become part of the subject; material things contribute to immaterial “identity.” In other words, people need property to develop a fuller concept of self-personhood.

Psychological research supports this theory to a degree, particularly through two related key concepts: the “Endowment Effect” and the “Mere Ownership Effect.”117See Matthias S. Gobel, Tiffanie Ong & Adam J.L. Harris, A Culture-by-Context Analysis of Endowment Effects, 36 Proc. Ann. Meeting Cognitive Sci. Soc’y 2269, 2270–71 (2014); Jozef M. Nuttin, Jr., Affective Consequences of Mere Ownership: The Name Letter Effect in Twelve European Languages, 17 Eur. J. Soc. Psych. 381, 381–400 (1987); Michal Bialek, Yajing Gao, Donna Yao & Gild Feldman, Owning Leads to Valuing: Meta-Analysis of the Mere Ownership Effect, 53 Eur. J. Soc. Psych. 90, 91–92 (2022). Although the two concepts are not perfectly identical, they refer to a similar phenomenon—essentially, that people tend to place greater value on a thing they own or possess than on the exact same thing if they do not own or possess it.118See generally Gobel et al., supra note 117 (explaining and measuring the Mere Ownership Effect); Nuttin, supra note 117 (same); Bialek et al., supra note 117 (same). This Article refers to the two concepts interchangeably. The classic methodology to measure these phenomena is to construct an experiment of randomly selected people assigned to either be a “buyer” or a “seller” of some item, such as a coffee mug.119See Bialek, et al., supra note 117, at 94. The studies find that the “sellers” possessing the item assign the item a higher value than the buyers not possessing the item do.120Id.

The Mere Ownership Effect thus reveals that if we own or possess something, by that fact alone we will come to think of it as more valuable. The phenomenon applies to all sorts of “possessions,” even nonphysical ones, like intangible entitlements.121Id. And it applies even when a person has not had time to use or become more accustomed to the possession—merely being informed of possession or ownership is enough to trigger the effect.122Id. at 4–6.

Extrapolating to the broader world of property law, the Mere Ownership Effect means that owners of property will tend to value their property above general market value. Thus, they will resist selling an item even if offered the highest value reasonable buyers might pay. The Mere Ownership Effect means, therefore, that the status quo is sticky. Property will tend to stay in the same hands even when buyers are willing to pay the value of the property’s economic utility.

A related psychological principle is that “losses loom larger than gains.”123Daniel Kahneman & Amos Tversky, Choices, Values, and Frames, 39 Am. Psych. 341, 346, 348 (1984). Someone who stands to lose a possession is likely to think that they will lose more than another person would think they gain by acquiring that same possession.124Stern & Lewinsohn-Zamir, supra note 31, at 105; Carey K. Morewedge, Lisa L. Shu, Daniel T. Gilbert & Timothy D. Wilson, Bad Riddance or Good Rubbish? Ownership and Not Loss Aversion Causes the Endowment Effect, 45 J. Experimental Soc. Psych. 947, 947 (2009). And the fear of losing X amount of value is felt more acutely than being denied an opportunity to gain X amount of value.125Stern & Lewinsohn-Zamir, supra note 31, at 105; Morewedge et al., supra note 124, at 947. Some researchers have suggested that this principle of “loss avoidance” is what gives rise to the Endowment Effect.126See Morewedge et al., supra note 124, at 947. Whether or not that is true, research shows that people are bad predictors of the importance of preference satisfaction.127Stern & Lewinsohn-Zamir, supra note 31, at 61. In other words, people tend to think that having their preferences frustrated will be worse than it actually is. Once someone gets attached to anything—an object, a right, or even an idea—then they tend to overestimate how much they need it, and how bad it will be to lose it.128See id.; see also Bialek et al., supra note 117, at 91–92. Robert Ellickson also noted this phenomenon and tied it to restrictive covenants in a recent article. Ellickson, supra note 18, at 1854.

So, people tend to overinflate the value of anything they begin to conceive of as within their grasp. But perceptions of possession are not the only force that binds people to their things.

The more an owner associates a possession with meaningful memories and social relationships, the greater value the owner is likely to place on the possession.129Stern & Lewinsohn-Zamir, supra note 31, at 58, 75. This principle is intuitive—people subjectively value the sentimental. We value our close relationships with other people, and so we value the items that represent or remind us of those connections. Margaret Radin thus has argued that property law should place greater value on possessions like heirlooms, keepsakes, personal body parts, such as donatable organs, and the family home.130Id. at 75; Radin, supra note 114, at 957. She calls these “personhood” property, possessions that are uniquely tied to the owner’s conception of self-identity (for example, a person who identifies as a “parent” is likely to place greater-than-market value on a picture that their child drew for them).131Radin, supra note 114, at 957–61. Psychological research backs this up (although, interestingly, perhaps not as strongly for the family home as for some other possessions).132Stern & Lewinsohn-Zamir, supra note 31, at 75. All in all, our possessions take on greater meaning, and thus greater subjective value, when we come to associate them with things that give life deeper meaning, like family and friendship.

Just as people tend to place greater subjective value on possessions that are associated with meaningful social connections, people generate the most subjective value of all from the social connections themselves.133Id. at 58. How many movies resolve by reminding the characters and the viewers that family, friends, and so forth are most important? The trope exists because it resonates at a psychological level.

Take these psychological phenomena and combine them. Humans place a value premium on things they already own or possess. We fear losing something more than we would desire to gain that same thing if we did not already have it. We overestimate how much we will suffer from losing something. And of all the things we can own, we guard most fiercely that which we associate with social and relational ties. If all these phenomena are borne out in the psychological research, another conclusion flows naturally: if we hold a tool that enables us to stop or slow change to our neighborhood, we will value that tool dearly—more so than we would value the opportunity to get that tool in the first place.

At this point the connection to restrictive covenants is coming into focus, and we can begin to answer the big question: Why do people hold onto restrictive covenants, especially those mandating single-family use only, when relinquishing such a right could make the most financial sense and when HOA restrictions are often unpopular among those subject to them?134Michele Lerner, Why Homeowners Hate Their HOAs, Wash. Post (Oct. 25, 2018), https://www.washingtonpost.com/business/2018/10/25/why-homeowners-hate-their-hoas/https://www.washingtonpost.com/business/2018/10/25/why-homeowners-hate-their-hoas [https://perma.cc/928T-ZM9K] (discussing the unpopularity of HOAs among HOA members).

In part it is because we are wired to see what we have (for example, our neighborhood as it currently stands) as much more valuable than what we would pay to get it in the first place.135Kahneman & Tversky, supra note 123, at 346; Morewedge et al., supra note 124, at 947–48. And, of course, that only addresses the mindset and position of covenant beneficiaries when posed with an opportunity to waive their right of enforcement. Other factors like the high transactions costs of negotiating such waivers might prevent even the builder from seeking freedom from the covenants in the first place. Our neighborhood is ours, so ipso facto it is more valuable.136See generally Bialek et al., supra note 117 (discussing the Mere Ownership Effect). Thus, the people who must consent to the reworking or circumventing of a restrictive covenant are the same people who are psychologically predisposed not to do so, even if offered the equivalent of top market value to relinquish the right. Homeowners who buy a home in an HOA with single-family use restrictions might or might not see the HOA restrictions as a selling point. But once the home, the neighborhood, and the restrictions become theirs, psychological forces cement the status quo.

I.  UNLOCKING HOUSING POTENTIAL

What can be done to break the housing gridlock? Legal mechanisms and economic interests lay the groundwork for gridlock by blanketing vast swaths of land in single-family restrictive covenants. And later, when both financial incentives and the public interest dictate that a change might be needed, the entrenched collection of private property rights collaborates with features of human psychology to stifle that change. This Part explores various options for breaking the gridlock covenants wrought. It evaluates each option through economic and psychological lenses and ultimately arrives at a new solution: in areas without single-family zoning but limited by a single-family covenant (or other covenant with a similar effect like one prescribing minimum square footage per unit and setbacks from property lines),137See Bronin, supra note 63, at 775. a property owner seeking to build multi-family housing can take advantage of a new procedural and remedial mechanism provided by statute. The mechanism in its first stage would encourage covenant beneficiaries to waive their enforcement rights, and in its second stage would limit remaining beneficiaries to one specific remedy that would allow efficient bypassing of covenants while still giving effect to preexisting legal rights. After this Part scans and evaluates a catalog of other options to avoid single-family covenants, it will explain this proposal in more detail and explain its advantages over other options, especially, but not only, from a psychological perspective.

A.  Preexisting Tools for Breaking Gridlock

This Section explores some of the tools already available to private parties, courts, and policymakers for potentially bypassing the gridlock caused by restrictive covenants. It will evaluate the strengths and weaknesses of each from legal, economic, and psychological perspectives. Ultimately, it concludes that the standard toolkit likely will not be enough to accomplish the goal.

1.  Private Bargaining’s Limitations

The simplest (not necessarily the easiest) way to avoid a restrictive covenant is to obtain a waiver—to convince the relevant party or parties not to enforce the covenant. Someone who seeks to develop multiple homes on a property could offer money to the property owners who have the power to enforce a single-family covenant.138See Winokur, supra note 26, at 26–27. No doubt this approach could work sometimes. But one of the central points of the preceding Part is that it often will not. Housing gridlock means that too many people have the right to say no. This is especially true in HOAs that include dozens or hundreds of homes. If a restrictive covenant will stand unless the large majority of homeowners vote to get rid of it,139See, e.g., Bradford Manor CC&Rs, supra note 57, § 9.02(b); Westview Estates CC&Rs, supra note 57, § 5.3. then all it takes is one or a small handful of holdouts who cannot be paid off to let it go. Psychological phenomena tell us that holdouts are likely.140See supra Section I.B.3. So, many homeowners will require well above “market value” to relinquish a covenant, if they could name a price at all.141Owners of property tend to prefer in-kind compensation and suffer additional harm when coerced into parting with property even for market value. Stern & Lewinsohn-Zamir, supra note 31, at 61, 111, 206–07. Only in cases in which the builder’s expected profit is enormous will the builder successfully buy development rights from all the parties that can stop them.142See Heller, supra note 39, at 4–5. If that were common, housing gridlock might not be a significant problem. But because psychological phenomena suggest that people will likely value and guard their covenant rights, the buyout method is unlikely to yield meaningful results on its own.

2.  Broadscale Invalidation

Turning attention to the other end of the spectrum from free market to government control, the state or local government could simply declare single-family covenants invalid or unenforceable. Such an approach isn’t merely hypothetical. California has done it—at least for certain special circumstances. In 2021, when the state passed a law removing single-family zoning statewide, it also targeted certain covenants. It declared that any restrictive covenants limiting “the number, size, or location of the residences that may be built on the property, or that restrict the number of persons or families who may reside on the property,” are unenforceable against an owner or developer of “affordable” housing.143Assemb. B. 721, 2021–22 Leg. Sess. (Cal. 2021). The language of the provision makes clear that it does not destroy all single-family covenants statewide, but instead only applies to those covenants that would restrict the construction or operation of housing that is subject to certain significant rent restrictions.144See id. § 2(j). But in the sphere where the provision applies, its word is final. Any existing single-family covenants are powerless, no matter how longstanding they are or how many other properties purportedly benefit from them.

This total-abrogation approach has some benefits. It can cover many existing covenants all at once, providing numerous developers and landowners advance notice and assurance as to the (lack of) legal force those restrictions carry moving forward. It also, of course, completely bypasses the holdout problem. It does not matter whether all, or even any, property owners benefiting from the covenant can be persuaded to relinquish their hold. And relatedly, it perhaps does not directly cost the government or the builder anything.145The government can invalidate the covenants simply by saying so. But of course, there may be indirect costs to such action, as described in this Section infra. The philosophy is simple: if the problem is that too many people have a private property right enabling them to say no to development, then remove the right entirely.

But total government appropriation is a blunt instrument. As such, it might bring the hammer down on the housing gridlock caused by single-family covenants while also rattling other interests that we would prefer to leave undisturbed. As described above, the primary harm from the loss of a covenant often is not so much financial as it is psychological.146See supra Section I.B.3. And psychological research shows that being the subject of government coercion can be especially damaging.147Stern & Lewinsohn-Zamir, supra note 31, at 92, 96, 111. Take a study regarding the use of eminent domain, which is similar in some ways to government invalidation of a covenant (in that it takes a property right or interest away from a private party). A study posed to its subjects multiple scenarios in which an owner parted with a parcel of land. In two scenarios, the owner agreed to sell it, and in the other, the land was confiscated by eminent domain in exchange for compensation.148Daphna Lewinsohn-Zamir, Taking Outcomes Seriously, 2012 Utah L. Rev. 861, 872–83 (2012). But in all the scenarios subjects were informed that the owner of the parcel valued it at the same specified market value.149Id. Nevertheless, subjects reported that the owner was worse off in the eminent domain scenario.150Id.

Stern and Lewinsohn-Zamir call this the “coercion premium.”151Id. It represents the fact that people suffer some sort of harm from being forced to part with property that is separate from and in addition to the value of the property itself. From that premise, Stern and Lewinsohn-Zamir suggest that any use of eminent domain to which the Takings Clause applies should require “just compensation” above the property’s market value.152Id. The idea is that if what the owner lost is beyond market value, then just compensation is also more than market value.

This principle carries at least two implications for restrictive covenants. The first concerns the takings issue itself. Courts across the country appear split as to whether a restrictive covenant gives the benefitted properties the sort of interest such that they are entitled to compensation when the government extinguishes that interest through a taking—but many jurisdictions likely would say that it does.153Compare Creegan v. State, 391 P.3d 36, 45 (Kan. 2017) (providing a list of cases and jurisdictions supporting the idea that restrictive covenants are a property interest protected by the Takings Clause), with Anderson v. Lynch, 3 S.E.2d 85, 87 (Ga. 1939) (holding that owners of adjacent lots did not have a compensable ownership interest in a residential-use restrictive covenant). So, any such proposal might run into significant constitutional challenges depending on where it is enacted.154Robert Ellickson suggests that there might not be a significant takings problem because some states have successfully weakened covenants in certain contexts. Ellickson, supra note 18, at 1863. Based on the number of states that consider such property interests compensable when extinguished, I am not so sure. Ken Stahl considers the takings problem to be a serious issue, although he ultimately argues that such invalidations would likely survive Takings Clause challenges. Ken Stahl, The Power of State Legislatures to Invalidate Private Deed Restrictions: Is It an Unconstitutional Taking?, 50 Pepp. L. Rev. 579 (2023). Either way, the psychological factors described infra in this Section present their own problem, and the mere threat of substantial takings litigation could be enough to deter many government actors. In the jurisdictions that might find the extinguishing of a restrictive covenant to be a taking, the government could incur an expense—perhaps a substantial one—to destroy the covenants.155On top of that, the termination would likely turn out to be overbroad and thus unnecessarily expensive for the government. It would give every property owner benefitting from a covenant the right to sue for just compensation, even though only a small fraction of those owners would actually experience new housing construction in violation of the covenant in the near future. And whatever “just compensation” ultimately might be, the coercion premium makes it more likely that landowners would challenge the action in court, thus presenting another cost to the government from litigation.156Stern & Lewinsohn-Zamir, supra note 31, at 111.

The second implication is that coercively terminating restrictive covenants will in itself inflict some level of psychic harm.157Id. at 92, 108–11. The precise amount of harm is perhaps impossible to quantify. But if the question of best policy turns on utilitarian considerations to any degree, the psychic harm must be part of the calculus nonetheless.158Dmytro Taranovsky, Utilitarianism, Mass. Inst. Tech. (Feb. 7, 2003), http://web.mit.edu/dmytro/www/Utilitarianism.htm [https://perma.cc/9M9W-TSJ8] (explaining that utilitarian considerations involve psychological distress). How much weight and credence it deserves is of course another question. Perhaps local or state policymakers would conclude that the benefit of more housing from terminating numerous single-family covenants is worth the cost of the psychic harm to single-family homeowners.

However, as that cost-benefit analysis shakes out, policymakers contemplating it should keep in mind two pillars of political participation: voice and exit.159See Heather K. Gerken, Exit, Voice, and Disloyalty, 62 Duke L.J. 1349, 1352 (2013); Lee Anne Fennell, Homes Rule, 112 Yale L.J. 617, 626 (2002). See generally Albert O. Hirschman, Exit, Voice, and Loyalty: Responses to Decline in Firms, Organizations, and States (1970). These two forces drive much local (and, to an extent, state) policymaking. Voice refers to residents’ ability to give their input, either through detailed communication of some kind or, most significantly, through voting.160See sources cited supra note 159. If enough people are fed up with policymakers’ decisions, they can vote them out. If that happens, then the officials that fill the vacancies might be more amenable to the homeowners’ interests (and therefore might try to undo the undoing of the covenants, leading to an end result that is worse than before—a reestablishment of the conditions that led to gridlock and new government leaders who might be more anti-housing on the whole).

The related force, “exit,” refers to residents’ ability to vote with their feet by moving away from jurisdictions with policies they disfavor and into jurisdictions with policies they favor.161See sources cited supra note 159; see also Ilya Somin, Foot Voting, Federalism, and Political Freedom, 55 Federalism & Subsidiarity 83, 83–90 (2014). Exit casts a shadow over local and state government decision-making because when residents exercise it, the jurisdiction loses economic activity and housing demand, and so the tax base and revenue diminishes.162See sources cited supra note 159; see also Somin, supra note 161, 83–90.

Voice and exit are of particular concern when the displeased demographic is homeowners. For one, homeowners tend to be especially active in local politics,163Jesse Yoder, Does Property Ownership Lead to Participation in Local Politics? Evidence from Property Records and Meeting Minutes, 114 Am. Pol. Sci. Rev. 1213, 1213–18 (2020). so any new policy or law that disfavors them is sure to garner strong opposition. Also, because homeowners (especially of single-family homes) tend to be wealthier than the average citizen,164Neil Bhutta, Jesse Bricker, Andrew C. Chang, Lisa J. Dettling, Sarena Goodman, Joanne W. Hsu, Kevin B. Moore, Sarah Reber, Alice Henriques Volz & Richard A. Windle, Changes in U.S. Family Finances from 2016 to 2019: Evidence from the Survey of Consumer Finances, 106 Fed. Rsrv. Bulletin, Sept. 2020, at 1, 22, https://www.federalreserve.gov/publications/files/scf20.pdf [https://perma.cc/GJ3C-XFX6]; Baca et al., supra note 96 (supporting the assertion that single-family units tend to cost more than multi-family). their exit can be distinctly harmful to localities’ short-term fiscal goals.

But it is possible that in the long run these bogeymen of voice and exit would not prove catastrophic to the jurisdiction seeking to “take” the covenants. As for voice, if lifting single-family covenants creates room for enough new people to move into the jurisdiction early enough, then perhaps some of the strength of an anti-new-housing voting bloc can be diluted by new participants in public life. And as for exit, perhaps the exit of wealthy homeowners who prefer freezing neighborhoods as single-family only would not be that harmful to the jurisdiction if the removing of covenants unlocked trapped property value and increased the number of taxable housing units in the base.165Even if cities saw a momentary dip in demand because of the new policies—and thus a momentary dip in property values and economic activity—eventually the construction of new housing units could presumably provide enough economic activity and properties from which to levy taxes that the locality ends up ahead on net.

Overall, then, if a local or state government chose to address housing gridlock simply by destroying single-family covenants, it would have to be ready for some negative economic and political backlash. And although it may turn out that destroying the covenants is worth the backlash on the whole, the coercion premium brought about by such actions should give decisionmakers pause and reason to evaluate less heavy-handed legislative options. Robert Ellickson recently surveyed some such options, such as statutory provisions limiting the lifespan of covenants to two or three decades.166Ellickson, supra note 18, at 1861–62. He disfavors such restrictions because, if set on too short a timeline, they may terminate covenants before the covenants have expended their value to those directly benefited.167Id. I would also be wary of them because of the flipside of the coin: If set on too long of a timeline, they might allow harmful covenants to maintain their hold for too long. In other words, such provisions may sometimes prove beneficial but fail to narrowly tailor to problem sources.

3.  Traditional Common-Law Tools

Although covenants are supported by strong legal backing, the common law provides some principles to escape them in special cases. The two most obviously relevant here are the changed-circumstances doctrine and voiding as contrary to public policy.168Some other bases for terminating covenants under the common law include release, merger, abandonment, acquiescence, and laches. Aladar F. Siles, Methods of Removing Restrictive Covenants in Illinois, 45 Chi.-Kent L. Rev. 100, 101–06 (1968). Other than release, which I discuss extensively from various angles in this Article, I do not discuss most of these. With a common-interest community like an HOA, in which dozens or hundreds of individual property owners have the right to sue any other owner to enforce covenants, most of these methods are unlikely to pan out. The changed-circumstances doctrine in some ways comports with the psychological underpinnings of possession and attachment. But both that doctrine and the principle of voiding as contrary to public policy are unlikely sufficiently accessible from a legal standpoint to make much difference in the aggregate.

Under the common law, the changed-circumstances doctrine holds that a party owning a property burdened by a restrictive covenant can escape its enforcement if local conditions have changed so that the covenant no longer serves its purpose and no longer benefits the once-benefited property.169Davis v. Canyon Creek Ests. Homeowners Ass’n, 350 S.W.3d 301, 309 (Tex. Ct. App. 2011); Cnty. Club Dist. Homeowners Ass’n v. Cnty. Club Christian Church, 118 S.W.3d 185, 194 (Mo. Ct. App. 2003); Cordogan v. Union Nat’l Bank, 380 N.E.2d 1194, 1197–99 (Ill. Ct. App. 1978). For example, consider if one property (hosting one house) was in covenant with another (also hosting one house) to never have loud parties, but years later the benefited property is purchased and converted into an industrial site. A court could determine that the purpose of the covenant, presumably to ensure peace and quiet for whoever lived in the home on the benefited parcel, can no longer be carried out because that property no longer hosts a home. The party can go on.

Relying on the changed-circumstances doctrine has some intuitive appeal for voiding single-family covenants. Imagine that six homes on one street were under a covenant for single-family use only. But twenty years after the establishment of the covenant nearly all of the other homes on the street and the surrounding area have been converted to duplexes, quadplexes, or small apartment buildings. If the purpose of the covenant was to do what could be done to preserve the single-family character of the entire surrounding area, and (so the owners thought) to preserve property values, perhaps the covenant can no longer serve its purpose. Psychologically, the feeling of loss the owners might experience by seeing the covenant dissolve would probably be diluted, because the neighborhood had already been changing for years. And if it was once financially beneficial to lock in single-family use, perhaps increased housing demand in the area (signaled by all the new construction) makes it so that the properties would be worth more if not so limited.

But the problem with the changed circumstances doctrine is that courts employ it only in rare cases.170Robert Ellickson discusses the changed circumstances doctrine as a potential mechanism for handling “stale” covenants, but ultimately the legal background he discusses instills little confidence in such efforts proving fruitful. Ellickson, supra note 18, at 1857–59. As a general matter, courts hesitate to find changed circumstances unless there is no reasonably conceivable benefit to the covenant.171Cordogan, 380 N.E.2d at 1199–200. In the context of single-family covenants, that would be incredibly hard to show. At least theoretically, ensuring that a neighboring property or properties remains single-family could benefit the surrounding properties’ values, because it suppresses local housing supply. The purpose of the covenant may not be economically efficient, but as long as the court finds a purpose, it likely will stand.172Id.; see also Restatement (Third) of Prop.: Servitudes § 7.10 (Am. L. Inst. 1999).

And the fact that such covenants often appear as part of the regulations of an HOA with dozens or more properties make the changed-circumstances doctrine an even weaker tool. If an HOA with a single-family covenant covers a sufficiently large swath of land, then it necessarily will insulate the properties within it from much neighborhood change. Instead, the entire subdivision or several blocks of the neighborhood remains a single-family enclave, and the areas beyond the HOA’s bounds are the spaces that might change. Of course, proliferation of multi-family housing outside of the bounds of the HOA is more than likely a primary reason why HOA members would want the single-family covenant to persist. The covenant’s purpose is to preserve the area as distinct from its surroundings.

So, although declining to enforce single-family covenants because of changed circumstances makes sense from a psychological standpoint and is probably less likely than other approaches to elicit backlash from psychic harm to homeowners, in practice under current common-law rules it is unlikely to be a powerful tool to break housing gridlock on any meaningful scale.

Declaring such a covenant invalid as contrary to public policy is potentially more powerful, but ultimately suffers from a similar weakness. Under the Restatement approach, a covenant is invalid if it is contrary to public policy.173Restatement (Third) of Prop.: Servitudes § 3.2 (Am. L. Inst. 1999). It is rare that a court concludes a covenant is contrary to public policy.174See Korngold, supra note 29, at 49. But when it does happen, often it is because a state (or sometimes even the federal government) has made clear through enacted statutes or other written law that what the covenant aims to accomplish is specifically disfavored, or that it stands directly opposed to a goal codified in state law.175Id. at 51–52.; Viking Props., Inc. v. Holm, 118 P.3d 322, 329–30 (Wash. 2005); Westwood Homeowners Ass’n v. Tenhoff, 745 P.2d 976, 980–81 (Ariz. Ct. App. 1987); Terrien v. Zwit, 648 N.W.2d 602, 608 (Mich. 2002). In the context of restrictive covenants, the public policy exception closely adheres to the unique principle that a court’s enforcement of certain restrictive covenants constitutes “state action.”176See Shelley v. Kraemer, 334 U.S. 1, 18–19 (1948). Thus, when a court is faced with a request to enforce or invalidate such a covenant, it must keep in mind whether doing so would contravene the explicit goals expressed in the state or federal constitution, or any other law such as one passed by a legislative body.

If a court concluded that enforcing a single-family covenant was contrary to public policy, then that would be a powerful tool to break apart some of the housing gridlock. The result of such a decision would completely bypass the covenant and thus free up the land for more productive residential use. But it is unclear what it would take to prove that enforcing a single-family covenant is against public policy, and even that inquiry could vary substantially by state. There are some easy cases. Take California, for example. There, again, the state passed a law declaring that any single-family covenant is unenforceable against a developer or operator of “affordable housing.”177Supra notes 143–44 and accompanying text. Quite plainly, then, there is a public policy against single-family covenants blocking the development of affordable housing. But that is an easy case because the statute already does all the work—the statute by its own enactment guts all such covenants, so the “public policy” exception on the judicial side of the equation is unnecessary.

And there are easy cases on the other end of the spectrum, when enforcing such a covenant would clearly not violate public policy. If the government actor has already zoned large portions of similar land for single-family use only, then it would be hard to credibly assert that the government has any discernible public policy against private agreements that would do the same.

But there are cases between these two extremes that are trickier. Take for instance any of the jurisdictions that have effectively eliminated single-family zoning on the public side of things, but unlike California have said nothing about private covenants.178These would include Oregon, Maine, and Minneapolis. See supra notes 12–15 and accompanying text. On the one hand, one could argue that a jurisdiction that took the affirmative step of freeing all land from the constraints of government-induced single-family restrictions would likewise oppose private arrangements that constrained property in a similar way. On the other hand, one could also argue that if the government wanted to touch private covenants, it could have said so more explicitly. And in any event, a government could reasonably want to loosen government constraints but remain agnostic on what private entities and private rights accomplish in the same subject area.

In an article released this year, Gerald Korngold specifically argues that the public policy doctrine is a viable method for voiding many single-family covenants.179See generally Korngold, supra note 29. He rightly notes that occasionally courts in some jurisdictions have viewed the doctrine more broadly, applying it even without explicit enacted guidance from the legislature on the issue.180Id. at 51–53. I would welcome extending that approach to the issue of single-family covenants. But because it appears that such an approach would mark a stark departure from how courts in many states have approached the public policy doctrine, I think it profitable to seek another solution.

In sum, I believe that the void-as-contrary-to-public-policy approach likely is not a reliable mechanism for breaking gridlock moving forward under the current state of the law. At the very least, in many jurisdictions it would potentially require the jurisdiction to already have eliminated single-family zoning, and so far, that has happened only in a handful of cities and states. And even in those jurisdictions, it is likely that some other specific expression of a policy against restrictive covenants operating to constrain housing supply would be required.

B.  A Hybrid Solution

This Section first briefly identifies a potential multi-step solution to escaping housing gridlock while limiting negative externalities and psychic harm to property owners. It then explains each component of the solution in more detail and explores why it presents some advantages to other approaches from legal, economic, and especially psychological standpoints.

1.  The Two-Stage Solution to Escape Gridlock

This Section gives an overview of a procedural and remedial solution to housing gridlock that single-family covenants (and covenants with a similar effect) helped bring about. The first stage of the solution involves a new legal mechanism created by statute. It would be available to any property owner subject to such a covenant in an area without single-family zoning. If the property owner (I will call this entity the “builder” for clarity) wants to construct multi-family housing on its property, it can force a decision of each beneficiary property on whether to decline to enforce the covenant. The builder would submit to the HOA members a rough plan for the development, such as whether it intends to create a duplex, add an accessory dwelling unit, or build an apartment or condo building, as well as the estimated size of the structure. Along with the rough plan, it would submit a lump-sum monetary offer—the amount the builder is offering to the property owners collectively to not oppose the development. The beneficiary property owners (most often this will include all other members of the HOA) then can individually decide whether to “opt in” to the offer. Those who do so will be entitled to a pro rata share of the total sum.

The second stage of the solution acts as a backdrop. Whether or not a majority of interested property owners opt in to receive a pro rata share of the monetary offer, the construction can go forward. But for those who did not opt in, a legal action for damages will still be available once the structure is completed to a habitable state. They cannot sue to block the development or require it to be torn down if it is already built; but they will be entitled to damages of some amount. There would be no need to determine liability—the builder’s violation of the covenant would be open and undisputed. The action, if it does not result in settlement, will proceed to a trial or hearing on the issue of harm alone. At that proceeding, the property owner would be entitled to a jury determination of damages within statutorily defined ranges, which could rest not only on evidence of the market value of enforcing the covenant in that instance, but also on any sort of harm (financial, psychological, or otherwise) the suing property owner suffered from the new construction.

The following Sections explore each component of the proposed solution in more detail and discuss the comparative advantages over other approaches from legal, economic, and especially psychological perspectives. They address the damages backdrop first since that component is more central to the overall scheme, and then address the initiating offer second.

2.  Why Damages?

By now the stickiness of restrictive covenants is evident. They are hard to get rid of without express agreement by most benefited property owners.181See supra Section II.A.1. Broad coercive action by governing authorities might achieve the primary goal of breaking gridlock, but in the process it might generate various negative externalities rooted in the psychic harm that research suggests it would cause.182See supra Section II.A.2. And the common-law mechanisms limiting the perseverance of such rights are narrow and apply rarely.183See supra Section II.A.3. But courts and the common law can contribute another tool: remedies. It is one issue what private rights parties possess; it is another issue which way a court will operationalize the right.184See, e.g., Daphna Lewinsohn-Zamir, Do the Right Thing: Indirect Remedies in Private Law, 94 B. U. L. Rev. 55, 56 (2014) (“Private law provides diverse remedies for right violations: compensatory and punitive, monetary and nonmonetary, self-help and court awarded.”). At the simplest level, sometimes a court can issue an injunction protecting a right, while other times it can order damages as payment for violating the right.185See id.

A promising tool for breaking the hold restrictive covenants have on housing supply would be—to use the classifications made famous by Calabresi and Melamed’s foundational work—converting the right provided by such a covenant from a property rule to a liability rule.186Guido Calabresi & A. Douglas Melamed, Property Rules, Liability Rules, and Inalienability: One View of the Cathedral, 85 Harv. L. Rev. 1089, 1092 (1972) (“An entitlement is protected by a property rule to the extent that someone who wishes to remove the entitlement from its holder must buy it from him in a voluntary transaction in which the value of the entitlement is agreed upon by the seller.”); see also id. (“Whenever someone may destroy the initial entitlement if he is willing to pay an objectively determined value for it, an entitlement is protected by a liability rule.”). In other words: enforcing covenants only in such a way that the beneficiary of the covenant can receive payment for its violation, but cannot by force of law keep the burdened property in compliance. In a recent article, Robert Ellickson briefly suggested a damages approach and commended one state, Massachusetts, which has provided for it by statute in some special instances.187Ellickson, supra note 18, at 1860–61; see Mass. Gen. Laws ch. 184, § 30 (2023); see also Winokur, supra note 26, at 83. This Section explores a damages approach for single-family covenant violations from legal, economic, and psychological viewpoints. It then discusses the finer details of how such an approach could be implemented from a practical standpoint.

i.  The Legal Landscape

From a legal perspective, damages are the preferred remedy across much of the common law.188Restatement (Second) of Contracts § 359(1) (Am. L. Inst. 1981) (“Specific performance or an injunction will not be ordered if damages would be adequate to protect the expectation interest of the injured party.”); eBay v. MercExchange, LLC, 547 U.S. 388, 391 (2006) (explaining that typically for a plaintiff to be entitled to an equitable remedy instead of a legal one like damages, “[the] plaintiff must demonstrate: (1) that it has suffered an irreparable injury; (2) that remedies available at law, such as monetary damages, are inadequate to compensate for that injury; (3) that, considering the balance of hardships between the plaintiff and defendant, a remedy in equity is warranted; and (4) that the public interest would not be disserved by a permanent injunction”). Even for property rights (which one would rightly assume are often protected by a “property rule”), courts often require the plaintiff seeking to enforce its right to show why equitable relief like an injunction is appropriate. For intellectual property, electronic property, chattel, and sometimes even real property, a plaintiff must show that damages cannot adequately compensate them and that they will suffer irreparable harm without an injunction.189eBay, 547 U.S. at 391 (intellectual property); Intel Corp. v. Hamidi, 71 P.3d 296, 303 (Cal. 2003) (electronic property); Wiggins v. City of Burton, 805 N.W.2d 517, 534–35 (Mich. Ct. App. 2011) (stating the general rule favoring damages for real property). And even after all of that, a court might decline to issue an injunction if it finds it inequitable to do so based on the interests both of parties to the lawsuit and of third parties.190See cases cited supra note 189; see also Blakeley v. Gorin, 313 N.E.2d 903, 912 (Mass. 1974). Thus, at a high level of generality, it comports with common-law remedial principles to presume that damages are a proper remedy when a restrictive covenant is violated.

In practice, however, most jurisdictions will enforce restrictive covenants by injunction.191E.g., 7 Fla. Jur. 2d Building, Zoning, and Land Controls § 102 (2024); 12A Carmody-Wait 2d Injunctions § 78:87 (2023); 43A C.J.S. Injunctions § 186 (2023). A covenant is a property interest and so, the thinking goes, an injunction to protect against its violation is presumptively appropriate.192See sources cited supra note 191. Injunctions are often available simply on a showing that a covenant was violated, without any necessary demonstration of harm by the complaining party.193See sources cited supra note 191; see also 15 Standard Pa. Prac. 2d Injunctions § 83:43 (2024). And indeed, an injunction in some circumstances might even require a property owner to tear down a structure that was erected contrary to the covenant.194Tanglewood Homes Ass’n v. Henke, 728 S.W.2d 39, 47–49 (Tex. App. 1987); Heath v. Uraga, 24 P.3d 413, 422–23 (Wash. Ct. App. 2001). For restrictive covenants specifically, many courts favor injunctions to enforce them precisely because it is challenging to quantify the harm from violating them.195See sources cited supra note 191. From a historical perspective, this practice follows from a unique turn of events in Anglo-American law that lowered the bar for what a restrictive covenant beneficiary would have to show to obtain equitable relief. Traditionally, covenants could be enforced against successors in interest only when the original covenanting parties were in horizontal privity; but eventually courts placed restrictive covenants under the umbrella of a new form of nonpossessory interest known as an equitable servitude, which more freely allowed for enforcement by injunction.196See generally Tulk v. Moxhay (1848) 41 Eng. Rep. 1143.

Yet in most jurisdictions courts still ultimately retain remedial discretion, and thus need not enforce a covenant through an injunction if doing so would be inequitable.197Hall v. Gregory A. Liebovich Living Trust, 731 N.W.2d 649, 652–53 (Wis. Ct. App. 2007). In Blakeley v. Gorin, a property owner planned to construct a large hotel and apartment building that would connect to the neighboring property at the rear via a large, elevated bridge.198Blakeley v. Gorin, 313 N.E.2d 903, 906 (Mass. 1974). But a covenant required property owners to leave a sixteen-feet-wide space behind their buildings at the rear of the property.199Id. at 906–07. The restrictive covenant was over a century old and originally served to preserve a cart path.200Id. Even though the court noted that cart paths are now mostly obsolete, it explained that the covenant still served the valuable purpose of preserving light and air for surrounding properties.201Id. at 911–12. Thus, the court held that the covenant should be enforced.202Id. at 912. However, the court chose damages instead of an injunction to do so.203Id. In its view, the harm to surrounding properties from reduced light and air was minimal compared to the benefit to the developer and the public from the more productive use of the land.204Id.

Likewise, sometimes courts specifically conclude that damages are an adequate legal remedy for the violation of a restrictive covenant. In Crossmann Communities, Inc. v. Dean, a builder violated a setback covenant by beginning to construct a house too close to the property boundary line.205Crossmann Cmtys., Inc. v. Dean, 767 N.E.2d 1035, 1038 (Ind. Ct. App. 2002). A neighboring property owner within the planned community sued to enjoin the construction.206Id. The Court of Appeals of Indiana held that although the covenant was enforceable, damages were an adequate remedy because “[a] restrictive covenant constitutes a compensable interest in land.”207Id. at 1042 (quoting Dible v. City of Lafayette, 713 N.E.2d 269, 273 (Ind. 1999)) (“Because the violation of the restrictive covenants constitutes a compensable interest and because Dean’s subjective concerns are directed to the possibility of a future injury, we find that Dean has an adequate remedy at law for monetary damages that can be corrected at the final judgment.”).

When it comes to single-family covenants, though, courts almost never elect damages instead of injunction.208See Golston v. Garigan, 265 S.E.2d 590, 592 (Ga. 1980); Cordogan v. Union Nat’l Bank, 380 N.E.2d 1194, 1198 (Ill. Ct. App. 1978); Bob Layne Contractor, Inc. v. Buennagel, 301 N.E.2d 671, 681 (Ind. Ct. App. 1973). But see Dible v. City of Lafayette, 713 N.E.2d 269, 273 (Ind. 1999) (explaining that a restrictive covenant is a compensable interest in land). Why? It is not necessarily because they walk through the standard equitable considerations and determine an injunction is necessary. Instead, courts typically note that for violations of such covenants, plaintiffs are not limited to damages they can prove.209See Golston, 265 S.E.2d at 592; Cordogan, 380 N.E.2d at 1198; Buennagel, 301 N.E.2d at 681. Essentially, the court simply states that an injunction is allowed in the face of a violation and goes on its way.210See cases cited supra note 209; see also Restatement (Third) of Prop.: Servitudes § 8.3 cmt. b (Am. L. Inst. 2000).

That approach would be misguided in many cases dealing with proposed or completed multi-family housing development. If restrictive covenants were treated like most other private legal rights, then a plaintiff would have to affirmatively show that an injunction is necessary to safeguard their interests and that such relief is equitable considering all parties affected by it.211See Reynolds v. Amerada Hess Corp., 778 So.2d 759, 765–66 (Miss. 2000); Saint John’s Church in the Wilderness v. Scott, 194 P.3d 475, 480–81 (Colo. App. 2008). In many cases, a property owner benefiting from a single-family restrictive covenant may have a very difficult time making such a showing. As described above in Section I, presumably from a homeowner’s perspective, the main reason to cherish a single-family covenant is that it preserves property values, both by suppressing nearby housing supply and by preserving “neighborhood character.” But, again, the evidence is tenuous that removing such a covenant significantly harms surrounding property values.212See supra Section I.B.2.

Regardless, if there is some measurable harm to property value by terminating the covenant, then damages could conceivably compensate for it because it would be financial loss.213Crossmann Cmtys., Inc. v. Dean, 767 N.E.2d 1035, 1042 (Ind. Ct. App. 2002) (explaining that because restrictive covenants are compensable interests in land, damages may serve as an adequate remedy for their violation). And if there is no measurable harm to property value, that likely means the harm is not so much financial as psychic. Assuming psychic harm provides the foundation for a cognizable legal interest, it is true that damages might not always perfectly compensate for it. From a psychological standpoint, people prefer in-kind redress over monetary redress, even when the loss they suffered is of a fungible asset.214Stern & Lewinsohn-Zamir, supra note 31, at 111, 206–07. Extrapolating to the context of psychic harm, it is reasonable to assume that if there is real psychic harm from terminating a covenant, damages might not completely compensate for it. That may be one of the reasons the Restatement (Third) of Property and many courts remark without much explanation that because harm from the violation of a restrictive covenant is hard to quantify, injunctive relief is appropriate.215Restatement (Third) of Prop.: Servitudes § 8.3 cmt. b (Am. L. Inst. 2000); see Golston v. Garigan, 265 S.E.2d 590, 591 (Ga. 1980); Cordogan v. Union Nat’l Bank, 380 N.E.2d 1194, 1198 (Ill. Ct. App. 1978).

Still, under standard remedial principles, a court may go on to evaluate whether injunctive relief—even if a better compensator than damages—is appropriate based on a “balance of equities”; that is, whether an injunction makes sense given the harm such relief might cause to both the defendant and the broader public.216See Blakeley v. Gorin, 313 N.E.2d 903, 912 (Mass. 1974); Crossmann Cmtys., 767 N.E.2d at 1041–42; Reynolds, 778 So.2d at 765–66; Saint John’s Church in the Wilderness, 194 P.3d at 480–81 (Colo. App. 2008). Often it does not make sense in light of those considerations. The benefit to the plaintiff homeowner would be avoiding some indeterminate amount of psychic harm. But the harm to the defendant and the public from an injunction (that is, strictly enforcing the covenant) might be more pronounced. The owner of the land limited by the covenant would suffer the financial consequences of only being able to operate a single-family home when multi-family use might be dramatically more profitable. And the public would suffer the consequences of one more instance of constrained housing supply.

The legal landscape thus points two ways here. On the one hand, courts are generally free to opt for damages instead of an injunction when presented with a violation of a covenant. On the other hand, courts in practice rarely do so, especially for single-family covenants. I next move on to the interrelated economic and psychological perspectives on the issue to explore why courts’ overprotectiveness of such covenants with injunctive relief is likely misguided.

ii.  Economic Considerations

From an economic perspective, damages are often the best remedy if available. That is because they allow for “efficient breach.”217Winokur, supra note 26, at 37. This concept is most prominent in contract theory, but it applies more broadly. To put it simply, damages place a number value on a legal right or duty. If a party wants to violate such a right or duty, damages set how much they must pay to do so.218See Huynh v. Vu, 111 4 Cal. Rptr. 3d 595, 607–08 (Cal. Ct. App. 2003); Bhole, Inc. v. Shore Invs., Inc., 67 A.3d 444, 453 n.39 (Del. 2013). And most commonly, damages aim to compensate the injured party for what it lost. So, if a violating party chooses to violate a right, and then pays damages to do so, theoretically that party has determined that its action is worth more to it than the money it had to pay. The end result is that the injured party is no worse off than before, and the violating party is better off—approaching a Pareto-efficient result.219In re Grace, No. 7-04-14547, 2008 WL 1766752, at *7 (Bankr. D.N.M. Apr. 14, 2008).

If instead a violating party does not have the opportunity to violate a right and pay damages—indeed, if it cannot permanently violate a right at all because it is prohibited by an injunction—then the law entrenches inefficiency.220See Ian Ayres & Kristin Madison, Threatening Inefficient Performance of Injunctions and Contracts, 148 U. Pa. L. Rev. 45, 47 (1999); Winokur, supra note 26, at 37. The violating party cannot pay to get something they value more than the money, even when allowing them to do so theoretically would not ultimately make anyone else worse off.

To make it plain for the context of this Article: in theory a single-family covenant has a specific quantifiable value to a beneficiary of it (such that the owner assumes it enhances their property value).221See, e.g., Crossmann Cmtys., Inc. v. Dean, 767 N.E.2d 1035, 1042 (Ind. Ct. App. 2002) (explaining that because restrictive covenants are compensable interests in land, damages may serve as an adequate remedy for their violation). And a prospective builder expects a certain financial gain from being able to construct a duplex, quadplex, or other multi-family housing development.222See Baca et al., supra note 96 (supporting that an owner can generate greater revenue from more units). If the value the builder expects to gain from the project is greater than the value of the single-family restriction to the neighbors, then an efficient framework would allow the developer to violate the covenant and pay the neighbors what the restriction on that particular property was worth to them. The developer still profits, and the neighbors are no worse off (and that is not to mention the added benefit to the public at large from the increased housing supply).223See generally Andrea Ventura, Carlo Cafiero & Marcello Montibeller, Pareto Efficiency, the Coase Theorem, and Externalities: A Critical View, 50 J. Econ. Issues 872 (2016) (discussing Pareto optimalization and Coasian bargaining).

As explained in Part I above, although such a result could theoretically be achieved through private bargaining, the logistical difficulties of doing so means that a “liability rule” might be necessary to reach an optimal result.224See supra Section I.B.2. Under Calabresi and Melamed’s famous and foundational remedial framework, the standard perspective is that a right should be protected by a property rule when transactions costs are low, and a liability rule when transactions costs are high.225Louis Kaplow & Steven Shavell, Property Rules Versus Liability Rules: An Economic Analysis, 109 Harv. L. Rev. 713, 718 (1996) (identifying, though questioning, the standard perspective that property rules should be used when transactions costs are high and liability rules when costs are low). The reason for this is that when transactions costs are low, such as when only two parties in close proximity are involved, then it is logistically simple for them to negotiate a buyout.226See generally id. (discussing the standard rationale behind the choice of property versus liability rules). But when transactions costs are high, such as when a builder must obtain permission from many parties, then a liability rule bypasses drawn-out negotiations and holdouts and jumps straight to compensation.227See generally id. (discussing the standard rationale behind the choice of property versus liability rules).

For single-family covenants, a liability rule is often more appropriate. This is so especially when the population of property owners with the right to say “no” to multi-family development is large and thus the risk is great of holdouts who will accept no reasonable price for a change.228See Winokur, supra note 26, at 26–27, 33 (“In addition to the association and possibly the developer, there may be hundreds of individual neighbors entitled to enforce the servitudes. Their sheer numbers may make negotiation for modified enforcement unworkable.”). Combined with the psychological phenomena leading neighbors to assume that such restrictions are worth more than they in fact are,229See supra notes 124, 127–28 and accompanying text. this means that a lawsuit with damages as the end result is often a necessary tool to reach an efficient arrangement that private bargaining cannot achieve.230See Winokur, supra note 26, at 26–27, 37.

What about an obvious and important economic objection—that any such limitation on the strength of single-family covenants might in the short run constrain housing supply? The idea is that there must be some profits-focused reason why subdivision developers include the single-family covenant in new CC&Rs.231See supra Section I.B.2. And so if such a provision is weaker, the developers might have a harder time selling off homes initially. By extension, they perhaps would have less incentive to develop land in the first place.

Two responses: first, although it is true that homes within HOAs tend to be more valuable than those not in HOAs,232Clarke & Freedman, supra note 60, at 2. the evidence does not indicate that reciprocal single-family covenants are necessarily the primary reason for that. It is more likely that the better services within an HOA compared to those of the surrounding locality is a major selling point, with the complete, broad network of restrictions playing a role alongside.233Id. at 1. Indeed, a single-family restriction on a given piece of property likely suppresses that property’s value.234See Glaeser & Gyourko, supra note 113, at 35. So even if it were a selling point that other surrounding houses are covenant-bound to remain single family, any value bump from that fact could be offset by the fact that the same restriction applies to the purchased property too. Put another way, any “demoralization costs” of limiting the enforcement of restrictive covenants may in part be offset by the “morale benefits” of loosening constraints on individuals’ free use of their properties.235Nestor M. Davidson, Property’s Morale, 110 Mich. L. Rev. 437, 442 (2011).

Second, the objection fails to distinguish between local market conditions at the time of initial sale of the home and the time of subsequent transfers of the property to new owners. Most likely, by the time an owner of a house restricted to single-family use determines that they want to develop additional housing, years have gone by since the covenant was initially placed.236The average homeownership tenure is around thirteen years, see Dana Anderson, The Typical U.S. Home Changes Hands Every 13.2 Years, Redfin (Mar. 2, 2022), https://www.redfin.com/news/2021-homeowner-tenure [https://perma.cc/4N4T-9FDR], and that does not account for any owners who owned the home after the covenant was placed and before the present owners took possession. When the initial developer of the subdivision first sold off the homes, it was able to capitalize on any value added immediately from the single-family restrictions (if there was any value added). So, the fact that legal remedies might make some room for market pressure towards multi-family housing down the road is unlikely to significantly affect the initial profitability for the developer from building out and selling off the subdivision of homes restricted to single-family use. The availability of damages down the road unlocks new housing and probably will not stifle initial subdivision development.

iii.  Psychological Phenomena at Play

One might argue, though, that the precise problem is that you cannot put a numerical value on a single-family restriction from the point of view of the neighbors.237See Restatement (Third) of Prop.: Servitudes § 8.3 cmt. b (Am. Law Inst. 2000). This very Article even suggests that psychological attachment, more than quantifiable financial interest, explains the perseverance of such covenants.238See supra Section I.B.3. But even if it is difficult to put a number on what it means to lose the benefit of a restrictive covenant, psychological phenomena suggest that enforcing a covenant, but ordering damages, is perhaps the best way to pursue the public interest in a way that does minimal harm to landowner expectations.

First, I will address some psychological research that might seem to point away from a monetary remedial scheme. Research shows that people prefer in-kind redress over monetary relief.239Stern & Lewinsohn-Zamir, supra note 31, at 111, 206–07. When they lose something, they are likely to be more satisfied if it is replaced with something similar—even if the thing lost is fungible.240Id. Relatedly, Stern and Lewinsohn-Zamir argue in favor of property rules and injunctive relief because in their view this principle means that courts systematically undercompensate injured parties.241Id. at 193, 201. If that is so, then perhaps one might assume that a property rule enforced by an injunction makes the most sense; if money in exchange for the loss of a right does not seem to make the injured party feel whole, then monetary relief is inadequate and an alternative remedy like an injunction is necessary.

Relatedly, the Mere Ownership Effect may partially explain the existence of property rules in general. If a liability rule assumes that basic infringements on property rights can be rectified through pay, property rules assume that sometimes they cannot.242Calabresi & Melamed, supra note 186, at 1092. A property rule promises that the legal system will protect a property interest even if infringement of that interest does not manifest as quantifiable financial harm (hence the classic standard for injunctive relief, which requires the plaintiff to show that they will suffer irreparable harm without an injunction and that legal remedies like damages do not adequately safeguard their interests).243See, e.g., RMH Tech, LLC v. PMC Indus., Inc., 352 F. Supp. 3d 164, 198 (D. Conn. 2018). Property rules assume real harm even absent affirmative proof of it.244See Henry Smith, Property and Property Rules, 79 N.Y.U. L. Rev. 1719, 1760 (2004) (explaining that property rules place with the owner of an entitlement the power to determine the value of the entitlement).

To put it in terms of the Mere Ownership Effect: if something becomes “mine,” then for that reason alone it is more valuable to me.245Supra notes 117–22 and accompanying text. Therefore, an act that I see as an afront to the thing being “mine” hurts me, even if it causes no measurable damage to the object and even if it does not prevent me from using my property. If my right to exclusive possession is not respected, then I have lost something.2467 Fla. Jur. 2d Building, Zoning, and Land Controls § 102 (2024) (“Since the value of a restrictive covenant is often difficult to quantify and may be impossible to replace, injunctive relief is normally available to redress violations of restrictive covenants affecting real property, without proof of irreparable injury or a showing that a judgment for damages would be inadequate. It is the theory of the law that every piece of land has a peculiar value, infringement of which is not readily remedied by an assessment of damages of law.”). At the end of the day, then, monetary compensation would fall short because the core of the harm is difficult or impossible to quantify.247Id. And so, if we care about that non-quantifiable harm, the best way to guard against it would be a property rule expressed through injunctive relief.248See Smith, supra note 244, at 1758–60.

That argument is compelling, but it suffers from a few shortcomings when applied to the issue at hand. First, it would mean mostly surrendering to gridlock. If we assume that because owners of restrictive covenants would rather have the covenants than money, we therefore must accommodate that preference; then life will go on as it has. In the numerous and increasing swaths of land across our country where single-family homes under covenant dominate, but housing demand is high, we will simply persist indefinitely in the housing shortage. Psychological realities may be quite important, but do not alone carry all the normative weight. Property owners might report that they prefer something over something else, but whether the law should cater to the preference is another matter entirely. Stern and Lewinsohn-Zamir, despite their favoring of in-kind remedies and property rules, note that just because people might prefer one kind of remedy does not mean the law necessarily should accommodate their desire.249Stern & Lewinsohn-Zamir, supra note 31, at 201–02.

Indeed, while we can generate crucial insights by understanding psychological phenomena undergirding people’s relationship to their possessions, we likely cannot satisfy short-term preferences of all property owners and correct the legal and market mechanisms that have brought about housing gridlock at the same time. Again, recall the Mere Ownership Effect.250Supra notes 117–22 and accompanying text. Because people value their possessions above market rate by the simple fact that they possess them, a feature of human psychology makes efficient transfer of goods less likely.251Stern & Lewinsohn-Zamir, supra note 31, at 11, 196–97; Winokur, supra note 26, at 34–37. The Mere Ownership Effect thus entrenches a status-quo bias into people’s relationship to property. And property rules simply bolster that status quo, regardless of whether it is economically efficient or socially beneficial to do so.252Stern & Lewinsohn-Zamir, supra note 31, at 11, 196–97; Winokur, supra note 26, at 34–37.

Next, even from a psychological standpoint, we must take the preference for in-kind redress itself with several grains of salt. This Article explained above how the fear of neighborhood change has such deep psychological roots.253See supra Section I.B.3. But there is another element at play here. It is the fact that people are bad estimators of their future flourishing.254See Stern & Lewinsohn-Zamir, supra note 31, at 61. Take for example, one of the most serious forms of compulsory lifestyle change: relocation. If people develop a fear and aversion to changes in their familiar surroundings like their neighborhood, then even more so they assume they will suffer deep psychic harm from being forced to move somewhere else entirely. But the psychological reality is that that is generally not the case. In fact, there is almost no evidence at all that people who must relocate to another home suffer any lasting psychological harm from that event.255Id. at 59. There are some narrow exceptions, such as people living in poverty for whom relocation often means eviction and homelessness.256Id. at 59, 67. But for most people, even though the prospect of change may hurt, they readjust quickly.

That is not to say the initial discomfort does not matter. But when the choice is between an injunction that freezes a housing shortage under force of law and damages that both attempt redress and free up new housing, we have some important questions to ask: If any discomfort from the addition of new housing may wane into the immeasurable given enough time, then why operate under legal rules that, in effect, indefinitely prevent such change? And if psychological research shows that a person would not pay over market value to get a restriction in the first place, then why should we effectively “pay” them far above market value (expressed through unending specific performance)257Cf. Autozone Stores, Inc. v. Ne. Plaza Venture, LLC., 934 So. 2d 670, 675 (Fla. Dist. Ct. App. 2006) (“We acknowledge the view that any harm—including the harm caused by the violation of a restrictive covenant relating to the use of real property—can be assigned a monetary value. But the law of Florida has not embraced that view.”). when they stand to lose that restriction?

Furthermore, not only do people adjust to new circumstances despite their initial discomfort, they also can be nudged to see their own possessions and entitlements through a different lens. Specifically, the degree of psychic harm a person experiences by losing a legal entitlement can be influenced by how that entitlement was previously framed and presented to them. One study measured this effect among a group of first-year law students.258Jonathan R. Nash, Packaging Property: The Effect of Paradigmatic Framing of Property Rights, 83 Tul. L. Rev. 691, 693–94 (2009). Students were divided into two groups and all were given a laptop. The students in one group were told they owned the laptop, which, they were informed, included the right to use, exclude others, and transfer.259Id. at 712–15. The other group was told that they owned all those same sticks in their bundle of property rights but not specifically that they owned the laptop itself.260Id. Both groups were then informed that the school was placing certain restrictions on the laptops’ use. Between the two groups, the students who were told they owned the laptop were more likely to report that they “lost” something because of the restrictions compared to the students who were merely told they had a collection of specific rights regarding the laptop.261Id. at 721–22. Researchers later concluded that forewarning property owners of potential restrictions on their free use of property leads those owners to feel less like their rights had been violated when restrictions were later presented.262Jonathan Remy Nash & Stephanie M. Stern, Property Frames, 87 Wash. U. L. Rev. 449, 470 (2010).

So, if owners of a possession or entitlement are informed about the qualified nature of their right, they are less likely to feel a sense of loss when restrictions manifest later. For single-family covenants, this means that people’s expectations about what entitlements a restrictive covenant gives them potentially can be adjusted. If, in practice, they begin to see that violation of such a covenant entitles them to some form of financial compensation, then over time they may adjust to see their interest in the covenant as a financial one—and not an amorphous, yet deeply cherished, interest that entitles them to block neighborhood change.

On the other side of the coin, that same principle demonstrates why the damages approach has some psychological advantages to other methods of breaking housing gridlock, especially those involving sweeping mandates or broad invalidation of such single-family covenants. A damages regime enables people to relearn the nature of their interests. A wide-reaching statute that terminates or voids all single-family covenants in one shot gives people no time to adjust their expectations about their own entitlements before those entitlements are taken away. But if the covenants stay in place and are simply enforced differently when violations arise, then people perhaps have room to adjust their expectations without feeling like they have been entirely deprived of their rights. Admittedly, property owners might still experience some degree of surprise, such as when the legislative body first enacts an approach like the one this Article urges. Furthermore, the first property owners in a given geographic area to find themselves neighbors to a covenant violation under the new remedies approach would not have had sufficient time or previous examples based on which to adjust their expectations. But for subsequent neighbors, the blindsiding effect could be reduced.

But if damages are the answer, how will they be calculated? It is an important question because, as Stern and Lewinsohn-Zamir point out, the “coercion premium” means that property owners are unlikely to receive compensation that makes them whole, at least in the takings context.263Stern & Lewinsohn-Zamir, supra note 31, at 111, 201. First, property owners, so the thinking goes, refuse to sell in a voluntary market transaction because they value their property above market value.264See Clare Trapasso, Why Aren’t Would-Be Sellers Listing Their Homes? There’s One Big Reason They’re Stalling, Realtor.com (May 4, 2023), https://www.realtor.com/news/trends/why-potential-sellers-arent-listing-their-homes [https://perma.cc/TKC5-PAUB] (“About a fifth of homeowners in February [2023] reported they were concerned about slowing buyer demand in their area and that sellers aren’t receiving good offers.”). Second, and relatedly, when the government resorts to a taking, the property owner suffers an additional psychic harm through the coercive practice.265Stern & Lewinsohn-Zamir, supra note 31, at 111. So, if just compensation is determined by a court’s estimation of fair market value, that compensation will systematically undercut what the property owner feels they lost.266Id. If all of that is true in the takings context, presumably it could hold true in the damages context too. In both cases, the property owner is given money in the face of their refusal to relinquish a property right.267Indeed, it is possible that certain changes in property law and entitlements at the hands of a court could constitute a taking. See James E. Krier, Judicial Takings: Musings on Stop the Beach, 3 Brigham-Kanner Prop. Rts. Conf. J. 217, 221 (2014) (“Judicial takings are solely concerned with court decisions that reallocate existing property rights by changing established property doctrine.”).

Still, the fact that someone subjectively would feel immense loss from having a property right transformed to a liability right does not by itself mean that their preference should determine proper compensation. If it did, then we could hardly hope to escape the holdout problem. A property owner who deeply cherishes the restrictive covenant could be entitled to such a substantial damages award for the loss of the covenant that developers would seldom find it profitable to build and risk the lawsuit. If all that mattered were property owner expectations, that might be fine. But if the public interest factors into the balance of the equities, we need another approach.

iv.  Four Features of the Damages Action

To address this problem, I propose a compensation method with four key features.268In the United Kingdom, a common method is to estimate the amount the parties would have reasonably negotiated for. Amec Devs. Ltd. v. Jury’s Hotel Mgmt. (UK) Ltd. [2000] EWHC (Ch) 454 [12] (Eng.). In my view, this method risks overinflating damages awards, in light of the premium owners tend to demand for goods because of the Mere Ownership Effect. In the U.S., the common method is to estimate the degree the value of the plaintiff property owner’s property decreased from the violation. See, e.g., Garrett v. City of Topeka, 916 P.2d 21, 36 (Kan. 1996) (applying the principle in the context of an inverse condemnation action). This method, in my view, may fail to consider the psychic harms to the plaintiff rooted in the principle of loss avoidance and the Mere Ownership Effect. The first is that the property owner ought to be allowed to have a jury decide compensation if they so choose. Some jurisdictions already allow for this, even if they are not required to as a matter of constitutional law.269The Seventh Amendment does not generally guarantee the right to a jury trial in civil cases in state courts. Great Lakes Gas Transmission Ltd. P’ship v. Markel, 573 N.W.2d 61, 63–64 (Mich. Ct. App. 1997). However, states often allow the right to trial by jury for cases alleging a violation of a restrictive covenant when the plaintiff seeks damages. Ingledue v. Dyer, 937 P.2d 925, 930–31 (Haw. Ct. App. 1997) (noting that Hawai’i gives this right when damages are involved); Glover v. Santangelo, 690 P.2d 1083, 1085 n.3 (Or. Ct. App. 1984). A jury trial could in theory soften the effect of the coercion premium. Each member of the jury is someone who could own property or reside in a home that is part of a neighborhood with covenants.270McCandless v. Pease, 465 P.3d 1104, 1120 (Idaho 2020) (“The American tradition of trial by jury, considered in connection with either criminal or civil proceedings, necessarily contemplates an impartial jury drawn from a cross-section of the community.” (quoting Thiel v. S. Pac. Co., 328 U.S. 217, 220 (1946)). Because each of them could find themselves in the same situation, they could sympathize with the property owner’s subjective plight. Conversely, the jury could be a useful tool to moderate the intensity of the property owner’s preferences. Although the jury could have sympathy for the property owner, the fact that the jury is drawn from a “cross-section” of the community271Id. means that it is less likely to be swayed by a property owner with an unusual attachment to the single-family covenant.272Of course, attorneys could seek to form a more favorable jury for their client through voir dire, but such efforts take place in an adversarial setting and are thus open to both sides. And it would not be unique to this issue alone.

The second and related feature of my compensation regime is that the property owner ought to be allowed to present evidence explaining why and to what degree they value their right to prevent the building of the structure at issue above market. In takings cases, most often the measure of just compensation is the market value of what was taken (that is, for the purposes of the issue at hand, the right to enforce the covenant in that particular instance), or the difference in fair market value between the plaintiff’s entire property before and after the taking.273See, e.g., U.S. v. 50 Acres of Land, 469 U.S. 24, 25–26 (1984); Twp. of Chester v. Commonwealth, 433 A.2d 1353, 1354–55 (Pa. 1981). But of course in tort actions and elsewhere across the common law, other factors (such as pain and suffering and other psychic harms) can enter the equation to determine what would make an injured party whole.274Kahrar v. Borough of Wallington, 791 A.2d 197, 204–05 (N.J. 2002); Miranda v. Said, 836 N.W.2d 8, 22–23 (Iowa 2013) (emotional distress damages from legal malpractice); Gates v. Richardson, 719 P.2d 193, 200 (Wyo. 1986); Howard v. Lecher, 366 N.E.2d 64, 65 (N.Y. 1977) (citing Johnson v. State, 334 N.E.2d 590, 593 (N.Y. 1975)) (physical harm not a necessary prerequisite).

The third feature of my compensation regime is that the damages should be based on the harm from the construction of the precise structure or development the covenant-violating landowner builds, not from an extinguishing of the right to enforce the covenant against any other property bound by it.275Courts dealing with property harms generally attempt to award damages that would make the plaintiff whole. See Ruiz v. Varan, 797 P.2d 267, 270 (N.M. 1990). There are multiple routes to reach that goal, such as measuring damages by the diminution in property value or by the cost to restore the property to its former state. Id.; Thompson v. King Feed & Nutrition Serv., Inc., 105 P.3d 378, 381 (Wash. 2005); Romine v. Gagle, 782 N.E.2d 369, 383 (Ind. Ct. App. 2003). In other words, someone who wants to construct a medium or large apartment building may have to pay more in damages than a homeowner who wants to add an accessory dwelling unit behind their house, or convert their house into a duplex. If the central harm from the non-enforcement of a single-family covenant is more rooted in the change that could result from the specific covenant violation, as opposed to the weakening the covenant in the abstract, then it makes sense for the damages due to reflect that.276This feature of the compensation system could help limit dramatic covenant violations. It is unlikely to result in the construction of massive apartment buildings in otherwise sparsely populated regions because, in such circumstances, the neighboring property owners would likely suffer greater psychic harm. (Moreover, from a financial standpoint, a builder would likely only find it profitable to build a larger structure in regions where demand for such housing would be substantial. Ten-story buildings in rural areas are unlikely.).

The fourth and final feature of the compensation regime is statutory ranges or caps on damages, if not inconsistent with state law where the regime would be implemented. Specifically, the legislative body should prescribe either a range or a maximum damages award per property owner that considers the nature of the new structure, the nature of the surrounding structures, and the distance from the new structure to the property of the aggrieved owner. In other words, the legislative body could, for example, provide a range of permissible damages awards for duplexes built in single-family neighborhoods, which would be slightly lower in value than the range of permissible awards for quadplexes in the same neighborhood, which would be lower than that for ten- to twenty-unit apartment buildings in the same neighborhood, and so on.277The general guiding principle would be that the greater a deviation from the neighborhood state the covenant would have preserved, the greater the damages could be. Prescribing a range instead of specific values enables courts or juries to consider various other factors that might make the new structure more or less harmful to the aggrieved property owner in a given case. And prescribing a range instead of allowing any damage award whatsoever accomplishes at least two things: (1) it gives builders some amount of predictability and thus confidence about whether it will likely be financially feasible for them to move forward and build the non-conforming structure;278Cf. Alan E. Garfield, Calibrating Copyright Statutory Damages to Promote Speech, 38 Fla. State U. L. Rev. 1, 6–7 (discussing the relative unpredictability created by damages measures that allow for more discretion by the court). and (2) it guards against the risk that some juries might offer astronomical awards even for minor departures from single-family use restrictions—disproportionately burdening builders of smaller structures who likely have less capital on hand. Many states have addressed whether statutory caps on noneconomic compensatory damages are constitutional as a matter of their state’s law.279MacDonald v. City Hosp., Inc., 715 S.E.2d 405, 421–22 (W. Va. 2011). In most states such limitations would likely be constitutional,280See id.; see also McClay v. Airport Mgmt. Servs., LLC, 596 S.W.3d 686, 688 (Tenn. 2020); Evans v. State, 56 P.3d 1046, 1050–57 (Alaska 2002); Judd v. Drezga, 103 P.3d 135, 139–45 (Utah 2002); Etheridge v. Med. Ctr. Hosp., 376 S.E.2d 525, 529 (Va. 1989). but in the minority of states where they would not be, this compensation regime would have to operate without express limits on the compensatory damages that a plaintiff property owner could obtain.281In such circumstances, I would still consider the damages regime beneficial, though it would lack some of the predictability advantages described immediately above.

It would be difficult here to declare the specific dollar value ranges appropriate for all possible cases. A legislative body could benefit from thorough input on that matter from builders and property owners. But, importantly, whatever ranges the legislative body chooses must not allow for such significant damages awards that builders would rarely bother moving forward with relatively noninvasive multi-family projects for which there is market demand. To make it just a bit more concrete: in regions where demand for more housing is high but not astronomical, damages awards in the range of several hundred dollars for the nearest neighboring properties may be appropriate for converting one home to a small handful. The value could scale up for larger construction projects and scale down for neighbors located farther from the project. Perhaps three-digit damages would strike many people as surprisingly low (not to mention the two-digit damages potentially in play for more distant neighbors). But the key here is that the damages award would, again, be provided in response to one covenant violation, not for termination of the covenant all together. The same or a similar amount in damages could be in play the next time around if another neighbor plans to develop housing in violation of the covenant too. And from the builder’s perspective, it is possible that they will have to pay every other property owner within the HOA in some form, either through damages or as part of the initial lump-sum payment. Thus, if individual neighbors are entitled to too high of a value in compensation, rarely would builders go through any of this process at all (except in cases of extremely high potential profit from the project).

These four features of the process of calculating damages should go far towards ensuring that property owners’ psychic harm is taken seriously, but not given so much weight that gridlock can persist as it has. And, as the next Section explains in more detail, from a psychological standpoint, it gives property owners a voice in the process—a crucial component to minimizing psychic harm throughout the procedural stages themselves.282Stern & Lewinsohn-Zamir, supra note 31, at 92, 96.

One thorny issue still remains: the proper timing for such a damages action. Many HOA CC&Rs specify time limits on how long construction projects may take; these are often one-year limitations.283See, e.g., Conditions, Restrictions, Easements and Set Back Lines, Westhaven, Polk Cnty., Or. ¶ 10 (Sept. 15, 1986) (on file with author). And HOAs may levy reasonable fines for failure to comply with such a deadline.284Restatement (Third) of Prop.: Servitudes § 6.5 (Am. L. Inst. 2000) (Statutory Note); Morningside Crescent Ct. Condo. Ass’n v. Nayak, No. 2-15-1126, 2016 Ill App. Unpub. LEXIS 1908 at **12 (explaining that a fine must be reasonable). That being so, an action for damages could ripen whenever covenant terms have been violated.285Generally, for a plaintiff to have standing to sue, they must have suffered an injury recognized by law or show that such an injury is imminent. Leiendecker v. Asian Women United of Minn., 731 N.W.2d 836, 841 (Minn. Ct. App. 2007) (citing State v. Colsch, 284 N.W.2d 839, 841 (Minn. 1979)); Knittle v. Progressive Cas. Ins. Co., 908 P.2d 724, 725–26 (Nev. 1996). The timing of the action discussed in this Article is meant to comply with that general requirement for both legal and practical reasons (i.e., so that damages will be easier to determine). For an express single-family covenant, that means the plaintiff can sue once a multi-family structure is built out to a habitable standard (that is, when the defendant no longer uses the property as a single-family residence as the covenant requires).286Note that this approach to timing may differ from when a plaintiff could (and, indeed, must) sue to enjoin a covenant violation. See, e.g., Hidalgo v. 4-34-68, Inc., 988 N.Y.S.2d 64, 66–67 (N.Y. App. Div. 2014). For other restrictions that have the effect of only allowing for one single-family unit, such as minimum-square-footage requirements, an action could ripen as soon as the new construction has caused a violation. The HOA’s time limits on construction can serve as separate restrictions for which fines could accrue to the HOA daily after the deadline passes. This acts as an incentive for builders not to unduly delay and thus force aggrieved neighbors to delay their compensation. Yet, it is possible that HOAs could limit construction times so stringently that few multi-family projects could go forward.287Some have six-month limitations, Declaration of Restrictions on Mountain Fir Estates, Independence, Polk Cnty., Or. 4 (Aug. 5, 1999) (on file with author), and some could be even quicker. To dodge this counterpunch, localities or states may need additional legislative provisions directly targeted at allowing reasonable construction times.

3.  Why an Opt-In Mechanism?

There is, however, one more psychological observation that bears on the question of the proper mechanism for escaping a single-family covenant, for which a damages approach alone might not meaningfully account. It is the fact that even property owners who lose an entitlement tend to feel that they have not lost as much if they (1) were given voice in the process,288Stern & Lewinsohn-Zamir, supra note 31, at 96. and (2) were not singled out for negative treatment.289Id. at 98. To account for the coercion harm and singling-out harm, I suggest a new legal mechanism that would chronologically precede the damages schema described above.

Psychologists have conducted specific research on the psychological effects of takings, and of procedural legal processes more generally. They have found that people often care as much about being treated with dignity and respect during legal processes as they care about the end result.290Id. at 96. Likewise, people are averse to legal processes that single them out for unfavorable treatment, even if it is in the name of the public interest.291Id. at 98. That means that people are likely to suffer psychic harm if they are coerced into a situation in which they are disadvantaged for the sake of some public good but others against whom they compare themselves are not.292Id.

Property owners who expect to be able to limit fellow HOA members to single-family use, but then can only collect market price for the right instead, might feel that their particular interests were steamrolled on behalf of some public good. That does not mean the damages approach is inappropriate, but it does beg the question of whether that particular form of psychic harm could be minimized along the way.

My proposal is one that gives landowners voice, but not veto. The legal mechanism is created by statute and triggered when an owner of a property within an HOA seeks to build in violation of a single-family covenant, and that area is not subject to single-family zoning. The builder can initiate a decision by each member of the HOA. It offers a single lump-sum value in exchange for the right to violate the covenant. Each member decides whether to opt in to receive a portion of the sum. The sum is divided among all HOA members who opt in, and in exchange, those members and the HOA itself relinquish the right to challenge the building. For those that do not opt in, the damages approach will apply.293See supra Section II.B.1. If they object to the construction, they can roll the dice and collect damages in court.

This legal mechanism draws inspiration from land assembly districts (“LADs”), though it has some key differences. LADs seek to break another form of property gridlock.294Heller, supra note 39, at 118–21. Sometimes a large development that would span several individual land parcels would generate more economic value or public benefit than the sum of all the individual parcels under their current use.295Id. But because each individual parcel owner can refuse to sell, a single holdout can sink a socially beneficial development. LADs tackle this problem.296Id. When a state or locality authorizes an LAD, it gives the neighborhood the power negotiate a sale of all the land within it—either by majority or supermajority vote, or by the appointment of a board to negotiate the sale.297See Michael Heller & Rick Hills, Land Assembly Districts, 121 Harv. L. Rev. 1465, 1488–92 (2008). The dissenting property owners can opt out, but then they would still lose their land to the project by eminent domain (and in exchange, receive just compensation from the government).298See id. All other properties effectively receive a pro rata share of the overall sale price.299Heller, supra note 39, at 120.

My proposal is similar to the LAD mechanism in two important ways. One, it allows for a lower-transactions-costs method of negotiating a selling price than bargaining with each individual entitlement holder. Two, it provides an incentive against holdouts,300Because the lump sum would be divided between only those who opt in to receive it, in a sense the sum itself could act as a form of “commons,” encouraging individual owners to take a slice in fear of missing out on their share. See supra notes 32–34 and accompanying text. But, of course, the more owners who do so, the less each one will receive. and ultimately can move forward whether there are individual holdouts or not. My proposal differs from the LAD mechanism because whereas with a LAD the owners have the ultimate say by majority vote over whether the land collection is sold at all,301See Heller & Hills, supra note 297, at 1488–92. in my proposal for restrictive covenants the development may move forward either way.302I do not see this diversion from the standard LAD procedure to be an absolutely essential component of the compensation regime. I favor it because I suspect that the psychological phenomena leading people to resist change and cherish their restrictive covenant rights are sufficiently strong to (often) prevent a majority of HOA members from acquiescing to the new development. But it is possible that, especially in larger HOAs, many members, particularly the ones located farther away from the development, would be content to collect their portion of a lump sum and leave the resistance to a minority of homeowners who are closer to the development. But the property owners can choose whether they prefer compensation through the slice of the developer’s up-front offer or whether they would rather leave their compensation to litigation and jury determination within the predefined damages range.

A psychological benefit of this approach is that it gives the property owners a sense of ownership over whatever result they reach. Those who opt in to the developer’s offer choose to do so and thus escape any acute harm from direct coercion. But even those who do not opt in could to an extent feel that their interests mattered. True, they were not able to stop the development simply by speaking up. But when they ultimately are only entitled to damages in court, that result is one they had some choice in bringing about; they are in the same boat as all other HOA members in that respect.

Of course, this procedural mechanism will interplay with the damages backdrop. If the builder has reason to believe that juries would give more favorable awards to neighboring property owners and strongly compensate for any unquantifiable psychic harm (that is, select higher values within the predefined damages ranges), then the builder might want to avoid such damages actions. They therefore would have an incentive to offer more in the initial lump sum to try to persuade owners to opt in and thus relinquish their rights to a later suit. Conversely, if the builder offers too much in the lump sum, but the neighborhood contains many holdouts, then the builder might end up substantially paying in the lump sum and still face the prospect of numerous damages suits, after which it would pay out again and again. And from the neighboring property owners’ perspectives, the lower the value a jury is likely to award within the damages range, the more likely the neighbors will be encouraged to accept a lump sum monetary offer by the builder up front. But the crucial point is that the two-stage solution, while guaranteeing that a builder can develop multi-family housing if it is determined to do so, also provides neighboring landowners both a degree of ownership over how their interests are credited and a mechanism for targeted compensation. This approach avoids holdouts and loosens housing gridlock in a way that takes property interests and people’s psychological attachments to them seriously.303If enough property owners within an HOA followed the procedures I have laid out to create more housing, might the community reach a critical mass, so to speak, by which the covenants could be considered unenforceable under the common law because of changed conditions, or even abandonment? See supra Section II.A.3. A court probably would not conclude so. Through the process I have laid out, each property owner with a right to enforce the covenant either can do so through the damages action, or they will assert their legal right of enforcement by expressly opting into the lump sum instead. Neighbors thus would not have sat back passively while covenants were violated. In my view, that fact would count against a finding of abandonment, acquiescence, or anything similar. As for changed conditions: the framework I have presented is meant to facilitate a change in conditions. If at a certain point, the framework is “shut off” because the covenants simply are deemed unenforceable for changed conditions, then that would in part undermine the balance I have tried to strike. At the very least, it would mean that neighbors would probably be entitled to more damages for every violation, because each violation would be chipping away at the enforceability of the covenant against anyone in the community. Rather than elevate damages accordingly, I think it makes more sense to simply ignore the changed circumstances doctrine wherever my schema is in place.

Of course, when a jury or relevant decision maker pinpoints the appropriate damages award within the statutory range in a given instance, it could take into account the changed circumstances of the neighborhood to determine how much the suing neighbor is harmed by the one additional covenant violation at hand (the first property in a single family neighborhood converted to a six-unit building is more striking of a change than the hundredth such domino to fall).

CONCLUSION

The housing crisis is complex. It is partly a legal and financial problem and, as this Article especially emphasizes, partly a psychological one. Even if homeowners have the economic incentive to build or allow for multi-family housing, and even if localities and states lift the “zoning straitjacket,”304See Robert C. Ellickson, The Zoning Straitjacket: The Freezing of American Neighborhoods of Single-Family Houses, 96 Ind. L.J. 395, 397 (2021). housing supply still could lag. Private restrictive covenants enshrining single-family housing curb housing production in many localities, potentially distressing millions of Americans and disproportionately harming those living in poverty. Psychological phenomena concerning how people develop attachments and perceive change conspire with powerful legal tools to hold these restrictions in place. The solution to this gridlock thus must account for those conspiring factors. This Article charts a possible course: an initial lump-sum offer from a builder with an opt-in mechanism for each HOA member, followed by a carefully prescribed damages action available to all remaining members. If we care deeply about both property owner expectations and providing an adequate supply of housing, this approach shows a way forward.

97 S. Cal. L. Rev. 1233

Download

* Assistant Professor of Law, Duquesne University Kline School of Law. I would like to thank Molly Brady, Charles Barzun, John Infranca, Peter Danchin, Michele Gilman, Will Moon, Anne-Marie Carstens, Aadhithi Padmanabhan, Alexi Pfeffer-Gillette, Chris Bryant, Matthew Sipe and Chelsea Banister for insightful comments. Thanks also to all the participants in the Maryland Law/Baltimore Law Junior Faculty Workshop for helpful feedback on an earlier draft. And thanks to the participants in the Richmond Junior Faculty Forum for helpful comments as well. I am grateful to Marc LeVan for valuable research assistance and to Tanya Thomas for research assistance at the earliest stages. Finally, thanks to the exceptional editors of the Southern California Law Review. Errors are mine.

Major Questions, Common Sense?

The Major Questions Doctrine (“MQD”) is the newest textualist interpretive canon, and it has driven consequential Supreme Court decisions concerning issues from vaccine mandates to environmental regulation. Yet, the new MQD is a canon in search of legitimization. Critics allege that the MQD displaces the Court’s conventional textual analysis with judicial policymaking. Textualists have now responded that the MQD is a linguistic canon, consistent with textualism. Justice Barrett recently argued in Biden v. Nebraska that the MQD is grounded in ordinary people’s understanding of language and law, and scholarship contends that the MQD reflects ordinary people’s understanding of textual clarity in “high-stakes” situations. Both linguistic arguments rely centrally on “common-sense” examples from everyday situations.

This Article tests whether these examples really are common sense to ordinary Americans. We present empirical studies of the examples offered by advocates of the MQD, and the results challenge the arguments that the MQD is a linguistic canon. Moreover, the interpretive arguments offered to legitimize the MQD as a linguistic canon threaten both textualism and the Supreme Court’s growing anti-administrative project.

INTRODUCTION

The Supreme Court’s most consequential interpretive canon is a new one: the major questions doctrine (“MQD”). The basic idea is as follows: when an agency undertakes a “major” policy action, the statutory authorization must be clear and specific (rather than unclear or general).1See infra Section I.A. In several high-profile cases, the Court has used the MQD to strike down agency actions involving vaccine mandates,2Nat’l Fed’n of Indep. Bus. v. Dep’t of Lab., Occupational Safety & Health Admin., 142 S. Ct. 661, 665–66 (2022). environmental regulation,3West Virginia v. EPA, 142 S. Ct. 2587, 2609–16 (2022). and student loan relief.4Biden v. Nebraska, 143 S. Ct. 2355, 2375 n.9 (2023). The majority opinion states that the issue is resolved by “statutory text alone,” and its appeal to the Major Questions Doctrine (“MQD”) “simply reflects [the] Court’s familiar practice of providing multiple grounds to support its conclusions.” Id. Given this track record, no wonder critics have argued that the MQD poses an existential threat to the administrative state, since few statutes are likely to provide the requisite clear language, and what constitutes “majorness” is subjective and potentially applicable to a wide range of agency actions.5See, e.g., Lisa Heinzerling, The Power Canons, 58 Wm. & Mary L. Rev. 1933, 1938 (2017). But see Kristin E. Hickman, The Roberts Court’s Structural Incrementalism, 136 Harv. L. Rev. F. 75, 76–77 (2022) (arguing that the development of the MQD is more incrementalist than critics have suggested and that it will likely not threaten the administrative state).

Despite its undeniable influence, the MQD is undertheorized, and it remains a canon in search of a justification.6See Mila Sohoni, The Major Questions Quartet, 136 Harv. L. Rev. 262, 285–87 (2022) (recounting but disagreeing with these efforts). Scholars and judges have splintered in their understanding of how the doctrine operates on statutory language.7See, e.g., Louis J. Capozzi III, The Past and Future of the Major Questions Doctrine, 84 Ohio St. L.J. 191, 219, 222–23 (2023). Compare West Virginia v. EPA, 142 S. Ct. at 2587, with West Virginia v. EPA, 142 S. Ct. at 2616 (Gorsuch, J., concurring), and Biden v. Nebraska, 143 S. Ct. at 2376 (Barrett, J., concurring). For instance, one advocate of the canon describes it as a requirement for a “clear and specific statement from Congress if Congress intends to delegate questions of major political or economic significance to agencies.”8Ilan Wurman, Importance and Interpretive Questions, 110 Va. L. Rev. 909, 909 (2024). As we discuss in Section I.B, Wurman’s characterization of the MQD as a clear statement rule notwithstanding, he views the MQD as justifiable as a linguistic canon. Two critics of the MQD have described it similarly as a rule requiring courts “not to discern the plain meaning of a statute using the normal tools of statutory interpretation, but to require explicit and specific congressional authorization for certain [major] agency policies.”9Daniel T. Deacon & Leah M. Litman, The New Major Questions Doctrine, 109 Va. L. Rev. 1009, 1009 (2023). In response, Justice Barrett in Biden v. Nebraska has denied that the MQD requires courts “to depart from the best interpretation of the text,” and claims that the canon is not a clear statement rule and does not require explicit congressional authorization of the “precise agency action under review.”10Biden v. Nebraska, 143 S. Ct. at 2378 (Barrett, J., concurring). These kinds of disagreements, while perhaps technical, influence how the doctrine is defended and employed, and even implicate its future as an interpretive canon.

So far, efforts to legitimize the doctrine have been unpersuasive. The canon is used primarily by self-identified textualists,11See, e.g., West Virginia v. EPA, 142 S. Ct. at 2616 (Gorsuch, J., concurring); Biden v. Nebraska, 143 S. Ct. at 2372–75; id. at 2376 (Barrett, J., concurring). but critics (textualist and non-textualist alike) have alleged that the MQD is inconsistent with textualism, or even is anti-textualist, because it displaces the ordinary meaning of statutory text in the name of normative values.12See, e.g., Sohoni, supra note 6, at 282–90; Daniel E. Walters, The Major Questions Doctrine at the Boundaries of Interpretive Law, 109 Iowa L. Rev. 465, 523–37 (2024); Chad Squitieri, Who Determines Majorness?, 44 Harv. J.L. & Pub. Pol’y 463, 480 (2021); Benjamin Eidelson & Matthew C. Stephenson, The Incompatibility of Substantive Canons and Textualism, 137 Harv. L. Rev. 515, 522–33 (2023); Mike Rappaport, Against the Major Questions Doctrine, The Originalism Blog (Aug. 15, 2022, 8:00 AM), https://originalismblog.typepad.com/the-originalism-blog/2022/08/against-the-major-questions-doctrinemike-rappaport.html [https://web.archive.org/web/20240728034527/https://originalismblog.typepad.com/the-originalism-blog/2022/08/against-the-major-questions-doctrinemike-rappaport.html]; Chad Squitieri, Major Problems with Major Questions, L. & Liberty (Sept. 6, 2022), https://lawliberty.org/major-problems-with-major-questions [https://perma.cc/2D3Y-AA4K]. In fact, the MQD’s rise coincides with a surge of skepticism among textualists and commentators about the validity of substantive canons generally.13See, e.g., Eidelson & Stephenson, supra note 12, at 517–21. Of course, textualist skepticism about substantive canons is not new. See, e.g., Amy Coney Barrett, Substantive Canons and Faithful Agency, 90 B.U. L. Rev. 109, 110 (2010). The Court’s use of the MQD even prompted Justice Kagan to retract her quip that “we’re all textualists now.”14Harvard Law School, The 2015 Scalia Lecture Series: A Dialogue with Justice Elena Kagan on the Reading of Statutes, YouTube (Nov. 25, 2015), https://youtu.be/dpEtszFT0Tg [https://perma.cc/L65V-9AET]. She now notes: “It seems I was wrong. The current Court is textualist only when being so suits it.”15West Virginia v. EPA, 142 S. Ct. at 2587, 2641 (Kagan, J., dissenting). See generally Kevin Tobia, We’re Not All Textualists Now, 78 N.Y.U. Ann. Surv. Am. L. 243 (2023) (providing an overview of the influence and evolution of “all textualist” statements). These critiques allege that the MQD inappropriately licenses textualists to depart from the best reading of statutory text in the name of values or norms. An ideal response for a textualist favoring the MQD would be some account of how the MQD determines the linguistic meaning of a statute.

Increasingly, textualists are making precisely this “linguistic” move. Some textualists now propose that the MQD is a linguistic interpretive canon, consistent with textualism.16See, e.g., Wurman, supra note 8, at 916–17; Biden v. Nebraska, 143 S. Ct. at 2376 (Barrett, J., concurring). On this account, textualists remain committed to the ordinary reader’s understanding of language, with the MQD simply reflecting how ordinary people, exercising basic “common sense,” generally understand the meaning of statutes delegating authority to agencies.17See, e.g., Amy Coney Barrett, Congressional Insiders and Outsiders, 84 U. Chi. L. Rev. 2193, 2208–11 (2017) (arguing for statutory interpretation to focus on the understanding of ordinary people rather than Congress). On this “linguistic” picture, normative or substantive values are not relevant to the canon or its application, and they certainly do not lead textualists to depart from the best reading of the text. Instead, the MQD is just like any other linguistic canon—it reflects only a generalization about how ordinary people use and understand language in context.18On the modern textualist Court’s emphasis on ordinary readers and the relationship between ordinary understanding and linguistic canons, see Kevin Tobia, Brian G. Slocum & Victoria Nourse, Statutory Interpretation from the Outside, 122 Colum. L. Rev. 213, 213 (2022) [hereinafter Tobia et al., From the Outside]. This rebranding of the MQD as a linguistic canon has rapidly moved from the pages of law reviews19See Wurman, supra note 8, at 909. to the Supreme Court.20Biden v. Nebraska, 143 S. Ct. at 2376 (Barrett, J., concurring). There, Justice Barrett recently denied that the MQD is normatively driven and instead argued that it merely reflects ordinary people’s “common-sense” understanding of instructions, including those given by Congress.21Id. at 2384; see also Beau J. Baumann, Let’s Talk About That Barrett Concurrence (on the “Contextual Major Questions Doctrine”), Yale J. on Reg.: Notice & Comment Blog (June 30, 2023), https://www.yalejreg.com/nc/lets-talk-about-that-barrett-concurrence-on-the-contextual-major-questions-doctrine-by-beau-j-baumann [https://perma.cc/8PKB-458K] (discussing Barrett’s arguments).

In this Article, we evaluate the MQD’s “linguistic turn” and subject its premises to empirical study. We study two key issues: (1) Does the MQD follow from ordinary people’s understanding of language and, more specifically, delegating instructions?; and (2) Do ordinary people interpret more cautiously or narrowly in “high-stakes” situations? The empirical results support answering “no” to both questions. Contrary to the MQD proponents’ contentions, the results indicate that ordinary people do not adjust their judgments of textual clarity according to the stakes of interpretation, and they interpret broad delegations broadly, even in situations in which Justice Barrett claims that “common sense” would dictate narrower interpretations of the scope of authorization.22See infra Part III.

Part I introduces the MQD and the two linguistic arguments that have been offered in defense of the canon. After briefly addressing the defense of the MQD as a substantive canon in Section I.A, we turn in Section I.B to the proposal that ordinary interpretation shifts in “high-stakes” contexts, and that this behavior justifies the MQD as a linguistic canon.23See Wurman, supra note 8, at 917. The high-stakes argument appeals to an example from analytic philosophy24See, e.g., Keith DeRose, Contextualism and Knowledge Attributions, 54 Phil. & Phenomenological Rsch. 913, 913–18 (1992). and prior legal scholarship25See Ryan D. Doerfler, High-Stakes Interpretation, 116 Mich. L. Rev. 523, 523 (2018). that suggests that high-stakes contexts diminish ordinary knowledge. Thus, as a famous hypothetical illustrates, you might know that the town bank is open on the weekend when planning to deposit a small check with low stakes. In contrast, in a higher-stakes context (for example, if the check is for ten thousand dollars and must be deposited before Monday to avoid an overdraft), you may decide instead that you do not really know that the bank is open. Legal scholarship proposes that this is how ordinary people understand knowledge: ordinary knowledge is stakes sensitive.26See, e.g., Wurman, supra note 8, at 957–59. More importantly for the MQD, an emerging argument builds on this premise to suggest that ordinary understanding of textual clarity is also stakes driven: in high-stakes contexts, a text is less clear.27See id. As such, in those high-stakes (or “major”) cases, courts should require highly specific language to authorize agency action.

Section I.C introduces Justice Barrett’s separate proposal that ordinary language is context sensitive and anti-literal, and therefore a textualist faithful to the ordinary reader should adopt the MQD as a means to determine the best reading of statutory language.28Biden v. Nebraska, 143 S. Ct. 2355, 2376 (2023) (Barrett, J., concurring); see also Barrett, supra note 17, at 2200 (on textualists’ commitment to the ordinary reader, not the ordinary legislator). Justice Barrett’s argument also appeals to an intuitive example: instructing a babysitter to “have fun with the kids” while handing him a credit card might literally permit the babysitter to take them on an overnight trip to an out-of-town amusement park (after all, doing so would be “fun”). But in context, ordinary people employ “common sense” and understand the literal meaning of the instruction to only permit the most reasonable set of applications of the instruction.29Biden v. Nebraska, 143 S. Ct. at 2376 (Barrett, J., concurring). Ordinary people are therefore non-literalists, understanding general delegations to be more limited in meaning than their terms alone might suggest. As such, the argument goes, the MQD is “consistent with how we communicate conversationally,” making it a valid linguistic canon that reflects an interpretive commitment to ordinary people.30Id. at 2379.

Justice Barrett’s argument is important and places her as a leader among the Court’s textualists; she is the only textualist advocate of the MQD who has offered a proposal to square the MQD with textualism. At the same time, the linguistic argument in her brief concurring opinion is not entirely clear. As such, we attempt to charitably reconstruct Justice Barrett’s defense as a workable argument—that is, one that derives the MQD conclusion from the babysitter hypothetical premise.

Part I contributes to the literature by explaining these two new arguments for the linguistic MQD in sufficient detail. Unpacking the arguments clarifies each argument’s theoretical challenges and empirical claims. Both arguments employ hypotheticals about how ordinary people interpret language but, significantly, support these hypotheticals with references to academic philosophy or judicial intuition; neither uses empirical evidence.

Parts II and III investigate these empirical claims, both by engaging with the existing empirical literature on high-stakes knowledge (much of it uncited by proponents of the linguistic MQD) and by conducting original survey experiments of both high-stakes interpretation and how ordinary people interpret instructions. Part II considers the claim that ordinary knowledge is stakes sensitive. This claim has been influential in philosophy,31See, e.g., Keith DeRose, Contextualism, Contrastivism, and X-Phi Surveys, 156 Phil. Stud. 81, 81 (2011). legal scholarship,32Doerfler, supra note 25, at 523. and now the major questions debate.33Wurman, supra note 8, at 917. Although philosophers claim knowledge is stakes sensitive, many existing studies report that stakes have little or even no effect on ordinary attributions of knowledge.34See generally Jonathan Schaffer & Joshua Knobe, Contrastive Knowledge Surveyed, 46 Noûs 675 (2012) (surveying studies). Other studies report only a small effect. See, e.g., David Rose, Edouard Machery, Stephen Stich, Mario Alai, Adriano Angelucci, Renatas Berniūnas, Emaa E. Buchtel, Amita Chatterjee, Hyundeuk Cheon, In-Rae Cho, Daniel Cohnitz, Florian Cova, Vilius Dranseika, Ángeles Eraña Lagos, Laleg Ghadakpour, Maurice Grinberg, Ivar Hannikainen, Takaaki Hashimoto, Amir Horowitz, Evgeniya Hristova, Yasmina Jraissati, Veselina Kadreva, Kaori Karasawa, Hackjin Kim, Yeonjeong Kim, Minwoo Lee, Carlos Mauro, Masaharu Mizumoto, Sebastiano Moruzzi, Christopher Y. Olivola, Jorge Ornelas, Barbara Osimani, Carlos Romero, Alejandro Rosas Lopez, Massimo Sangoi, Andrea Sereni, Sarah Songhorian, Paulo Sousa, Vera Tripodi, Naoki Usui, Alejandro Vásquez del Mercado, Giorgio Volpe, Hrag Abraham Vosgerichian, Xueyi Zhang & Jing Zhu, Nothing at Stake in Knowledge, 53 Noûs 224, 232–37 (2019) (reporting no effect of stakes on knowledge in fifteen countries, a small effect in three, and a marginal and small effect in the U.S.). For example, in the U.S., over 80% of participants agreed in both the high- and low-stakes cases that there was knowledge; in Japan, a country with the largest difference between high and low stakes, over 70% of participants attributed knowledge in both. Id. And, to our knowledge, no empirical study bears on the question of whether higher stakes reduce textual clarity (a related but different issue). The critical link in one version of the linguistic MQD argument is therefore entirely untested.

Part III presents studies designed to test the empirical claims of the linguistic MQD arguments. Our studies use the exact two cases offered by proponents of the linguistic MQD—the “bank case” and the “babysitter hypothetical”—to conduct original survey experiments. Overwhelmingly, ordinary people in our studies did not interpret these scenarios consistently with the empirical premises of the linguistic MQD arguments.

Part IV develops three sets of implications that follow from our empirical evidence and the textualist efforts to legitimize the MQD as a linguistic canon. These implications concern the empirical evidence for the linguistic MQD (IV.A), challenges that the linguistic MQD poses for textualism (IV.B), and the relationship between empirical evidence of how ordinary people view delegations and administrative law, including intriguing evidence that people are more concerned about underenforcement of instructions compared with overenforcement (IV.C).

In brief, the extant and new empirical findings do not support the linguistic MQD. Specifically, the findings count against the predictions of the two leading linguistic MQD arguments, using the exact cases offered in defense of the linguistic MQD. Of course, we are open to the possibility that study of further examples could weigh against our conclusions. But for interpreters deciding today whether to employ a “linguistic MQD,” there is insufficient empirical support and theoretical clarity to cast the MQD as a valid linguistic canon. Moreover, the results provide stronger support for a new counter-MQD: ordinary people understand general authorizing language as consistent with a broad range of reasonable actions that fall under the text’s meaning. Textualists committed to the “ordinary reader” and “interpretation from the outside” claim to follow those commitments to where they lead—and the current evidence favors an interpretive rule far from the current MQD.35Barrett, supra note 17, at 2208–11 (arguing that courts should interpret from the “outside,” from the perspective of ordinary people, rather than from the “inside,” which would reflect Congress’s perspective).

I.  THE MAJOR QUESTIONS DOCTRINE AND THEORIES OF ITS LEGITIMACY

The MQD has sparked a great deal of scholarly effort to specify exactly what the doctrine is and how it fits into traditional categories of interpretive doctrine. In this Part, we survey these efforts, many of which conclude that the MQD is a substantive, or normative, canon.36See infra Section I.A. These classifications matter because substantive canons are increasingly questioned as being inconsistent with textualism.37See Eidelson & Stephenson, supra note 12, at 517–21; Barrett, supra note 13, at 110. But see Brian G. Slocum & Kevin Tobia, The Linguistic and Substantive Canons, 137 Harv. L. Rev. F. 70, 70–73 (2023) (arguing that an interpretive canon can have both a linguistic and substantive basis). Classifying the MQD as substantive (rather than linguistic) is tantamount to saying it is illegitimate or tenuous, at least on textualist grounds.38But see Walters, supra note 12, at 469–73 (assuming that substantive canons are often acceptable but arguing that the MQD has features that differentiate it from the rest of the canons in troubling ways). Perhaps not surprisingly, some textualist defenders of the MQD have not fully endorsed the idea that the MQD is a substantive canon.39Wurman, supra note 8, at 912. The exception here is Justice Gorsuch, who offered a full-throated endorsement of the MQD as a nondelegation canon in his concurrence in West Virginia v. EPA. See West Virginia v. EPA, 142 S. Ct. 2587, 2617 (2022) (Gorsuch, J., concurring). In fact, as we discuss below, perhaps the most serious attempt to ground the MQD in interpretive law asserts that the doctrine is instead a linguistic, or semantic, canon.40See infra Sections I.B & I.C. In theory, at least, this move would legitimize the canon for textualists and everyone else because the doctrine would simply be folded into the relatively uncontroversial search for the ordinary meaning of delegating statutes.41Wurman, supra note 8, at 916. For a discussion of “ordinary meaning,” see Brian G. Slocum, Ordinary Meaning: A Theory of the Most Fundamental Principle of Legal Interpretation 1–5 (2015).

This pivot to a linguistic defense raises many questions, very few of which have been answered. After describing how the linguistic defense works, we then highlight theoretical limitations, open questions, and the broader implications of defending the MQD as a linguistic canon.

A.  The Canonization of the Major Questions Doctrine

1.  Historical Threads of the Major Questions Doctrine

The MQD is not entirely new; it is in the process of “metamorphosis.”42Walters, supra note 12, at 480–81. It is also, of course, the talk of the town because of fears/hopes that it will be deployed in such a way as to “kneecap” administrative agencies and promote an economic, libertarian conception of American governance. See Matt Ford, The Supreme Court Conservatives’ Favorite New Weapon for Kneecapping the Administrative State, New Republic (Mar. 13, 2023), https://newrepublic.com/article/171093/supreme-court-major-questions-doctrine-administrative-state [https://perma.cc/R3FJ-GVN8]; John Yoo & Robert Delahunty, The Major-Questions Doctrine and the Administrative State, Nat’l Affairs (Fall 2022), https://www.nationalaffairs.com/publications/detail/the-major-questions-doctrine-and-the-administrative-state [https://perma.cc/7NYU-M8FJ]. Arguably, the first appearance of something like the MQD was in the plurality opinion in a 1980 case known as the Benzene Case.43Indus. Union Dep’t v. Am. Petrol. Inst., 448 U.S. 607, 614–15 (1980) [hereinafter Benzene Case]. In that case, the Occupational Safety and Health Administration (“OSHA”) was charged with promulgating standards that “most adequately assure[], to the extent feasible, on the basis of the best available evidence, that no employee will suffer material impairment of health or functional capacity even if such employee has regular exposure to the hazard dealt with by such standard for the period of his working life.”4429 U.S.C. § 655(b)(5). Rather than follow OSHA’s argument that the statute, fairly read, seemed to require it to “impose standards that either guarantee workplaces that are free from any risk of material health impairment, however small, or that come as close as possible to doing so without ruining entire industries,” the plurality opinion held that OSHA had only been delegated authority to regulate “significant” risks.45Benzene Case, 448 U.S. at 641, 651.

As Cass Sunstein notes, although the Court invoked the nondelegation doctrine and constitutional avoidance to arrive at this statutory interpretation, it is impossible to square what the Court did with the “(standard) nondelegation doctrine.”46Cass R. Sunstein, There Are Two “Major Questions” Doctrines, 73 Admin. L. Rev. 475, 484 (2021) (calling the MQD a “linear descendant” of the Benzene Case). This relatively recent vintage has been contested by Louis Capozzi, who argues that the Supreme Court deployed the MQD in a series of rate cases in the late 19th Century. Capozzi, supra note 7, at 196–97. However, this analogy has itself been contested. See Capozzi on the Future of the Major Questions Doctrine, Admin Wannabe (Oct. 19, 2022), https://adminwannabe.com/?p=114 [https://perma.cc/FK6S-MGZW]. The interpretation offered by OSHA, in addition to doing little violence to the text of the statute, would “sharply cabin” the agency’s discretion.47Sunstein, supra note 46, at 486. Sunstein suggests that the plurality opinion in the Benzene Case instead endorsed the novel idea that “without a clear statement from Congress, the Court will not authorize the agency to exercise that degree of (draconian) authority over the private sector.”48Id.

It was hardly clear at the time, however, that the Court was creating something called the “major questions doctrine”; in fact, that would not become clear until very recently. Instead, for several decades, the Court intermittently invoked similar, but often distinct, reasoning from the Benzene Case in regulatory cases involving “extraordinary” circumstances, all while leaving the precise theory behind the reasoning unstated. Paradigmatic of these invocations is FDA v. Brown & Williamson Tobacco.49FDA v. Brown & Williamson Tobacco Corp., 529 U.S. 120, 120 (2000). In that case, the Food and Drug Administration (“FDA”) promulgated a rule regulating tobacco products as “drugs” under the Food, Drug, and Cosmetics Act. The Court applied the familiar Chevron two-step analysis and concluded, on the basis of an examination of legislative history, that Congress had unambiguously declined to give the FDA this power.50Id. at 160–61. The Court added another reason for its conclusion, though, stating that “[i]n extraordinary cases . . . there may be reason to hesitate before concluding that Congress has intended . . . an implicit delegation.”51Id. at 159.

As the “implicit delegation” phrase reveals, the Court explicitly couched its consideration of the “majorness” or “extraordinariness” of the power asserted by the FDA as part of the Chevron analysis. Thus, the MQD acted as a “carve-out” or “exception” to the ordinary rule that statutory ambiguities constitute implicit delegations that an agency is given primacy over courts to resolve, so long as it does so reasonably.52Sunstein, supra note 46, at 482. Instead, when “extraordinary” questions are presented by the agency’s claim of delegated authority, the Court itself resolves the ambiguity at Chevron step one.53Id.

The Brown & Williamson opinion’s use of proto-MQD logic departed from the apparent logic of the Benzene Case in an important way. The Benzene Case left little room for an agency interpretation to survive once the doctrine was triggered. The only way to prevail was to point to clear statutory authorization that could not be limited by the Court to avoid the major implications of the agency’s interpretation. Sunstein calls this the “strong version” of the MQD.54Id. at 486. By contrast, in Brown & Williamson, Sunstein sees a “weak version” that theoretically allowed an agency’s major action so long as the statutory interpretation could be endorsed by a Court engaged in independent (de novo) review without according the agency any deference.55Id. at 484.

As a practical matter, the weak version of the MQD seemed to win out for a while after Brown & Williamson, and on at least one occasion, an agency did win in a major questions case. In King v. Burwell, the Internal Revenue Service (“IRS”) interpreted the Affordable Care Act to make tax credits available even if an individual purchased health insurance on a federal insurance exchange, despite statutory language that limited tax credits to plans purchased through “an Exchange established by the State.”56King v. Burwell, 576 U.S. 473, 483 (2015) (citing 77 Fed. Reg. 30378 (2012) and 26 U.S.C. §§ 36B(b)–(c)). Like in Brown & Williamson, the Court noted that there “may be reason to hesitate before concluding that Congress has intended such an implicit delegation.”57Id. at 485 (quoting FDA v. Brown & Williamson Tobacco Corp., 529 U.S. 120, 159 (2000)). Unlike in Brown & Williamson, however, the Court concluded that the agency had the power to issue the rule, even on a de novo interpretation of the statute. Although the Court’s interpretation of the statutory language at issue has been criticized,58Richard M. Re, The New Holy Trinity, 18 Green Bag 407, 408–09 (2015); Ryan D. Doerfler, The Scrivener’s Error, 110 Nw. U. L. Rev. 811, 811 (2015). the important point is that the “weak version” of the MQD—that is, an “exception,” or “carve-out” from Chevron deference—seemed to rule the day. The only open questions were about where, precisely, to locate the major questions exception: at Chevron step zero,59See Thomas W. Merrill & Kristin E. Hickman, Chevron’s Domain, 89 Geo. L.J. 833, 873 (2000); Cass R. Sunstein, Chevron Step Zero, 92 Va. L. Rev. 187, 207–11 (2006) (elucidating a “step zero” that asks whether Chevron deference even potentially applies or whether some other standard of review, such as Skidmore or de novo review, should prevail). Most observers viewed King v. Burwell as deploying the major questions exception at step zero. step one,60Most observers viewed Brown & Williamson as deploying the major questions exception at step one. or step two.61The only case to have apparently located the major questions exception at step two was Utility Air Regulatory Group v. EPA, 573 U.S. 302, 331–33 (2014).

2.  The Modern Major Questions Doctrine and Its Justification

Enter what Mila Sohoni calls the “major questions quartet.”62Sohoni, supra note 6, at 262. If it was unclear exactly which version of the MQD existed before the quartet, the waters have become only murkier afterward. One thing is unmistakably clear though: The Court did not treat the MQD as a mere exception or carve-out from Chevron deference. Instead, it “unhitched the major questions exception from Chevron.”63Id. at 263. In fact, the majority opinion in West Virginia v. EPA,64West Virginia v. EPA, 142 S. Ct. 2587, 2587–616 (2022). the leading case in the quartet, did not even mention Chevron in its elaboration or application of the MQD.65Part of the reason why Chevron was not mentioned may be because the Court is now generally hostile to the doctrine. See Lisa Schultz Bressman & Kevin M. Stack, Chevron Is a Phoenix, 74 Vand. L. Rev. 465, 466 (2021) (describing the debate about Chevron and arguing that judicial deference to agency interpretations is a foundational aspect of administrative law). As this Article went to press, the Court overruled Chevron. See Loper Bright Enters. v. Raimondo, 144 S. Ct. 2244, 2254 (2024). Instead, the Court offered an almost entirely new gloss on the doctrine:

“[I]n certain extraordinary cases, both separation of powers principles and a practical understanding of legislative intent make us ‘reluctant to read into ambiguous statutory text’ the delegation claimed to be lurking there. To convince us otherwise, something more than a merely plausible textual basis for the agency action is necessary. The agency instead must point to ‘clear congressional authorization’ for the power it claims.”66West Virginia v. EPA, 142 S. Ct. at 2609 (citation omitted).

For the vast majority of commentators, these words have been taken to suggest that the current Court, post-quartet, thinks of the MQD as a particularly powerful form of substantive canon: a clear statement rule.67Deacon & Litman, supra note 9, at 1012; Sohoni, supra note 6, at 264; Walters, supra note 12, at 480–89. On this reading—which seems similar to the implicit use of the doctrine in the Benzene Case—Congress must have spoken with unmistakable clarity in order for agencies to have the “major” power they are claiming to have been delegated. If there is any ambiguity, and even if the agency has a “plausible” basis for concluding that it has the authority under applicable statutes, the agency cannot exercise that power. Some are not convinced the MQD is a clear statement rule and view it as a weaker substantive canon that resolves ambiguity.68See, e.g., Natasha Brunstein & Donald L.R. Goodson, Unheralded and Transformative: The Test for Major Questions After West Virginia, 47 Wm. & Mary Env’t L. & Pol’y Rev. 47 (2022) (noting that the Court in West Virginia v. EPA does not refer to the MQD as a clear statement rule). Accordingly, when the MQD is applicable, any statutory ambiguities should be resolved against the agency’s assertion of power so as to vindicate “separation of powers principles.”69West Virginia v. EPA, 142 S. Ct. at 2609. In any event, a common understanding is that the MQD is driven by a normative commitment to a limited role for administrative agencies in the legal system, and perhaps by a “delegation doctrine” that insists that agencies have no power unless it is affirmatively shown that Congress has granted it to them.70See generally Jonathan H. Adler, The Delegation Doctrine, Harv. J. Pub. Pol’y: Per Curiam, Summer 2024, at 1.

The MQD is inherently controversial as a substantive canon regardless of whether it is a clear statement rule or a tiebreaker canon. Simply by virtue of being a substantive canon, the “new MQD” is in tension with textualism. As Justice Kagan, a self-avowed textualist, puts it, there is some momentum for “toss[ing] [substantive canons] all out.”71Transcript of Oral Argument at 60, Ysleta del sur Pueblo v. Texas, 142 S. Ct. 1929 (2022) (No. 20-493). As she noted in her West Virginia dissent, channeling Karl Llewelyn, “special canons like the ‘major questions doctrine’” function as “get-out-of-text-free cards.”72West Virginia v. EPA, 142 S. Ct. at 2641 (Kagan, J., dissenting). Karl Llewellyn famously purported to show that every canon can be countered by an equal and opposite canon, which he argued deprives canons of any probative force in the interpretive process. See Karl N. Llewellyn, Remarks on the Theory of Appellate Decision and the Rules or Canons About How Statutes Are to Be Construed, 3 Vand. L. Rev. 395, 401–06 (1950). Llewellyn’s famous critique, however, overstated the conflict among canons. See William N. Eskridge, Jr., Norms, Empiricism, and Canons in Statutory Interpretation, 66 U. Chi. L. Rev. 671, 679 (1999) (“The large majority of Llewellyn’s competing canonical couplets are presumptions about language and extrinsic sources, followed by qualifications to the presumptions.”). Recently, Benjamin Eidelson and Matthew Stephenson have exhaustively assessed “leading efforts to square modern textualist theory with substantive canons” and ultimately concluded that “substantive canons are generally just as incompatible with textualists’ jurisprudential commitments as they first appear.”73Eidelson & Stephenson, supra note 12, at 520–21; see also Barrett, supra note 13, at 110. This challenge would apply to a range of canons employed by the textualist Supreme Court. The Roberts Court, though textualist, often employs substantive canons. See Nina A. Mendelson, Change, Creation, and Unpredictability in Statutory Interpretation: Interpretive Canon Use in the Roberts Court’s First Decade, 117 Mich. L. Rev. 71, 141 tbl.2 (2018); Anita S. Krishnakumar, Reconsidering Substantive Canons, 84 U. Chi. L. Rev. 825, 825–26 (2017). The MQD, insofar as it is a substantive canon, would not be spared.74Eidelson & Stephenson, supra note 12, at 520–21.

Beyond these generalized concerns with substantive canons, some commentators have questioned whether the MQD satisfies basic expectations about the Court’s recognition and use of substantive canons, even assuming that they can sometimes be legitimate aids to interpretation. Simply put, the Court has not been at all clear about the source of the normative foundation of the MQD.75Biden v. Nebraska, 143 S. Ct. 2355, 2376 (2023) (Barrett, J., concurring) (noting that “there is an ongoing debate” about the MQD’s “source and status”). For Sohoni, formulating the MQD as a kind of constitutional avoidance rule fails because of the “Court’s failure to say anything about nondelegation”—a failure that “creates genuine conceptual uncertainty about what exactly it was doing in these cases.”76Sohoni, supra note 6, at 297. The currently prevailing nondelegation test asks merely whether Congress has provided a “reasonably intelligible policy” to guide an agency’s exercise of discretion.77A.L.A. Schechter Poultry Corp. v. United States, 295 U.S. 495, 500 (1935). That test would not have provided anywhere close to a “significant risk” of constitutional invalidity in any of the statutes examined in the major questions quartet.78Significant risk is required under the modern form of the constitutional avoidance doctrine. Although Justice Gorsuch in his concurrence in West Virginia v. EPA suggested that the MQD is inspired by the nondelegation doctrine (and probably his preferred version of the nondelegation doctrine, which is not the law currently), the majority pointed more generally to “separation of powers principles.”79West Virginia v. EPA, 142 S. Ct. 2587, 2609 (2022). Some have inferred that the Supreme Court might be interested in developing constitutional principles demanding affirmative proof of delegation in certain circumstances—and that the MQD reflects this implicit constitutional project80Sohoni, supra note 6, at 312–13; Adler, supra note 70, at 6.––but if so the Court has not been explicit. This uncertainty about the connection between constitutional principles and the MQD also seems to doom the MQD under Justice Barrett’s own test for the legitimacy of substantive canons within textualism, under which there must be a reasonably specific constitutional principle to which a constitutionally inspired substantive canon attaches.81Barrett, supra note 13, at 178. In other words, if the MQD is a substantive canon, its substance, or normative content, is not clear. Most substantive canons either reflect a broad societal consensus or are tied closely to constitutional law. The MQD at first glance has neither of these attributes.

3.  The Modern Major Questions Doctrine’s Linguistic Turn

Perhaps not surprisingly, given the strong pushback that the MQD has received when it is formulated as a substantive canon, defenders of the MQD are increasingly suggesting that the MQD is not a substantive canon at all. Instead, proponents suggest it is a linguistic canon.

This rebranding is not as far-fetched as it might seem at first. “ ‘[L]inguistic’ validity and ‘substantive’ value are properties of canons.”82Slocum & Tobia, supra note 37, at 73. The standard dichotomy between “linguistic” and “substantive” canons suggests that a canon has at most one property; but, it is conceptually possible for a canon to have both.83Id. There is evidence that some canons that have long been treated as “substantive canons”—such as anti-retroactivity and anti-extraterritoriality—are also consistent with how ordinary people understand rules. For example, when a rule (especially a punitive rule) does not explicitly state whether it applies retroactively, prospectively, or both, people tend to understand it to apply only prospectively.84Id. at 82–83. Insofar as textualism is guided by ordinary understanding of language,85See, e.g., Barrett, supra note 17, at 2194. textualists have good reason to consider such “substantive” canons as simultaneously linguistic ones. Even some tough critics of substantive canons like Eidelson and Stephenson show some openness to these arguments: “[T]he textualist’s reasonable reader . . . opens the door to recasting some seemingly substantive canons as simply default inferences that a reasonable reader would draw . . . . The presumption against extraterritoriality is a possible example.”86Eidelson & Stephenson, supra note 12, at 539.

Could a similar linguistic argument support the MQD? Acknowledging that criticisms of the MQD as a substantive canon “are, to some if not a large extent, warranted,”87Wurman, supra note 8, at 912. Professor Ilan Wurman recently rebranded the MQD as a linguistic canon.88Wurman, supra note 8, at 916 (“On this conceptualization, the importance of a purported grant of authority would operate as a kind of linguistic canon: ordinarily, lawmakers and private parties tend to speak clearly, and interpreters tend to expect clarity, when those lawmakers or parties authorize others to make important decisions on their behalf.”). Wurman argues that the MQD could be understood as motivated by a theory of linguistic usage about how interpretive uncertainty should be resolved rather than as importation of substantive or normative values into the interpretive enterprise. He appeals to prior work in philosophy and legal philosophy, which argues that “high-stakes” contexts lead to less knowledge or legal clarity.89See infra Section I.B.

Even more recently, Justice Barrett has proposed her own, separate linguistic argument for the MQD’s legitimacy. The Supreme Court has made the major questions quartet a quintet with its decision in Biden v. Nebraska. That case concerned President Biden’s 2022 proposal to forgive $10,000 to $20,000 in student loans for low to middle-income borrowers. Biden’s Department of Education traced the authority for their emergency loan relief to the HEROES Act, a 2001 law that grants the U.S. Secretary of Education the ability to “waive or modify” provisions related to federal student loans “in connection with a war or other military operation or national emergency.”9020 U.S.C. § 1098bb(a)(1). After Biden announced his administration’s loan forgiveness program as a response to the COVID-19 national emergency, several states challenged the program. That case reached the Supreme Court and divided the Justices 6–3 along conservative-liberal lines. Justice Roberts’s majority opinion proceeded with traditional textual interpretation, concluding that the government’s student loan relief is not within the statutory meaning of “waive or modify” any provision. But the opinion also referenced the major questions doctrine, as an alternative ground for the holding.

Justice Barrett wrote separately to argue that the MQD is not a substantive canon but rather “a tool for discerning—not departing from—the text’s most natural interpretation.”91Biden v. Nebraska, 143 S. Ct. 2355, 2376 (2023) (Barrett, J., concurring). Candidly, and consistently with her prior writings on substantive canons,92See Barrett, supra note 13, at 110. Justice Barrett conceded that the substantive canon version of the MQD might be “inconsistent with textualism” and therefore “should give a textualist pause.”93Biden v. Nebraska, 143 S. Ct. at 2376 (Barrett, J., concurring). By grounding the MQD in how ordinary readers apply common sense in reading statutory text, Justice Barrett aims to put the MQD on more solid footing, particularly for textualists.

After the opinion, some suggested that Justice Barrett’s argument “mirrors” Wurman’s.94See, e.g., Baumann, supra note 21. We disagree: the two arguments both present the MQD as a linguistic canon, but the arguments are distinct. Wurman appeals to high-stakes context and the resolution of interpretive uncertainty, while Barrett appeals to anti-literalism and contextual restriction concerning major actions (with nothing about high stakes). Thus, Wurman’s argument centers on “ambiguity” caused by high stakes, whereas Justice Barrett’s theory is about how ordinary people generally use “common sense” to interpret non-literally (with no mention of “ambiguity”). The next two Sections separately reconstruct Wurman’s (I.B) and Justice Barrett’s (I.C) linguistic arguments in detail and present some theoretical challenges for each.

B.  The Major Questions Doctrine as a High-Stakes Linguistic Canon

One important line of work defending the “linguistic” MQD appeals to the philosophical and legal-philosophical literature on stakes and knowledge.95Wurman, supra note 8, at 957–61 (appealing to Doerfler, supra note 25). That theoretical literature proposes that knowledge is sensitive to high stakes: it could be true that one knows a proposition in a low-stakes context (for example, the bank is open) but does not know that proposition, given the same evidence, in a high-stakes context.

The legal literature about stakes and interpretation, including the linguistic MQD defense, takes this claim about knowledge to be important. But the relationship between knowledge and legal interpretation is not entirely clear. Roughly, the argument goes as follows: we are less likely to know a proposition when the practical stakes of its truth are raised, and similarly, we are less likely to assess that a text is clear when the practical stakes of its meaning are raised.96E.g., Doerfler, supra note 25, at 523.

The linguistic defense of the MQD is clearly based in part on this philosophical literature about stakes and knowledge. Before interrogating the full argument, however, we must spell it out. Here we attempt to reconstruct the defense.

1.  Reconstruction of the “High Stakes” Linguistic Defense of the Major Questions Doctrine

(1)  [Empirical Premise 1: Stakes-Sensitive Knowledge]: The ordinary reader’s knowledge is sensitive to high stakes.97Wurman, supra note 8, at 957 (“[O]rdinary speakers attribute ‘knowledge’—and, in turn, ‘clarity’—more freely or less freely depending upon the practical stakes.” (quoting Doerfler, supra note 25, at 527)).

(2)  [Empirical Premise 2: Stakes-Sensitive Clarity]: The ordinary reader’s understanding of textual clarity is sensitive to high stakes.98Id. at 959 (“[O]rdinary readers and speakers are more likely to find the statute ambiguous in that [high-stakes] context than in a relatively lower-stakes context.”); see also id. at 917 (appealing to “how ordinary persons interpret instructions in high-stakes contexts”).

(3)  [Definition: MQD Case]: In a MQD case, the agency’s statutory powers are defined in linguistic terms that are semantically clear but highly general. The agency is exercising “vast powers” of great economic/political significance and pointing to the statutory language as authorization.99See id. at 911 (summarizing the MQD as the idea that “[c]ourts should have ‘skepticism’ when statutes appear to delegate to agencies questions of major political and economic significance, which skepticism the government can only overcome ‘under the major questions doctrine’ by ‘point[ing] to “clear congressional authorization” to regulate in that manner’ ” (citation omitted)).

(4)  [Premise]: MQD cases involve a high-stakes context.100Although Wurman, supra note 8, never explicitly describes the MQD cases as “high stakes,” we assume this premise is uncontroversial as part of a reconstruction of the argument. If they did not involve a high-stakes context, none of the arguments would be relevant.

(5)  [Textualist Premise]: Judges should interpret statutory language from the perspective of the ordinary reader.

(6)  [Minor Conclusion, from 1, 2, 3, 4, 5]: In a MQD case, the text is unclear.

(7)  [Premise]: If a text is unclear with respect to authorizing an agency’s action, it does not authorize that action.

(8)  [Major Conclusion, from 6, 7]: In a MQD case, the agency’s action is not authorized.

Attempting to construct the argument fully and precisely reveals several interesting features and questions. First, consider the two “Empirical Premises” (1 and 2). It is unclear exactly what function the first Empirical Premise (about knowledge) serves. It is included in the argument above because it features repeatedly and centrally in Wurman’s (and Doerfler’s) scholarship on high stakes, but even if that Premise were false, Premise 2 alone could support the argument.

Why, then, does the “high-stakes” literature emphasize knowledge in addition to textual clarity? Perhaps because there is little data bearing on the truth of Premise 2, but there is rich, decades-old philosophical literature that seemingly supports Premise 1.101E.g., Stewart Cohen, Contextualism, Skepticism, and the Structure of Reasons, 13 Phil. Persps. 57, 57 (1999); DeRose, supra note 24, at 913–18. As such, we understand the legal literature to be using Premise 1 as support for Premise 2: philosophers have concluded that knowledge is stakes sensitive, and this conclusion supports also concluding that textual clarity is stakes sensitive.

In Part III, we investigate the stakes-knowledge-clarity relationship empirically, but here we note some initial skepticism about the inference from knowledge to clarity. Law includes technical language,102Frederick Schauer, Is Law a Technical Language?, 52 San Diego L. Rev. 501, 501 (2015). and as such, many ordinary people do not have direct knowledge of a law’s meaning. Nevertheless, this does not imply that a particular law is unclear, in the sense of being unclear to a legal expert or inherently indeterminate. Recent empirical work supports this point: ordinary readers understand law to include technical legal meanings, and they defer to legal experts to elaborate those meanings.103Kevin Tobia, Brian G. Slocum & Victoria Nourse, Ordinary Meaning and Ordinary People, 171 U. Pa. L. Rev. 365, 365 (2023) [hereinafter Tobia et al., Ordinary People]. The mere fact that laypeople do not know the meaning of a law without further inquiry or assistance strikes us as an implausible basis for judges to treat the law as ambiguous or unclear.

Moreover, the “Minor Conclusion” (6) only follows with a very strong interpretation of the meaning of “sensitive to high stakes” (1) and (2). To conclude that “general” statutory language is unclear because of ordinary sensitivity to a high-stakes context, one must interpret (2) to mean that a high-stakes context eliminates clarity.

Wurman describes the MQD as limited to “resolving statutory ambiguities.”104Wurman, supra note 8, at 940–41. This is a common way to describe a “tiebreaker” canon. We ultimately find this confusing insofar as Wurman also presents the MQD as a linguistic canon, a rule of thumb that is evidence of linguistic meaning. If “ambiguity” refers to linguistic ambiguity, an applicable “linguistic” canon would render the statute non-ambiguous. For example, in Lockhart the Court faced a linguistic ambiguity.105Lockhart v. United States, 577 U.S. 347, 361 (2016). Lockhart was convicted under 18 U.S.C. § 2252(a) and faced a mandatory minimum due to an earlier conviction. The penalty increased if the defendant had a prior conviction “under the laws of any State relating to aggravated sexual abuse, sexual abuse, or abusive sexual conduct involving a minor or ward.”10618 U.S.C. § 2252(b)(1). That final modifier (involving a minor or ward) could modify all three noun phrases (aggravated sexual abuse, sexual abuse, and abusive sexual conduct) or just the last (abusive sexual conduct). The series qualifier canon instructs us to apply the modifier to all three noun phrases. The determination that the series qualifier canon applies qua linguistic canon is a decision that the linguistic meaning of the provision is determinate and has a specific meaning, not that it is ambiguous. If ambiguity persists—for example, if there is a competing linguistic canon that counsels in favor of the opposite interpretation—the Court might resolve ambiguity with some non-linguistic consideration, such as the rule of lenity.

Alternatively, perhaps the argument is that the MQD is “linguistic” in the sense that it represents how ordinary people believe that ambiguity should be resolved, and thus how ordinary people would choose to resolve disputes in MQD cases. But that would be an unusual sense of “linguistic.” Existing linguistic canons help determine the linguistic meaning of a provision; they do not enter the interpretive process after that meaning has been concluded to be indeterminate.

This might all seem pedantic, but it highlights a problem with this linguistic defense of the MQD. We have done our best to explain the argument in a clear form, but we are unsure that there is even a workable argument for the “high stakes” linguistic MQD that arrives at the Major Conclusion (8).

Beyond this general issue (that the logic of the argument itself is unclear), several of the premises are open to debate. For example, perhaps some of the Court’s major questions cases do not involve high stakes or sufficiently high stakes (Premise 4).107See Deacon & Litman, supra note 9, at 1009–10 (discussing and critiquing the Court’s criteria of majorness); Natasha Brunstein & Richard L. Revesz, Mangling the Major Questions Doctrine, 74 Admin. L. Rev. 217, 219 (2022) (discussing how the Trump Administration distorted the majorness determination by invoking the doctrine “enormously expansively and inconsistently”). Premise 7 is also controversial: just because a text’s meaning is unclear does not necessarily imply that it should be interpreted against an agency delegation (perhaps instead, it should be interpreted with a presumption of judicial nonintervention).108Wurman acknowledges that this is a contestable claim. See Wurman, supra note 8, at 958 (noting that Doerfler views the question as whether judges should “demand more epistemic confidence” before overturning an expert agency’s interpretation). But Wurman suggests that “the legal system already contingently addresses this question” by presumptively disallowing agency action unless agencies “demonstrate authority for their actions” and thus satisfy their “burden of proof.” Id. at 960. Note the connection here to the theory of exclusive delegation, which is a nascent substantive grounding for the canon, not a linguistic one. See supra note 80 and accompanying text.

Nevertheless, most of our attention in this Article is on the two Empirical Premises, 1 and 2. Whatever the argument is, it is clear that these two premises are central: the “high-stakes” argument repeatedly appeals to these claims.109See Wurman supra note 8, at 954–55. If these premises—and especially the second premise—are empirically invalid, the entire argument is a nonstarter. Part II of this Article presents evidence bearing on Premise 1, and Part III presents original empirical studies bearing on both Premise 1 and Premise 2. To preview the findings, (1) although academic philosophers have long assumed that higher stakes reduce knowledge, many studies find that stakes have no effect on ordinary people’s knowledge attributions;110See infra Part II. (2) we find a very small effect of stakes on knowledge (far from sufficient to conclude that “the ordinary reader” is stakes-sensitive about knowledge), and no effect of stakes on linguistic clarity.111See infra Section III.A.

C.  The Major Questions Doctrine as an Anti-Literal Linguistic Canon

A second argument for the “linguistic” MQD surfaced in summer 2023. Justice Barrett’s concurrence in Biden v. Nebraska proposes that the MQD has a linguistic basis in ordinary people’s anti-literalism and sensitivity to context.

The crux of the argument is an appeal to the predicted reaction of ordinary people to everyday situations, such as Justice Barrett’s “babysitter” hypothetical:

Consider a parent who hires a babysitter to watch her young children over the weekend. As she walks out the door, the parent hands the babysitter her credit card and says: “Make sure the kids have fun.” Emboldened, the babysitter takes the kids on a road trip to an amusement park, where they spend two days on rollercoasters and one night in a hotel. Was the babysitter’s trip consistent with the parent’s instruction? Maybe in a literal sense, because the instruction was open-ended. But was the trip consistent with a reasonable understanding of the parent’s instruction? Highly doubtful. In the normal course, permission to spend money on fun authorizes a babysitter to take children to the local ice cream parlor or movie theater, not on a multiday excursion to an out-of-town amusement park. If a parent were willing to greenlight a trip that big, we would expect much more clarity than a general instruction to “make sure the kids have fun.”112Biden v. Nebraska, 143 S. Ct. 2355, 2379–80 (2023) (Barrett, J., concurring) (emphasis omitted).

Justice Barrett explains that additional context could make a difference, including (1) “maybe the parent left tickets to the amusement park on the counter,” (2) “[p]erhaps the parent showed the babysitter where the suitcases are, in the event that she took the children somewhere overnight,” (3) “maybe the parent mentioned that she had budgeted $2,000 for weekend entertainment,” (4) the “babysitter had taken the children on such trips before,” or (5) “if the babysitter were a grandparent.”113Id. at 2380. Notably, not all of these are additions to the text of the statement. We are sympathetic to this view of non-text-based context, but it is arguably a significant departure from traditional text-focused textualism.114See infra Section IV.B.

Moreover, Justice Barrett argues that the babysitter hypothetical illustrates how “we communicate conversationally” and that the MQD merely represents “common sense” in a different context:

In my view, the major questions doctrine grows out of these same commonsense principles of communication. Just as we would expect a parent to give more than a general instruction if she intended to authorize a babysitter-led getaway, we also “expect Congress to speak clearly if it wishes to assign to an agency decisions of vast ‘economic and political significance.’ ” That clarity may come from specific words in the statute, but context can also do the trick. Surrounding circumstances, whether contained within the statutory scheme or external to it, can narrow or broaden the scope of a delegation to an agency.115Biden v. Nebraska, 143 S. Ct. at 2380 (Barrett, J., concurring) (citation omitted).

This justification coheres with Justice Barrett’s “ordinary speaker” approach to interpretation. In Congressional Insiders and Outsiders, Justice Barrett argues that judges should approach language “from the perspective of an ordinary English speaker—a congressional outsider.”116Barrett, supra note 17, at 2194. This generally requires avoiding insider knowledge about Congress: “What matters to the textualist is how the ordinary English speaker—one unacquainted with the peculiarities of the legislative process—would understand the words of a statute.”117Id.

While Justice Barrett’s babysitter example is intriguing, it is not immediately clear how it supports the MQD. A skeptic might read the babysitter-to-MQD argument as committing a “motte” and “bailey” fallacy, conflating one position that is very easy to defend (the motte) with one much harder to defend (the bailey). It is undeniable that context influences interpretation and it would not be surprising that ordinary people are more confident in delegation of power with additional supporting contextual evidence. If the babysitter had previously taken the children on trips ((4) from above) or the agency had a longstanding practice of developing new programs, that context would often make readers equally or more confident that a text delegating authority to that agent encompasses similar action.

But this observation (that context can lend further support to particular actions taken pursuant to a delegation) does not justify the MQD. Justice Barrett’s key claim about ordinary language is much stronger, something like: ordinary people understand general delegations to X to be limited to only the most reasonable ways to X, absent further textual or contextual support for X. Recall Justice Barrett’s argument about the babysitter’s trip: “But was the trip consistent with a reasonable understanding of the parent’s instruction? Highly doubtful.”118Biden v. Nebraska, 143 S. Ct. at 2379–80 (Barrett, J., concurring). The central claim in the strong form of Justice Barrett’s argument is not merely that context matters but that absent supporting context, ordinary delegations are limited to the set of most reasonable applications of the instruction.

To appeal to the “motte” claim in support of the “bailey” claim is to trade an obvious fact about context to support a highly controversial claim about intuitive understanding of delegations. We do not, however, read Justice Barrett to make such a slippery move. There is a more charitable way to read her concurrence (that is, relying on the stronger key claim). This reading relies on an interesting and empirically testable question: When a text delegates an agent the power to X with general language, do people intuitively understand the delegation to be limited to only the set of the most reasonable/natural ways to X, or do they understand the delegation more broadly (even if not entirely literally)? For example, when a parent instructs a babysitter to “use this credit card to make sure the kids have fun this weekend,” does that authorize only the most reasonable actions (for example, ordering pizza, ordering a movie), or does it also authorize some actions that would be understood as less reasonable (for example, taking the kids to an amusement park)? Similarly, when Congress delegates to an agency, is the agency limited to only the set of most reasonable understandings (absent supporting context), or do people understand delegations to communicate a broader (if not quite literal) authorization?

Justice Barrett’s “linguistic defense” of the MQD leaves some questions open—the quotations above capture the bulk of the defense. Our formal reconstruction of the arguments follows.

1.  Reconstruction of the “Anti-Literalism” Defense of the Major Questions Doctrine

(1)  [Definition: Ordinary Majorness]: For a given rule, an action is “major” if the ordinary reader understands it, absent additional context, as not among the set of most reasonable ways to follow the rule.119A “major” action may be consistent with the rule’s literal meaning. The appeal to “reasonableness” generates an interesting feature of this definition: an action could be “major” in the sense of exceeding the reasonable set of actions or subceeding it. For example, imagine the babysitter responds by choosing to simply play board games with the kids, without using the credit card. It is possible that this is not among the most reasonable ways to follow the rule.

(2)  [Definition: MQD Case]: In a MQD case, the agency’s statutory powers are defined in linguistic terms that are semantically clear but highly general. The agency is exercising “vast powers” of great economic/political significance and pointing to the statutory language as authorization.

(3)  [Empirical Premise: MQD Cases Involve Ordinary Majorness] The ordinary reader takes MQD cases to involve a “major” action (for example, in the MQD cases, the ordinary reader takes the contested action, absent additional context, as not among the most reasonable ways to follow the rule).

(4)  [Textualist Premise]: Judges should interpret statutory language from the perspective of the ordinary reader.120Some textualists might adopt a weaker premise: “In interpreting statutes, judges should employ some principles that guide the ordinary reader, some principles that guide an idealized or informed reader (for example, ‘reasonable reader’), and some principles that guide the expert reader (for example, ‘ordinary lawyer’).” Insofar as Justice Barrett’s linguistic MQD argument adopts something like this weaker premise, the argument only goes through if the weaker premise is supplemented with a further premise: “In MQD cases, textualists should employ the principles that guide an ordinary reader’s understanding of delegations of authority to agents.” Justice Barrett’s MQD argument relies heavily on her ordinary babysitter example, suggesting that—at least for the purpose of major questions cases—judges’ approach to language should include the ordinary reader’s understanding of delegations (including how the literal meaning of a delegation is restricted by context). For simplicity, our main argument uses the simpler but stronger premise, but it could also use the weaker (but more complicated) pair of premises.

This weaker premise also reveals hard questions for textualists, which are beyond the scope of this Article: When, exactly, should a textualist adopt one or other of these perspectives and principles? We are skeptical about textualists that freely shift among these perspectives, with no guiding principles. Justice Barrett herself has not clearly answered this question, sometimes treating the ordinary reader as the lodestar for interpretation and other times pointing to legally trained readers. See Barrett, supra note 17, at 2202. A defense of the MQD on the grounds that it reflects lawyerly training is arguably more substantive than linguistic, and more circular than logical, but we do not purport to address this defense of the MQD in this Article. See also Tobia et al., Ordinary People, supra note 103, at 432–34 (arguing that standards like “appropriately informed interpreter” are more normative than descriptive).

(5)  [Empirical Premise]: Absent additional context, the ordinary reader understands rules that delegate power to an agent to have significant contextual limitations against all “major” actions; such a rule’s communicative content is limited to authorizing only the set of most reasonable actions.

(6)  [Conclusion]: In MQD cases, absent additional context, judges interpreting delegations should interpret delegations to exclude all major actions.

II.  PHILOSOPHICAL AND EMPIRICAL BACKGROUND

The previous Part introduced the two linguistic MQD arguments, one concerning high stakes and one concerning anti-literalism. This Part provides background from philosophy and empirical studies related to these arguments.

Some of the questions at the heart of the “high-stakes” MQD defense have been long debated by epistemologists (philosophers who specialize in the study of knowledge). More recently, the same questions have been studied empirically by psychologists and experimental philosophers.121See generally A Companion to Experimental Philosophy (Justin Sytsma & Wesley Buckwalter eds., 2016) (for an overview of experimental philosophy). Much of this work challenges a premise in the high-stakes MQD argument: although philosophers have claimed high stakes impact knowledge, high stakes have (at most) a small effect on ordinary judgments of knowledge. Section II.A reviews this research.

Section II.B provides background related to Justice Barrett’s claims about context and anti-literalism. Context matters in interpretation, and recent research has found that ordinary people understand law in line with anti-literalism, as Justice Barrett notes. However, there is no extant research that supports the stronger empirical premise in the anti-literalism argument.

A.  Stakes and Knowledge

1.  Philosophical Epistemology of Stakes and Knowledge

For decades, philosophers have evaluated stakes’ impact on knowledge with hypothetical “thought experiments.”122See, e.g., Cohen, supra note 101, at 58–60. Consider a pair of cases as an example.123Rose et al., supra note 34, at 237–39 (for a discussion of this version of the case); see also DeRose, supra note 24, at 913–16. The only differences between cases are highlighted in italics.

(1)  Low-Stakes Bank Deposit:

Bob and Jane are considering whether to stop at the bank to deposit a check on a Friday. Nothing turns on whether they deposit the check in the next week. The line is long, and they consider coming back on Saturday. Bob says that he remembers that the bank was open last Saturday, and Jane replies that banks sometimes change their hours. Bob says, “I know the bank will be open tomorrow.”

In this case, many philosophers claim that Bob knows that the bank will be open tomorrow.124Keith DeRose, The Ordinary Language Basis for Contextualism, and the New Invariantism, 55 Phil. Q. 172, 176 (explaining that “almost any speaker in my situation would claim to know the bank is open on Saturdays” in this low stakes case). Now consider a slight variation on this case.

(2)  High-Stakes Bank Deposit:

Bob and Jane are considering whether to stop at the bank to deposit a check on a Friday. It is critical that the check is deposited on one of the next two days. On Sunday, there will be a large debit to Bob’s account, which does not currently have enough funds, and the check is Bob’s only means to cover that expense. The line is long, and they consider coming back on Saturday. Bob says that he remembers that the bank was open last Saturday, and Jane replies that banks sometimes change their hours. Bob says, “I know the bank will be open tomorrow.”

In this case, philosophers say that Bob’s statement is false.125Id. at 177 (“Almost everyone will accept [‘I don’t know if the bank is open’] as a reasonable admission, and it will seem true to almost everyone.”). He does not know the bank will be open tomorrow.

The epistemology literature has taken philosophers’ shared reactions to these cases as intuitive data. And philosophers have offered different theories to make sense of that data. These are rich and complicated philosophical debates, which this Article does not have the space to rehearse or explore deeply.126For example, “contextualism” holds that “to know” is context sensitive, such that the truth conditions for knowledge attributions vary across contexts. Cohen, supra note 101, at 57; DeRose, supra note 24, at 914; see also Keith DeRose, Solving the Skeptical Problem, 104 Phil. Rev. 1, 4–5 (1995). “Interest-relative invariantism” (“IRI”) rejects the claim that knowledge is context sensitive; instead, IRI holds that practical factors impact whether knowledge obtains. Jason Stanley, Knowledge and Practical Interests 85–89 (2005). Our principal interest is in how this work has informed recent debates in legal philosophy.

Legal-philosophical scholarship has drawn on this work in epistemology in support of the claim that high-stakes legal interpretation differs from lower-stakes interpretation. Ryan Doerfler suggests that high-stakes contexts influence textual clarity,127Doerfler, supra note 25, at 523; see also William Baude & Ryan D. Doerfler, The (Not So) Plain Meaning Rule, 84 U. Chi. L. Rev. 539, 546–47 (2017). and Wurman piggybacks on this premise to argue that stakes sensitivity supports the MQD.128Wurman, supra note 8, at 957–61. Importantly, these legal applications appeal to “ordinary speakers129Doerfler, supra note 25, at 523, 542. and “ordinary epistemic justification,” especially reactions to the bank cases described above.130Id. at 575. A starting premise is that, for ordinary speakers of ordinary language, stakes impact knowledge; this is typically illustrated by the low- and high-stakes bank example.

2.  Do Stakes Impact Knowledge? Empirical Perspectives

Despite the pedigree of the stakes-knowledge literature, there is one big problem: many empirical studies report that stakes have no effect on ordinary attributions of knowledge. As Joshua Knobe & Jonathan Schaffer explain, “[l]ooking at this recent evidence, it is easy to come away with the feeling that the whole contextualism debate was founded on a myth. The various sides offered conflicting explanations for a certain pattern of [stakes-sensitive] intuitions, but the empirical evidence suggests that this pattern of intuitions does not exist.”131Schaffer & Knobe, supra note 34, at 675–76.

Much of this evidence comes from “experimental philosophy.” Rather than relying on the intuitions of philosophers (some of whom might have a lot at stake in intuitions about contextualism), experimental philosophers examine the understandings of ordinary people. Moreover, they often conduct experiments, which present different participants with different versions of the same scenarios, varying in only one respect (for example, higher stakes). This allows experimenters to draw inferences about whether certain factors (for example, stakes) affect people’s judgments in these cases. Some readers may be familiar with experimental philosophy’s testing of the well-known “trolley dilemma.”132See, e.g., Joshua D. Greene, R. Brian Sommerville, Leigh E. Nystrom, John M. Darley & Jonathan D. Cohen, An fMRI Investigation of Emotional Engagement in Moral Judgment, 293 Sci. 2105, 2105 (2001). Many have also poured substantial effort into testing the influence of stakes on knowledge, especially in the “bank cases.”

Do stakes affect lay attributions of knowledge? Many studies report no.133Adam Feltz & Chris Zarpentine, Do You Know More When It Matters Less?, 23 Phil. Psych. 683, 697 (2010); Wesley Buckwalter, Knowledge Isn’t Closed on Saturday: A Study in Ordinary Language, 1 Rev. Phil. & Psych. 395, 403 (2010); Wesley Buckwalter & Jonathan Schaffer, Knowledge, Stakes, and Mistakes, 49 Noûs 201, 228 (2015); Rose et al., supra note 34, at 245; Kathryn B. Francis, Philip Beaman & Nat Hansen, Stakes, Scales, and Skepticism, 6 Ergo 427, 450–52 (2019); Joshua May, Walter Sinnott-Armstrong, Jay G. Hull & Aaron Zimmerman, Practical Interests, Relevant Alternatives, and Knowledge Attributions: An Empirical Study, 1 Rev. Phil. & Psych 265, 272–73 (2010). As one important example, consider the study conducted by David Rose and other contributing authors. They gave participants versions of the bank case described at the start of this Section. They collected data from over 3,500 participants across 16 countries. The vast majority of countries show no significant effect, and for the few that show an effect, the size is very small (about a 10% difference in low- versus high-stakes cases). The researchers conclude that, overall, there is “virtually no evidence that stakes affect knowledge attribution.”134Rose et al., supra note 34, at 233.

Other papers report a complicated pattern for other epistemic notions besides knowledge. For example, Mark Phelan finds no effect of stakes on judgments about how (epistemically) confident someone should be in a between-subjects study, but he finds an effect in a within-subjects study (when the same participant considered matched cases).135Mark Phelan, Evidence that Stakes Don’t Matter for Evidence, 27 Phil. Psych. 488, 505 (2014); see also May et al., supra note 133, at 272 (reporting an effect of stakes on confidence but not knowledge).

Other studies report stakes effects for more complicated (and perhaps controversial) measures of knowledge. As an example, consider Alexander Dinges and Julia Zakkou’s study.136Alexander Dinges & Julia Zakkou, Much at Stake in Knowledge, 36 Mind & Language 729, 746 (2021). For another recent example, see generally Brian Porter, Kelli Barr, Abdellatif Bencherifa, Wesley Buckwalter, Yasuo Deguchi, Emanuele Fabiano, Takaaki Hashimoto, Julia Halamova, Joshua Homan, Kaori Karasawa, Martin Kanovsky, Hackjin Kim, Jordan Kiper, Minha Lee, Xiaofei Liu, Veli Mitova, Rukmini Bhaya, Ljiljana Pantovic, Pablo Quintanilla, Josien Reijer, Pedro Romero, Purmina Singh, Salma Tber, Daniel Wilkenfeld, Stephen Stich, Clark Barrett & Edouard Machery, A Puzzle About Knowledge Ascriptions, Noûs: Early View, July 4, 2024, at 1, available at https://onlinelibrary.wiley.com/doi/10.1111/nous.12515?af=R (finding no effect for questions like “[name] knows/only thinks he knows that [. . .],” but an effect for questions like “how many times do you think [name] has to check the logs before he knows [. . .]”). The weight of current evidence suggests that there is a small or no effect of stakes on knowledge attribution, but there is an effect of stakes on these other measures, such as questions about whether you “stand by” your claim or whether you should “check” your evidence more times. This study instructed participants to consider a scenario in one of three versions. All scenarios began with the following:

Picture yourself in the following scenario:

You and Hannah have been writing a joint paper for an English class. You have agreed to proofread the paper. You’ve carefully proofread the paper 3 times and used a dictionary if necessary. You spotted and corrected a few typos, but you didn’t find any typos in the last round anymore.

You meet up with Hannah to finally submit the paper. Hannah asks whether you think there are no typos in the paper anymore. You respond:

“I know there are no typos anymore.”

At this point, . . .

Then, the scenarios proceeded in either a “neutral,” “stakes,” or “evidence” version. The “stakes” manipulation sought to change the practical significance of the knowledge claim, while the “evidence” manipulation sought to change the evidence base on which the knowledge claim rests.

Neutral: . . . Hannah reveals to you for the first time that she’s always been a big fan of the Backstreet Boys. You’ve never liked the Backstreet Boys, but since you like Hannah, you promise to listen to a few songs she particularly recommends. You doubt that it will change your mind but agree that it doesn’t hurt to give it a try. As you’re about to submit the paper, Hannah asks whether you stand by your previous claim that you know there are no typos in the paper. You respond:

Stakes: . . . Hannah reveals to you for the first time that it is extremely important for her to get an A in the English class. Her scholarship depends on it, and she’ll have to leave college if she loses the scholarship. If there is a typo left in the paper, she’s very unlikely to get an A, so it is extremely important to her that there are no typos in the paper. As you’re about to submit the paper, Hannah asks whether you stand by your previous claim that you know there are no typos in the paper. You respond:

Evidence: . . . Hannah reveals to you for the first time that she’s secretly read your previous term papers and always spotted lots of typos in them even when you said you had carefully proofread them. She apologizes for not telling you earlier. You are slightly disappointed but forgive her. Hannah is a good friend, and you appreciate that she was honest with you in the end. As you’re about to submit the paper, Hannah asks whether you stand by your previous claim that you know there are no typos in the paper. You respond:

All scenarios ended with: “I do” or “I don’t,” asking participants to pick the response they would be more likely to give.

Using this “stand by” question, the researchers found a difference. In the “Neutral” version, 94% of participants stood by their knowledge claim (“I do”); in the “Stakes” version, 76% of participants stood by; and in the “Evidence” version, 42% stood by. The researchers found similar results in a bank case. The Neutral-Stakes difference suggests that stakes can impact knowledge attributions. The Stakes-Evidence difference indicates that other factors (for example, an attributor’s evidence base) also matter and can have a larger effect than stakes. This difference (76% versus 42%) is one of the larger differences reported in the literature.137Dinges & Zakkou, supra note 136, at 735.

It is not clear if agreement with “standing by” a claim is equivalent to agreement with knowledge of a claim. To “stand by” a claim calls to mind the action associated with the claim (that is, going to the bank today or not). From a cost-benefit perspective, stakes are relevant to action. The rising expected cost of failing to act in light of a possible bank closure or paper typo is relevant to a rational actor’s decision-making. Arguably, some of the observed small impacts of stakes on lay attributions of knowledge could be reflecting lay participants’ actionability judgments: in the high-stakes context, Bob’s knowledge has not changed, but whether he should go to the bank has changed.

Overall, the evidence is mixed concerning whether stakes impact ordinary knowledge attributions. Historically, many philosophers had stakes-sensitive knowledge intuitions, predicted that others would, and developed complex theories about those effects.138See, e.g., DeRose, supra note 24, at 913–18. Yet, a large number of empirical studies of thousands of ordinary participants, across many languages and cultures, have found no impact of stakes, or only a very small effect, on knowledge.139See Schaffer & Knobe, supra note 34, at 703. Very recently, one new study has reignited the debate, finding some support for the impact of stakes on epistemological judgments.140See Dinges & Zakkou, supra note 136, at 729. Another forthcoming paper also adopts a nuanced position that normative facts influence knowledge. See N. Ángel Pinillos, Bank Cases, Stakes and Normative Facts, in 5 Oxford Studies in Experimental Philosophy 375 (Joshua Knobe & Shaun Nichols eds., 2024). Yet, another recent study reports that stakes do not affect judgments about knowledge141Su Wu, Are Folks Purists or Pragmatic Encroachers? New Discoveries of Relation Between Knowledge and Action from Experimental Philosophy, Episteme 1, 11 (2023) (studying Chinese participants). but do affect judgment about action.142Id. at 12. In total, there is evidence pointing in both directions. Resolving the debate will require further empirical research as well as systematic theorizing of the seemingly conflicting empirical results.

Consequently, it remains far from settled that high stakes reduce knowledge for “the ordinary person.” Most studies have found that stakes do not impact knowledge in this way. And even for the studies that do report an effect, it is small. If 95% of participants evaluate that there is knowledge in a low-stakes case, and 80% evaluate that there is knowledge in a comparable high-stakes case, does this imply that the “ordinary person” has stakes-sensitive knowledge intuitions? Advocates of ordinary stakes sensitivity need to spell out why stakes-sensitivity manifesting in 10–15% of ordinary participants implies that the ordinary reader has stakes-sensitive knowledge.

The claim that high stakes impact knowledge figures prominently in the argument for a high-stakes linguistic MQD.143See supra Section II.A. Extant legal literature has drawn heavily on this claim in supporting that “high-stakes” interpretation differs from lower-stakes interpretation. In doing so, it has drawn primarily from hypotheticals in academic philosophy (the “bank cases”) and intuitions about those hypotheticals offered by academic philosophers. Insofar as the legal literature concerns stakes’ impact on ordinary people’s knowledge attributions,144See, e.g., Wurman, supra note 8, at 956–61. those legal debates would benefit from greater engagement with the large body of recent empirical work summarized in the previous Section.

3.  From Philosophy to Legal Philosophy

The previous two Subsections have introduced the debate about stakes and knowledge in epistemology. But it is important to recall that the connection of this debate to legal philosophy requires another step. For example, Doerfler proposes a connection between “clarity” or “plain meaning” of a statute and knowledge about the statute’s meaning: “[T]o say that the meaning of a statute is ‘clear’ or ‘plain’ is, in effect, to say that one knows what that statute means.”145Doerfler, supra note 25, at 527 (emphasis omitted); see also Baude & Doerfler, supra note 127, at 545. The logic appears to be that clarity attributions are a subset of knowledge claims, such that a property demonstrated to affect knowledge claims should transitively affect clarity claims.

Ultimately, this relationship between knowledge and clarity is outside the scope of our Article (the relevant question for the linguistic MQD is stakes’ impact on clarity). However, there are some philosophical questions to raise about the proposed relationship between clarity and knowledge. One, which we described earlier, concerns technical meaning. A layperson might not know what a statute means because it is technical, yet the statute may not be “unclear” to that person in the relevant sense of clarity (that is, ambiguous). As another difference, consider factivity. Philosophers often propose that knowledge is factive: I know p only if p. But it is not obvious that clarity is factive. The meaning of a statute might appear clear (that is, not ambiguous) to an agent while the agent is wrong about the statute’s meaning, and thus the agent lacks knowledge of the statute’s meaning. Such a case would be a counterexample to the claim that an agent knows what a statute means if and only if the meaning of the statute is clear.

Most importantly, the empirical evidence about ordinary attributions of knowledge reviewed here—to the extent that it even does support stakes sensitivity—does not necessarily extend to ordinary determinations of whether statutory text is clear. The studies to date mostly used the bank case, but the bank case presents no rule to which clarity judgments might attach. It might be possible that the clarity of rules is reduced for ordinary people in higher-stakes contexts. Indeed, it is theoretically possible that clarity judgments about textual rules are more sensitive to stakes than knowledge more generally. But it is just as possible that there is a breakage: that is, that clarity claims are not simply a subset of knowledge claims but a special and different kind of knowledge claim. However, as far as we are aware, these are entirely untested empirical hypotheses. Without any empirical evidence specific to clarity claims, it would not be possible to bootstrap ordinary stakes-sensitive clarity from ordinary stakes-sensitive knowledge (moreover, as we have argued, ordinary stakes-sensitive knowledge is also empirically dubious). Part III therefore tests this clarity claim.

B.  Context and Anti-Literalism

Justice Barrett’s concurring opinion in Biden v. Nebraska offers a different argument for the MQD as a linguistic canon. For Justice Barrett, the MQD simply reflects “common sense” inferences about how broader context restricts language’s (literal) meaning.146Biden v. Nebraska, 143 S. Ct. 2355, 2379 (2023) (Barrett, J., concurring) (“Context also includes common sense, which is another thing that ‘goes without saying.’ Case reporters and casebooks brim with illustrations of why literalism—the antithesis of context-driven interpretation—falls short.”). Justice Barrett illustrates this with the babysitter example, claiming that ordinary people understand a delegation to a babysitter to have implicit limits (although a babysitter’s attempt to transgress those normal limits might be allowed by a supplemental clear authorization). This, Justice Barrett suggests, is precisely how an ordinary reader would read a statute delegating authority to an agency, and therefore a canon requiring a clear statement from Congress is justified.147See supra Section I.C.

1.  Anti-Literalism and Context in Ordinary Language

Anti-literalism is an important feature of ordinary language. Consider François Recanati’s discussion of the “You are not going to die” example from Kent Bach:

[Imagine] a child crying because of a minor cut and her mother uttering . . . [“you are not going to die”] in response. What is meant is: “You’re not going to die from that cut.” But literally the utterance expresses the propositions that the kid will not die tout court—as if he or she were immortal. The extra element contextually provided (the implicit reference to the cut) does not correspond to anything in the sentence itself; nor is it an unarticulated constituent whose contextual provision is necessary to make the utterance fully propositional.148François Recanati, Literal Meaning 8–9 (2004).

This example helpfully illustrates that we often understand propositions anti-literally, in light of context, and that the relevant context need not come from the statement itself. The very same words “you’re not going to die,” convey a different meaning when uttered after a child gets a cut than they would in some other context where the literal meaning would be the correct meaning.

The powerful influence of context is not limited to anti-literalism. Extratextual context can also disambiguate. As an example, consider the statement “Do not take drugs and alcohol.” Does this mean “Do not take either one?” Or does it mean “Do not take the two together?” The answer varies across contexts.

If this rule were presented in the context of a substance abuse counseling session, our extratextual knowledge about that session leads us to understand this text [to prohibit each individually]: Don’t take drugs; don’t take alcohol. However, if this rule were presented in the context of a patient’s annual physical, in which the doctor prescribed cholesterol-reducing medications, our extra-textual knowledge about that session encourages [understanding the rule to prohibit the combination].149Kevin Tobia, Jesse Egbert & Thomas R. Lee, Triangulating Ordinary Meaning, 112 Geo. L.J. 23, 51 (2023).

2.  Anti-Literalism in Ordinary Understanding of Legal Rules

Justice Barrett’s argument is attractive in its appeal to context and anti-literalism. And Justice Barrett is not the only modern textualist to appeal heavily to anti-literalism; Justices Gorsuch and especially Kavanaugh have also called attention to the perils of overliteral interpretation.150The Justices use “literal” in various ways, but Justice Gorsuch and Kavanaugh have recently called attention to avoiding inappropriate literalism. See, e.g., Bostock v. Clayton County, 140 S. Ct. 1731, 1750 (2020) (Gorsuch, J.); id. at 1825 (Kavanaugh, J., dissenting) (“[C]ourts must follow ordinary meaning, not literal meaning.”).

For modern textualism, this is a welcome development. Analysis of the (linguistic) meaning of legal rules should attend to context and exceed pure literalism. As one example, consider the linguistic canons. Many linguistic canons reflect intuitive contextual restrictions from literal meaning. “No cars, trucks, or other vehicles may enter the park” might literally prohibit bicycles from the park, as most ordinary people take a bicycle to be a vehicle.151Kevin P. Tobia, Testing Ordinary Meaning, 134 Harv. L. Rev. 726, 757 (2020) (reporting that most laypeople, law students, and judges agree that a bicycle is a “vehicle”). However, the principle of ejusdem generis instructs interpreters to construe the broad, catchall term “vehicle” in light of the listed items (“cars,” “trucks”).152See McBoyle v. United States, 283 U.S. 25, 26–27 (1931). Even if laypeople are not familiar with the name “ejusdem generis,” they intuitively apply this kind of reasoning when analyzing both legal and ordinary rules.153Tobia et al., From the Outside, supra note 18, at 259–60.

People also apply other types of contextual restrictions from literal meaning. This includes some contextual rules that are not currently recognized by courts as linguistic canons. For example, people understand that universal quantifiers like “any” often do not mean literally any.154Id. (reporting studies demonstrating that laypeople intuitively apply a ejusdem generis principle); see also Tobia, supra note 151, at Appendix (reporting that most laypeople do not take “no vehicles in the park” to prohibit a bicycle from the park, even though most laypeople agree that a bicycle is a “vehicle”). If this tendency were at least as systematic in ordinary understanding as those underlying conventional linguistic canons (for example, the tendency to restrict catchall terms as ejusdem generis reflects), a textualist committed to the ordinary reader should employ those new canons (for example, the “quantifier domain restriction canon”).

Recent legal scholarship has also asked whether thinking about context and anti-literalism might reveal that some “substantive” canons are also linguistic canons.155Slocum & Tobia, supra note 37, at 73. Some clear statement rules—such as anti-retroactivity and anti-extraterritoriality—could be seen as linguistic canons, based on our understanding of context. Taken literally, many statutes would seem to apply at all times, in all places.156E.g., 18 U.S.C. § 2119 (“Whoever, with the intent to cause death or serious bodily harm takes a motor vehicle that has been transported, shipped, or received in interstate or foreign commerce from the person or presence of another by force and violence or by intimidation, or attempts to do so, shall . . . be fined under this title or imprisoned not more than 15 years, or both.”). But people understand statutes to communicate temporal and geographical restrictions: while there is some division among laypeople, overall, people tend to understand rules to apply only prospectively, and only territorially.157Slocum & Tobia, supra note 37, at 81–96.

Textualists may rhetorically privilege the “ordinary reader” and express support for anti-literalism, but they have not yet adopted many of these suggestions. For instance, no textualist has adopted an anti-literal “quantifier domain restriction canon” or theorized anti-retroactivity as a linguistic canon (although it is a long-standing clear statement rule). These context-sensitive rules are relatively robust and systematic and are supported by empirical evidence. We have reservations about a textualism that ignores such systematic patterns of anti-literalism while also freely adopting “ad hoc” anti-literal arguments related only to particular cases. On this score, Justice Barrett’s concurrence in Biden v. Nebraska is commendable in hypothesizing about a broader contextual principle that generally guides ordinary understandings of delegations (that is, a principle applying across cases, not an ad hoc appeal to context and anti-literalism related only to the authorization of emergency student loan relief). Whether Barrett’s contextual principle is systematic and empirically supported is a separate question.

Anti-literalism and contextual restriction are powerful ideas that accurately reflect language usage, but if textualists have no theory about when one can appeal to them, there is a danger that textualists can freely frame different readings as “literal” and “anti-literal,” choose liberally among them, or simply ignore non-literal meanings when doing so is convenient.158See id. at 106–08; see also William N. Eskridge, Jr., Brian G. Slocum & Kevin Tobia, Textualism’s Defining Moment, 123 Colum. L. Rev. 1611, 1612–27 (2023) (documenting twelve theoretical choices facing modern textualists and arguing that textualists’ failure to explain their answers to these choices facilitates cherry-picking and undermines rule of law values like predictability). The claim that “in context,” a text does not “literally” mean what it says is also a powerful way for motivated interpreters to escape a text’s clear meaning.

Context matters. But if textualists have no theory about what counts as context and when they must appeal to it, ad hoc appeals to context are like “looking out over a crowd and picking out your friends.”159See Confirmation Hearing on the Nomination of John G. Roberts, Jr. to Be Chief Justice of the United States Before the S. Comm. on the Judiciary, 109th Cong. 200–01 (2005) (on looking to foreign law in U.S. constitutional interpretation). Except here, the “friends” are not even limited to preexisting sources; they also include entirely novel hypothetical examples generated by the judge.

3.  Contextual Restriction of Delegations?

As Section II.B argued, the “anti-literalism” argument of the linguistic MQD needs a stronger premise than simply “people sometimes understand language non-literally.” The mere fact that “you are not going to die” has a nonliteral meaning does not justify the MQD.

The premise necessary to the argument involves a new claim about ordinary understanding of delegations. Justice Barrett proposes that there is some MQD-like principle that is part of ordinary people’s common sense, concerning the limited authorization from a general delegating instruction. It is for this reason that she relies on the babysitter hypothetical, an anti-literalism intuition-pump about an ordinary instruction that delegates power to an agent. General delegation language, Justice Barrett posits, has an anti-literal limitation. Unless there is further specific authorization, that general language is understood to be limited to only the most reasonable actions.

This is an interesting and empirically testable proposition: ordinary people understand general delegations to be limited to only the most reasonable actions falling under the language of the delegation. As far as we know, there is no empirical study that has examined this question. We present a new study to do so in Section III.B.

III.  NEW EMPIRICAL EVIDENCE

This Part tests key empirical claims at the core of the linguistic arguments for the MQD. In both tests, we seek to reduce our researcher degrees of freedom (that is, eliminate cherry-picking scenarios) by relying on the exact cases that advocates of the linguistic defense offer: the high-stakes “bank case” and the “babysitter hypothetical.”

Section III.A presents a study that tests whether ordinary people’s judgments about knowledge are lowered in high-stakes contexts (using the bank case). It also examines, for the first time, whether people’s understanding of a rule is impacted: Are rules perceived as less clear in high-stakes contexts?

Section III.B presents a study to examine the babysitter case: a parent instructs the babysitter to use a credit card to “make sure the kids have fun.” Do ordinary people understand this instruction to license taking the children on a road trip to an amusement park, or do they understand it to be limited to only more reasonable actions?

Section III.C responds to the primary two objections to the studies that have appeared in print since we first publicized this Article’s empirical findings.

A.  Do High Stakes Reduce Knowledge and/or Clarity? The Bank Case

1.  General Overview

The first study examined whether (high) stakes reduce ordinary attributions of (1) knowledge and (2) clarity of rules. We randomly assigned participants to either a low-stakes160Rose et al., supra note 34, at 231. Low:

Bob and his wife are driving home on a Friday afternoon. They both received some money earlier in the day, and so they plan to stop at the bank on the way home to deposit it. But as they drive past the bank, they notice that the lines inside are very long, as they often are on Friday afternoons. Although they generally like to deposit any money they receive at the bank as soon as possible, it is not especially important in this case that it be deposited right away, and so Bob suggests that they drive straight home and deposit their money on Saturday morning. His wife says, “Maybe the bank won’t be open tomorrow. Lots of banks are closed on Saturdays.” Bob replies, “No, I know the bank will be open. I was just there two weeks ago on Saturday. It was open until noon.” As a matter of fact, the bank will be open on Saturday morning.

Id.
or high-stakes161Id. High:

Bob and his wife are driving home on a Friday afternoon. They both received some money earlier in the day and so they plan to stop at the bank on the way home to deposit it. But as they drive past the bank, they notice that the lines inside are very long, as they often are on Friday afternoons. They have recently written a very large and very important check. If the money is not deposited into their bank account before Monday morning, the important check they wrote will not be accepted by the bank, leaving them in a very bad situation. Bob suggests that they drive straight home and deposit their money on Saturday morning. His wife says, “Maybe the bank won’t be open tomorrow. Lots of banks are closed on Saturdays.” Bob replies, “No, I know it’ll be open. I was just there two weeks ago on Saturday. It was open until noon.” As a matter of fact, the bank will be open on Saturday morning.

Id.
condition of the bank case. In each condition, participants read a version of the famous bank case, in which Bob and his wife discuss whether a bank is open on Saturday. Participants answered two types of knowledge questions, drawn from the previous literature.162See id. at 229–32. The basic knowledge question asks:

In your personal opinion, when Bob says “I know the bank will be open” is his statement true?

Yes, Bob’s statement is true.

No, Bob’s statement is not true.

Defenders of context sensitivity have argued that this question more accurately tracks debate about contextualism than questions that simply ask participants to rate “knowledge.”163See DeRose, supra note 31, at 82. The “strict” knowledge question asks:

In your personal opinion, which of the following sentences better describes Bob’s situation?

Bob knows that the bank will be open on Saturday.

Bob thinks he knows that the bank will be open on Saturday, but he doesn’t actually know it will be open.

Next, we randomly assigned participants to one type of rule: Clear, Ambiguous 1, Ambiguous 2, Unclear. The study presented a vignette explaining that Bob’s wife now used her phone to find the bank’s policy on its website. We randomly presented participants with one of four types of rules:

  • [Clear] The bank is open on Saturdays.
  • [Ambiguous 1] The bank is closed on Sundays.
  • [Ambiguous 2] The bank is closed only on Sundays and federal holidays.
  • [Unclear] The bank is open during regular business hours.

Participants rated whether the rule is clear or unclear concerning whether the bank is open on Saturday:

Now imagine that Bob’s wife uses her phone to search for the bank’s policy. She finds a website for the local bank branch. The website’s text states: “[RULE]” In your personal opinion, is this rule’s meaning clear or unclear concerning whether the bank is open on Saturday?

Clear: The bank is open on Saturday.

Clear: The bank is closed on Saturday.

Unclear.

In sum, we experimentally varied two factors: Stakes (low, high) and Rule Type (Clear, Ambiguous 1, Ambiguous 2, Unclear). This study examines whether Stakes affect lay judgment of knowledge (basic and strict). The study also examines whether Stakes affect lay judgment of a rule’s clarity across hypothesized clear, ambiguous, and unclear rules.

2.  Methodological Details

All study materials, hypotheses, exclusion criteria, and primary analyses were preregistered at Open Science.164Kevin Tobia, Stakes and Legal Interpretation, Center for Open Science (July 12, 2023, 09:21 AM), https://osf.io/adw2n [https://perma.cc/9MVV-BR2J]. The study data is also available at the same site. A total of 501 participants were recruited from Prolific.co and compensated $1.00 ($12.00/hr) for a 5-minute task. To be eligible, participants must have completed at least 10 tasks on Prolific, with a 100% approval rating, and they must currently reside in the United States.

Within the study, there were several check questions. First was a simple attention check question, which asked participants to select the answer “purple” in a long list of colors. There was also a manipulation check, clearly labeled as an “attention check”: “Attention check question: According to the story, which of the following statements is correct?” The options were “it is very important that Bob and his wife deposit their money” [correct answer in high-stakes condition] and “it is not very important that Bob and his wife deposit their money” [correct answer in low-stakes condition]. Later in the study, there was a third multiple choice attention check: “Alex is taller than Sam, and Sam is taller than John. Who is the shortest?” [correct answer = “John”; incorrect answers = “Alex,” “Sam,” “They are all the same height”]. Finally, all participants were asked to complete a CAPTCHA. Participants who answered any one of these questions incorrectly were excluded from the analyses. Thirty-two (out of 501; i.e., 6%) participants were excluded from these criteria.

3.  Results

A total of 469 participants were included in the data analysis (mean age = 39.58; 50% men, 48% women, 1% non-binary).

A binomial logistic regression revealed an effect of Stakes on knowledge. Participants attributed knowledge less in high-stakes cases (prob. = 0.86, 95% CI = [0.81, 0.90]) than in low-stakes cases (prob = 0.95, 95% CI = [0.91, 0.97]), odds ratio = 0.35, 95% CI = [0.18, 0.70], z = -2.99, p = 0.003.165See infra Figure 1.

Figure 1.

Figure 1. Percentage attributing knowledge (top panel) and strict knowledge (bottom panel), in low- and high-stakes bank cases. In the high-stakes case, knowledge attributions were slightly (about 10%) lower. Overall, the majority of participants attributed knowledge in low- and high-stakes cases.

A binomial logistic regression revealed an effect of Stakes on strict knowledge. Participants attributed strict knowledge less in high-stakes cases (prob. = 0.66, 95% CI = [0.60, 0.71]) than in low-stakes cases (prob = 0.78, 95% CI = [0.72, 0.83]), odds ratio = 0.55, 95% CI = [0.37, 0.83], z = -2.85, p = 0.004.166See supra Figure 1.

A multinomial logistic regression examined the effect of Stakes (low, high) and Rule Type (Clear, Ambiguous 1, Ambiguous 2, Unclear) on judgment of the bank rule’s clarity (clearly open, clearly closed, unclear). First, consider the effect of Stakes. Comparing clearly open and clearly closed responses, there was no effect of Stakes, z = 0.06, p = 0.956. Comparing clearly closed and unclear responses, there was no effect of Stakes, z = 0.38, p = 0.705. Next, consider the effect of Rule Type. Comparing clearly open and clearly closed responses, there was a significant effect of the clear versus unclear rule, z = -3.07, p = .002. There was no significant effect among the other rule types, |zs| < 0.21, ps > 0.8. Comparing clearly closed and unclear responses, there were no significant rule type effects, |zs| < 0.2, ps > 0.85. Finally, there were no significant Stakes * Rule Type interactions, |zs| < 0.41, ps > 0.68.167See infra Figure 2.

Figure 2.

Figure 2. Percentage attributing a clear meaning (open or closed) or unclarity for four different rules in low- and high-stakes cases. There were large and significant differences among the rules’ perceived meaning: the “Obviously Clear” and “Ambiguous 2” rules were generally understood to mean clearly open; the “Ambiguous 1” rule was understood to be unclear or mean clearly open; and the “Obviously Unclear” rule was unclear. However, there was no impact of high stakes on clarity judgments for any type of rule, whether the rule was clear (for example, Obviously Clear), ambiguous (for example, Ambiguous 1), or unclear (for example, Obviously Unclear).

4.  Discussion

The results regarding stakes and knowledge are consistent with the prior literature. Some previous studies have found a small effect of stakes on knowledge in the United States.168E.g., Rose et al., supra note 34, at 235 (finding a small pattern in the U.S., but not in most other countries). Here, we find a similar small effect: In the low-stakes bank case, 95% attribute knowledge, but in the high-stakes bank case, this number drops to 86%. The “strict knowledge” measure reflects a similarly sized difference (78% versus 66%).

i.  Is Knowledge “Sensitive” to Stakes?

The empirical results clarify the importance of refining this philosophical question: What is it for ordinary knowledge to be “sensitive to stakes”? One (weak) interpretation is that in some circumstances, for some people, stakes affect knowledge attributions. A stronger interpretation is that for most or all people, there are some cases in which knowledge is lost in high-stakes contexts. The strongest interpretation is that in many or most circumstances, high stakes defeat knowledge (for many or most people).

Once we have greater philosophical clarity about what it means to say knowledge is sensitive to stakes, we can analyze those theses in light of the empirical results. The results here straightforwardly provide support for the weak interpretation: the high-stakes manipulation affects (some participants’) attributions of knowledge. But the results do not support the “stronger” or “strongest” interpretations. The vast majority of participants attributed knowledge in low- and high-stakes cases. And even for the “strict knowledge” question, most participants still judged that there was (strict) knowledge in the high-stakes scenario. In other words, for the vast majority of participants, stakes did not impact knowledge.

ii.  Do High Stakes Reduce Clarity?

The results provide a more straightforward answer to this question. The high- versus low-stakes manipulation had no impact on whether people understood rules to be clear or unclear. Importantly, we used four types of rules, which varied in their basic level of clarity. With respect to whether the bank is open Saturday, “the bank is open on Saturday” is obviously clear; “the bank is closed on Sunday” is ambiguous; “the bank is closed only on Sundays and federal holidays” is ambiguous;169Note, we hypothesized that this rule has some ambiguity, given that the scenario does not specify whether the following Saturday is a federal holiday. Participants generally overlooked this possibility or assumed that the next day was not a holiday. Thus, the “Ambiguous 2” stimulus could be treated as another example of “obviously clear” text. The “Ambiguous 1” rule was much more often understood as unclear. and “the bank is open during regular business hours” is unclear. For all of these rules, high stakes did not increase the base level of unclarity.170See supra Figure 2.

B.  Ordinary Understanding of Delegations: The Babysitter Case

The second study examines how ordinary Americans understand delegations in an ordinary context. This Study takes inspiration from Justice Barrett’s recent concurrence in Biden v. Nebraska, which offered a new linguistic defense of the MQD.

1.  General Overview

The second study examined Premise 5 from Justice Barrett’s argument, the second empirical premise: When assessing whether an agent has followed or disobeyed a rule granting authority to perform some actions, do ordinary people restrict the rule’s literal meaning to only the set of most reasonable actions (absent additional context)?171See supra Section I.C. Study 2 examines this question by presenting participants with an ordinary rule granting authority, followed by one of five possible actions. These five actions varied in their anticipated reasonableness, and we examined whether participants evaluated each as following or violating the rule.

As in Study 1, we sought to minimize our researcher degrees of freedom by relying on existing and important test cases that have been offered by advocates of the linguistic MQD. For Study 2, we chose Justice Barrett’s “babysitter” hypothetical, as well as Justice Barrett’s proposed “major” action: a babysitter taking children to an amusement park in response to the instruction “Use this credit card to make sure the kids have fun this weekend.”

We randomly varied the conventional gender of the parent’s name (Patrick or Patricia) and babysitter’s name (Blake or Bridget). This did not affect rule violation judgment. Below is the text of the scenarios with the names Patricia and Blake:

Imagine that Patricia is a parent, who hires Blake as a babysitter to watch Patricia’s young children for two days and one night over the weekend, from Saturday morning to Sunday night. Patricia walks out the door, hands Blake a credit card, and says: “Use this credit card to make sure the kids have fun this weekend.”

Next, the scenario continued in one of five ways:

[MISUSE] Blake only uses the credit card to rent a movie that only he watches; Blake does not use the card to buy anything for the children.

[MINOR] Blake does not use the credit card at all. Blake plays card games with the kids.

[REASONABLE] Blake uses the credit card to buy the children pizza and ice cream and to rent a movie to watch together.

[MAJOR] Blake uses the credit card to buy the children admission to an amusement park and a hotel; Blake takes the children to the park, where they spend two days on rollercoasters and one night in a hotel.

[EXTREME] Blake uses the credit card to hire a professional animal entertainer, who brings a live alligator to the house to entertain the children.

All scenarios concluded with:

The kids have fun over the weekend.

We anticipated that the five scenarios would be seen as varying in their “reasonableness” as a response to the rule “Use this credit card to make sure the kids have fun this weekend,” with the REASONABLE scenario as maximal and the others as less reasonable. As we describe below, this prediction was borne out.

In all of the questions, we randomly varied whether the scenario described the parent’s directive as an “instruction” or “rule.” This also had no effect on rule violation judgment. Below we present the questions using the term “instruction.” After reading the scenario, participants first answered a comprehension question:

Attention check question: According to the story, which of the following statements is correct?

[CORRECT] Patricia’s instruction was “Use this credit card to make sure the kids have fun this weekend.”

Patricia’s instruction was “Do not use this credit card to make sure the kids have fun this weekend.”

Patricia’s instruction was “Use this credit card for anything this weekend.”

Patricia’s instruction was “Do not use this credit card for anything this weekend.”

Next, participants answered the rule violation question:

[Rule Violation] In your personal opinion, which better describes this situation?

Blake followed the instruction.

Blake violated the instruction.

We also measured participants’ judgment of the rule’s literal meaning and purpose.172[Literal Meaning] “Think about what the instruction ‘Use this credit card to make sure the kids have fun this weekend’ means literally. In your personal opinion, did Blake’s actions comply with or violate the literal meaning of the instruction? Blake complied with the rule’s literal meaning; Blake violated the rule’s literal meaning” and [Purpose] “Think about the underlying purposes of Patricia’s instruction. In your personal opinion, did Blake’s actions support or oppose the instruction’s underlying purposes? Blake’s actions supported the instruction’s underlying purpose; Blake’s actions opposed the instruction’s underlying purposes.” Finally, we measured participants’ evaluation of whether the babysitter’s action was a reasonable response to the instruction:

[Reasonableness] Think about how Blake responded to Patricia’s instruction. In your personal opinion, is this an unreasonable or reasonable way to respond to that instruction?

(completely unreasonable) 1  2  3  4  5  6  7 (completely reasonable)

2.  Methodological Details

As for Study 1, all Study 2 materials, hypotheses, exclusion criteria, and primary analyses were preregistered at Open Science.173See Tobia, supra note 164. The study data is also available at the same site. A total of 500 participants were recruited from Prolific.co and compensated $1.00 ($12.00/hr) for a 5-minute task. To be eligible, participants must have completed at least 10 tasks on Prolific, with a 100% approval rating, they must currently reside in the United States, and they must not have taken Study 1. Within the study, there were the same two check questions used as exclusion criteria in Study 1 (attention check and transitivity) and the new comprehension check described in the previous Section. Twenty-four (out of 499; i.e., 4.8%) participants were excluded with this criteria.

3.  Results

A total of 475 participants were included in the data analysis (mean age = 37.74; 48% men, 50% women, 2% non-binary).

First, we examined whether the five acts differed in their perceived reasonableness with respect to the rule. A linear regression revealed significant effects of the Action (misuse, minor, reasonable, major, extreme). Compared to ratings for the “reasonable” act (buying pizza and a movie for the kids), ratings for the misuse act (buying a movie for only the babysitter) were significantly lower, β = -1.67, 95% CI = [-1.89, -1.46], p < .001; ratings for the minor act (playing cards rather than purchasing anything) were significantly lower, β = -0.48, 95% CI = [-0.69, -0.27], p < .001; ratings for the major act (purchasing the amusement park trip) were significantly lower, β = -1.03, 95% CI = [-1.24, -0.82], p < .001; and ratings for the extreme act (purchasing the alligator entertainer) were significantly lower, β = -1.77, 95% CI = [-1.98, -1.56], p < .001.174See infra Figure 3.

Figure 3.

Figure 3: Reasonableness Ratings. Ordinary judgments of an action’s reasonableness in the babysitter hypothetical. Higher scores indicate greater reasonableness (1–7 scale).

Next, we examined which of the five acts participants understood as instances of following or disobeying the instruction. A binomial logistic regression revealed effects of Act type on rule violation. For the misuse case, rule following prob. = 0.15, 95% CI = [0.09, 0.24]; for the minor case, rule following prob. = 0.51, 95% CI = [0.41, 0.61];175This differed significantly from the misuse case, odds ratio = 5.88, 95% CI = [2.94, 11.79], z = 5.00, p < 0.001. for the reasonable case, rule following prob. = 1.00, 95% CI = [0.00, 1.00];176All participants in the reasonableness condition answered, “rule followed.” for the major case, rule following prob. = 0.92, 95% CI = [0.84, 0.96];177This differed significantly from the misuse case, odds ratio = 62.07, 95% CI = [24.73, 155.79], z = 8.79, p < 0.001. and for the extreme case, rule following prob. = 0.90, 95% CI = [0.82, 0.94].178This differed significantly from the misuse case, odds ratio = 49.66, 95% CI = [20.88, 118.11], z = 8.83, p < 0.001.

Table 1.
CaseWas the rule violated?Was the action reasonable (7) or unreasonable (1)?
Reasonable0%6.84 (Most reasonable)
Minor49%5.83 (Highly reasonable)
Major8%4.68 (Reasonable)
Misuse89%3.32 (Unreasonable)
Extreme10%3.12 (Unreasonable)
Note: Table 1 represents the proportion of participants judging that the action violated the rule and the estimated marginal mean ratings of the action’s reasonableness. Some actions that were not the most reasonable (for example, major, extreme) were seen as largely consistent with the rule; others that were seen as fairly reasonable (for example, minor) were also seen as inconsistent with the rule
4.  Discussion

This Study aimed to test the empirical claims underlying the “babysitter hypothetical,” an example that has been used to support claims in a linguistic defense of the MQD.

i.  Do People Understand Different Actions to Vary in Their Reasonableness as a Response to the Rule “Use This Credit Card to Make Sure the Kids Have Fun This Weekend”?

Yes. People evaluated some actions as highly reasonable, such as buying pizza and a movie for the kids. Other actions appeared less reasonable, like taking the kids to an amusement park or simply playing cards (and not buying anything). Others were even less reasonable, such as hiring an alligator entertainer or using the card to only purchase something for the babysitter. These results are unsurprising, but this variation is essential to test the key claim that the babysitter hypothetical has been offered to demonstrate.

ii.  Do People Understand Authorizing Rules to Be Limited to Only the Set of Most Reasonable Actions?

No. Although people evaluate Justice Barrett’s “major” action (taking the kids to an amusement park) as less reasonable than at least one alternative, they nevertheless understand it as consistent with the rule. Moreover, people evaluated the even more extreme example of bringing a live alligator to the house as consistent with the rule.

To be sure, people did rule out some actions as impermissible. In particular, the respondents overwhelmingly said that misuse of the credit card for the babysitter’s benefit rather than that of the children violated the rule. They also divided roughly evenly over the babysitter’s decision to forgo using the credit card at all. We will have more to say about these interesting patterns in Part IV,179See infra Part IV. but for now, the most important thing to note is that two of the less reasonable actions that tested the boundaries of the instruction were nevertheless deemed to be within the parent’s rule.

iii.  Why Do People’s Judgments About an Act’s Reasonableness and Rule Violation Differ?

Our survey also included questions about the rule’s literal meaning and the rule’s purposes. First consider reasonableness judgments by considering the results for purpose and literal meaning. Figure 4 presents the results for the purpose question. On inspection, this pattern of purpose attributions across actions is similar to the pattern of reasonableness ratings (Figure 3): actions seen as more reasonable were also the ones seen as most supportive of the rule’s purposes. The ratings for purpose and reasonableness, r = 0.63, 95% CI = [0.57, .0.68], p < .001, were more highly correlated than the ratings for purpose and literal meaning, r = 0.39, 95% CI = [0.31, .0.47], p < .001.

Next consider judgments about rule violation. Both literal meaning and purpose were correlated with rule violation judgment, but rule violation was more strongly correlated with literal meaning, r = 0.67, 95% CI = [0.62, .0.72], p < .001, than purpose, r = 0.49, 95% CI = [0.42, .0.56], p < .001.

Figure 4.

Figure 4: Purpose Ratings. Ordinary judgments of whether an action supports (rather than opposes) the rule’s purposes in the babysitter hypothetical.

These analyses are exploratory and further work is required to more fully understand the differences in participants’ judgments about whether an action is reasonable and whether it violates a rule, but the Study here clearly shows a difference in these judgments.180See supra Table 1. The question of whether the rule was violated and the question of whether the action was a reasonable response to the rule are understood differently by ordinary people: These questions are not synonymous. The comparisons to the purpose measure suggest a stronger relationship between reasonableness and purpose than rule violation and purpose.

Textualists concerned with the ordinary meaning of rules would presumably favor the rule violation question over the reasonableness question. Textualists who place significant weight on whether an action was “reasonable” with respect to a rule may be incorporating purposive reasoning, which is not as clearly relevant to ordinary people’s straightforward understanding about whether an act violates a rule.

The results reported here about laypeople’s rule violation judgments are consistent with prior work. Previous studies have found that both text (operationalized as literal meaning) and purpose influence rule violation judgment, but the former has a stronger influence.181Ivar R. Hannikainen, Kevin P. Tobia, Guilherme da F. C. F. de Almeida, Noel Struchiner, Markus Kneer, Piotr Bystranowski, Vilius Dranseika, Niek Strohmaier, Samantha Bensinger, Kristina Dolinina, Bartosz Janik, Eglė Lauraitytė, Michael Laakasuo, Alice Liefgreen, Ivars Neiders, Maciej Próchnicki, Alejandro Rosas, Jukka Sundvall & Tomasz Żuradzki, Coordination and Expertise Foster Legal Textualism, 119 Proc. Nat’l Acad. Scis., no. 44, 2022, at 1, 6; Kevin Tobia, Experimental Jurisprudence, 89 U. Chi. L. Rev. 735, 783–91 (2022) (summarizing research on lay judgment about legal interpretation). See generally Guilherme da Franca Couto Fernandes de Almeida, Noel Struchiner & Ivar Hannikainen, Rules, in Cambridge Handbook of Experimental Jurisprudence (Kevin Tobia ed., forthcoming 2024) (reviewing recent empirical studies about the effect of text and purpose on laypeople’s rule violation judgments). In sum, ordinary people lean towards textualism, but not the “common sense” limitations claim at the heart of the linguistic MQD.

C.  Objections

This Section considers the two primary objections that have been raised in print about the results since we first circulated a draft of this Article.

1.  Objection 1: Subjects Must Be Sensitive to Stakes

One objection concerns stakes sensitivity. Wurman writes, “In conversation, Ryan Doerfler has pointed out that it does not appear that the participants [in this Article’s Study 1] were asked whether the rule was clear to Bob, as opposed to themselves, and Bob is the one sensitive to stakes in the example.”182Wurman, supra note 8, at 961 n.271.

It is not clear why this observation constitutes an objection. One version of this objection is that only the judgments of those directly impacted by the stakes are relevant to legal theory, and because our study’s participants are not themselves impacted by the bank’s closure, their judgments about knowledge and clarity are not useful. This objection proves too much. The legal literature theorizing the effects of stakes-on-knowledge and stakes-on-clarity draws heavily on philosophical thought experiments (especially the bank case about Bob). None of these examples involve high stakes for the thought experimenter. The stakes are always for the subject described in the scenario, like Bob. The assumption is that those considering the scenarios can evaluate the significance of stakes (for some other person). If this objection undercuts our experiments, it also undercuts the merit of the original philosophical thought experiments offered to support Wurman’s argument.

A different way to elaborate this observation into an objection is to propose that (1) there is a more subjective relationship between stakes and clarity and (2) that (subjective sense) of clarity is relevant to legal interpretation. For a particular judge, that judge’s determination of clarity depends on the practical stakes to that particular judge. We do not have the space to fully engage with the merits of this theory, but some of its consequences are unusual. Because the subjective practical stakes of a decision may vary between judges, on this subjective view of stakes and clarity, such differences in subjective stakes would appropriately correspond to differing evaluations of clarity. A judge experiencing high practical stakes could deem a text unclear, while a judge experiencing lower practical stakes could deem the same text clear. However, many would think that whether a legal text is clear or unclear (in the sense relevant to legal interpretation) should not vary among judges in this way.183See Richard M. Re, A Law Unto Oneself: Personal Positivism and Our Fragmented Judiciary, 110 Va. L. Rev. (forthcoming 2024) (manuscript at 5), https://papers.ssrn.com/sol3/papers.cfm?abstract_id=4687303 [https://perma.cc/E6DW-U4P2] (acknowledging that the dominant approach in jurisprudence is to “identify a ‘general’ theory of law that assertedly applies to most or all legal systems” but also arguing for the possibility and desirability of some degree of “personal” law implemented by judges). On this highly subjective view, to predict whether a law is correctly identified as “clear” (in the eyes of a particular judge), one must know what practical stakes the (particular) judge faces.

Although we find this an unusual view about what clarity means in current legal interpretation, this objection’s underlying claim is an empirically testable one. As such, we investigate this empirical question as a robustness check: Do stakes affect people’s judgment about whether the rule is clear to Bob?

2.  Objection 2: Only Parents’ Views About the Babysitter Hypothetical Count

A recurring objection to our study about the babysitter hypothetical concerns the population surveyed. Both Josh Blackman and Ilan Wurman have suggested that the appropriate audience for Justice Barrett’s hypothetical is parents.184See Josh Blackman, Major Questions or Lax Parents?, Reason (July 27, 2023, 10:28 PM), https://reason.com/volokh/2023/07/27/major-questions-or-lax-parents [https://perma.cc/J74R-XGX7] (“Justice Barrett . . . may be referring to people who are familiar with the process of hiring babysitters. . . . Diversity of views is very important. One facet of diversity is having children.”); Wurman, supra note 8, at 961 n.271 (“It would be worth testing how many participants would agree that the instruction was followed if they were the parents.”). Because Justice Barrett’s hypothetical involves a parent, the objection goes, we should look to (only) the views of parents in understanding the meaning of the parent’s instruction.

Our original study did not collect data about participant’s parental status because we see it as irrelevant to the legal theory debate about the ordinary meaning of the parent’s instruction to the babysitter—more on that below. However even if we had that data in the first study, it is extremely unlikely that filtering by parental status would result in a different bottom line result given that the overall results lean so strongly in one direction.185Consider a back-of-the-envelope calculation. Only 8% of participants responded that the babysitter who took the children to an amusement park violated the rule. Assume, favorably to the objectors, that these 8% of responders all had children. The majority of all parents’ responses would favor the babysitter hypothetical intuition if at least 92% (85/92) of the other participants did not have children. This would imply that, at most, 16% of all participants had children. Given that we recruited a sample of Americans, it is likely that much more than 16% had children. Only 8% of participants shared the intuition that the babysitter violated the parent’s instruction, and of the other 92% of participants, it is unlikely that that the vast majority (say 90%) did not have children. In the next Section, we present a new study that collects this additional demographic data. The results do not vary depending on parental or babysitter-hiring status.

Our more fundamental responses to this objection are theoretical and appeal to longstanding principles of interpretation. First, consider the question of audience: Should the babysitter hypothetical be limited to parents? That is, should textualist interpretation’s “ordinary reader” be limited to only a small subset of ordinary readers?

First, the objection assumes the wrong interpretive perspective. In Justice Barrett’s hypothetical, the parent is the lawgiver and the babysitter is the audience. But the correct textualist focus, according to Justice Barrett, is on interpretation from the “outside[],” not from the “inside[].”186See Barrett, supra note 17, at 2194. Textualists typically view interpretation from the perspective of a “hypothetical reasonable person,” not from the perspective of the lawgiver. Fidelity to the text of the statute, as understood by an ordinary reader, is the best way to remain a faithful agent of Congress (or, as Justice Barrett would have it, as a faithful agent of the people). Thus, even if one specific focus in the babysitter hypothetical were deemed more appropriate, for modern textualists that focus would more likely be that of a babysitter (the instruction’s reader), not a parent (the instruction’s author).187See Tara Leigh Grove, Testing Textualism’s “Ordinary Meaning”, 90 Geo. Wash. L. Rev. 1053, 1057 (2022).

Second, textualists do not subdivide the general class of ordinary people that determines ordinary meaning. Instead, the “Supreme Court tends to employ a one-size-fits-all approach to interpretation.”188David Louk, The Audiences of Statutes, 105 Cornell L. Rev. 137, 193 (2019). We recognize the importance of interpretive communities and the observation that statutes have audiences.189See generally id. (arguing that the varied audiences of statutes may have differing expectations about statutory meaning). However, the concept of audience is most often used in textualist theory and otherwise to support the use of technical meanings (instead of ordinary meanings) for certain specialized statutes. Otherwise, the same statute might mean different things to the different groups subject to it, a position that Justice Scalia (writing for the Court) condemned.190Clark v. Martinez, 543 U.S. 371, 380 (2005) (rejecting the argument that the same statutory provision can have a different meaning depending on the group subject to it). The proposal to find ordinary meaning only in the views of the people most directly implicated by the law is thus a radical departure from modern textualism.

This suggestion (the legal interpretive equivalent of “ask only parents”) also strikes us as unworkable. If an interpreter aimed to limit “ordinary meaning” to the meaning a statute has to the people most directly impacted by it, how do we identify the people in this community? Even in Justice Barrett’s more straightforward babysitter hypothetical (and again setting aside that the babysitter is the audience, not the parent), we could ask: Are the relevant readers all parents, parents who go away for weekends, parents who can also afford babysitters, parents who would be willing to hand a credit card to a babysitter, or parents who would be willing to hand a credit card to a babysitter with limited instructions?

Even if the relevant subcommunity could be identified, it is not clear a judge would be well positioned to identify this narrow subcommunity’s understanding. If the textualist interpretive inquiry shifted from one about ordinary meaning to one about “ordinary meaning for only the audience most directly impacted by this statute,” might judicial intuition be especially unreliable if judges were not part of this latter subcommunity?

This suggestion becomes more bizarre as we shift from the babysitter hypothetical to real legal examples. In Biden v. Nebraska, who is the relevant interpretive community of people: the Department of Education, people with student loans, or some other group? If we take this objection and analogy seriously, it seems we should ask who is the “parent” in Biden v. Nebraska? Presumably, it is Congress. Do Blackman and Wurman suggest that Congress’s views are most relevant in interpretation? If so, this objection offered by Justice Barrett’s defenders, emphasizing a narrow subgroup of people who give or implement this instruction, is inconsistent with Justice Barrett’s broader approach to interpretation, which emphasizes judges as faithful agents of the “people,” not Congress.191See supra notes 186–87 and accompanying text.

In sum, the objection to “ask parents” about the babysitter hypothetical is not persuasive. Theoretically, the legal-interpretive analogue to “ask parents” is unmotivated, unworkable, and inconsistent with modern textualism. Nevertheless, we address this objection in the next part of this Section, in a replication study that asks for the participants’ parental status. Empirically, the results are no different for participants who are parents or who have hired a babysitter.

3.  An Additional Empirical Study

We are not persuaded by the theory underlying these two objections, but we are grateful to those who have raised them. And, setting the theoretical issues aside, it is possible to test these objections empirically. To do so, we conducted one final study.

Five hundred participants were recruited from Prolific to complete Study 1 and Study 2, with a few minor modifications aimed at addressing the objections described previously. A total of 445 participants passed the same attention checks described in Study 1 and Study 2 above and were included in the analysis (mean age = 37.9, 47% men, 51% women, 2% nonbinary). The final demographics section also asked about whether the participants had children (38% yes, 59% no, 2% prefer not to respond), had hired a babysitter (21% yes, 77% no, 2% prefer not to respond), and had worked as a babysitter (48% yes, 51% no, 1% prefer not to respond).

i.  Testing Clarity to Bob (the Agent Sensitive to Stakes)

Participants first read the Study 1 materials concerning Bob and the bank. They were again randomly assigned to high or low stakes and one of the four rule types (Obviously Clear, Ambiguous 1, Ambiguous 2, Obviously Unclear). Participants answered the same questions about knowledge and strict knowledge, as well as a new question that Wurman recommends about clarity to Bob:

[Clarity to Bob] Now imagine that Bob’s wife uses her phone to search for the bank’s policy. She finds a website for the local bank branch. The website’s text states [rule text varying by scenario].

Consider Bob’s perspective on this scenario.

Is this rule’s meaning clear or unclear to Bob concerning whether the bank is open?

Clear: The bank is open on Saturday.

Clear: The bank is not open on Saturday.

Unclear

A multinomial logistic regression examined the effect of Stakes (low, high) and Rule Type (Clear, Ambiguous 1, Ambiguous 2, Unclear) on judgment of the bank rule’s clarity to Bob (clearly open, clearly closed, unclear). First, consider the effect of Stakes. Comparing clearly open and clearly closed responses, there was no effect of Stakes, z = 0.95, p = 0.341. Comparing clearly closed and unclear responses, there was no effect of Stakes, z = 1.26, p = 0.209. There was no significant effect of Rule Type and no significant Stakes * Rule Type interactions.

The results for these questions about whether the rule is clear to Bob also show no effect of stakes. For the Obviously Clear rule, stakes did not affect judgments of clarity to Bob (2% of participants selected unclear in high stakes; 2% in low stakes); for the Ambiguous 1 rule, stakes did not affect judgments of clarity to Bob (29% of participants selected unclear in high stakes; 38% in low stakes); for the Ambiguous 2 rule, stakes did not affect judgments of clarity to Bob (4% of participants selected unclear in high stakes; 12% in low stakes); and for the Obviously Unclear rule, stakes did not affect judgments of clarity to Bob (61% of participants selected unclear in high stakes; 59% in low stakes).

The results for knowledge and strict knowledge were similar to the results found in Study 1. A binomial logistic regression revealed an effect of Stakes on knowledge. Participants attributed knowledge less in high-stakes cases (prob. = 0.85, 95% CI = [0.80, 0.89]) than in low-stakes cases (prob = 0.94, 95% CI = [0.89, 0.96]), odds ratio = 0.38, 95% CI = [0.19, 0.73], z = -2.89, p = 0.004. A binomial logistic regression revealed an effect of Stakes on strict knowledge. Participants attributed strict knowledge less in high-stakes cases (prob. = 0.64, 95% CI = [0.58, 0.70]) than in low-stakes cases (prob = 0.81, 95% CI = [0.75, 0.86]), odds ratio = 0.42, 95% CI = [0.27, 0.65], z = -3.91, p < 0.001.

In sum, one objection to our original Study 1 is that it fails to ask the right question about clarity: it should ask whether the text is clear to Bob, not clear in general. This follow-up study tested that question about clarity to Bob, finding identical results: participants’ judgments about clarity to Bob were not sensitive to high stakes.

ii.  Parents Only

The second objection is that we should consider only the views of parents. Consider the results of the same study, replicated, broken out by whether participants are parents, and have hired a babysitter.192See infra Table 2.

Table 2.
CaseViolation: All ParticipantsViolation: Parents OnlyViolation: Hired Babysitter OnlyWas the act reasonable (7) or unreasonable (1) (All Participants)
Reasonable0%0%0%6.87
Minor30%26%33%6.23
Major8%7%10%4.41
Misuse81%79%84%3.21
Extreme21%18%25%3.00
Note: Table 2 represents the proportion of participants judging that the action violated the rule and the estimated marginal mean ratings of the action’s reasonableness.

Comparing across all participants, parent participants, and those who have hired babysitters, the results are essentially identical. Participants generally disagreed that the babysitter violated the rule/instruction by taking the children to an amusement park overnight, and this did not depend on whether those participants were themselves parents or had hired a babysitter.

4.  Additional Objections

We have responded to the two major objections leveled against the studies since we made a draft of this Article public. However, there are two other objections that strike us as worth pursuing, but which we do not have the space to fully explore here.

The first is that in our babysitter experiment, we should have asked a different question. As a reminder, we asked “which better describes this situation?”—that the babysitter “followed” the instruction/rule or “disobeyed” the instruction/rule? This strikes us as a straightforward way to capture textualists’ concern: What does the rule mean to the ordinary reader? Wurman suggests that we should have asked other questions, like whether participants agree that the instruction “include[s] authorization” to undertake this action, or whether participants think “ordinary, reasonable interpreters of this parent’s instruction would have interpreted it to include this scenario.”193Wurman, supra note 8, at 961 n. 271. (“The question’s framing effectively required the participants to answer whether the babysitter literally violated the instruction. And the answer is of course not. But if the question had been asked another way—‘does the best reading of the parent’s instruction include authorization to undertake this action?’ or ‘do you think the parent’s instruction was intended to include this scenario?’ or ‘do you think ordinary, reasonable interpreters of the parent’s instruction would have interpreted it to include this scenario?’—the answer almost certainly would have been different.”). Wurman does not motivate these suggestions with much theory, and it is not obvious why these phrasings would identify participants’ understanding of the meaning of the rule. For example, recall that there are many theories of interpretation: textualism, purposivism, and consequentialism. It is not obvious that most people think the “reasonable interpreter” is a textualist. Perhaps people think that the “ordinary reasonable interpreter” is not a pragmatist. If so, asking people about their views of “the reasonable interpreter” would reliably generate non-textualist judgments.

Nevertheless, in our third study, we also asked these two additional questions: (1) “In your personal opinion, which better describes this situation?”—(a) The parent’s instruction/rule “authorized” the babysitter “to undertake this action”; or (b) The parent’s instruction/rule “did not authorize” the babysitter “to undertake this action”; and (2) “In your personal opinion, which better describes this situation?”—(a) “An ordinary person interpreting” the parent’s instruction/rule “would understand it to allow what” the babysitter did; or (b) “An ordinary person interpreting” the parent’s instruction/rule “would not understand it to allow what” the babysitter did. The results did not differ in the dramatic way that Wurman predicts. For the first authorization question, 85% of participants agreed that the “major” action was authorized. For the second “reasonable interpreter” question, a majority (57%) agreed that this reasonable interpreter would give the textualist response to the major action case: the reader would understand the instruction/rule to allow what the babysitter did.

A final objection states that the parent-babysitter analogy is a poor analogy for the Congress-agency relationship. This objection is sometimes offered as a critique of the MQD, not a defense, and as a reason why we should not indulge a faulty analogy. Less frequently, it is raised as a defense to the MQD—suggesting that the context of a real-world delegation would surely include consideration of constitutional structure.194See generally Chad Squitieri, Placing Legal Context in Context (Oct. 23, 2023) (unpublished manuscript), https://papers.ssrn.com/sol3/papers.cfm?abstract_id=4610078 [https://perma.cc/G7CL-UZBV]. We find the basic point that context matters persuasive, but the muscular vision of ordinary understanding of legal context that Squitieri offers raises serious problems. It cannot be the case that the MQD is supported because an “ordinary” reader who studied the question of congressional delegation closely enough might become skeptical of the delegation of major power to agencies. First, questions of delegation are highly contested on many grounds—even trained lawyers and judges disagree vehemently about the legality and propriety of delegation. See Julian Davis Mortenson & Nicholas Bagley, Delegation at the Founding: A Response to the Critics, 122 Colum. L. Rev. 2323, 2323 (2022) (providing an overview of contemporary debates about the original public meaning of the Constitution’s vesting of legislative power in Congress). Second, on Squitieri’s approach, there must be some limit on the amount of legal context that can be assumed to be known by the “ordinary” reader. Otherwise, there would be nothing constraining judges in the elucidation of matters of ordinary meaning through their own trained, but highly subjective, minds. Ultimately, if the concept of the ordinary reader is to do any work within a textualist theory that constrains judges, it must provide some limits on the amount of legal context that can be assumed by the judge. We are exploring the idea that the babysitter hypothetical misses important relevant context to the evaluation of delegations in a future piece, but for now we set it aside. This Article takes the babysitter hypothetical from Biden v. Nebraska on its own terms. For readers that are skeptical that such an analogy provides insight into the meaning of statutory language, our study offers a second line of critique: the analogy does not offer insight into statutory meaning, and even if it did, its assumption about ordinary readers is faulty.

IV.  IMPLICATIONS

The recent pivot to a linguistic defense of the MQD is a watershed moment for two fields of law that often intersect: statutory interpretation and administrative law. Through the narrowest lens, the reframing of the MQD as “linguistic” attempts to insulate the nascent MQD from scrutiny as hypocritical anti-textualism, allowing conservative judges to use the doctrine to curb the power of the administrative state without turning in their textualist cards.195See supra Section I.A. But the move also resonates much more deeply. If accepted, the connection being drawn between ordinary people and the MQD would move textualism further towards an “outsider” orientation, with implications well beyond the narrow purview of the MQD.196See Barrett, supra note 17, at 2199; Tobia et al., Ordinary People, supra note 103, at 383. Likewise, if accepted, the linguistic defense of the MQD would tend to reinforce trends toward an explicitly “libertarian administrative law,”197Cass R. Sunstein & Adrian Vermeule, Libertarian Administrative Law, 82 U. Chi. L. Rev. 393, 393 (2015). backing it with the force of supposedly ordinary people’s commonsense understanding of how government should work.

The theoretical critiques and original empirical evidence presented thus far in this Article support skepticism about the arguments to adopt the MQD as linguistic. In this Part, we explain why, and we also reflect on what our evidence says more generally about the fields of statutory interpretation and administrative law.

We start in Section IV.A by discussing how our investigation and findings challenge the conclusion that the MQD is a valid linguistic canon. In light of existing empirical work, our new empirical studies, and our new theoretical analysis and objections, we conclude that the two “linguistic defenses” of the MQD do not have adequate empirical support or theoretical clarity to succeed. Of course, defenders of the MQD might propose new arguments or different evidence, but for now, it is difficult to see on what basis one could employ the MQD as a valid linguistic canon.

Section IV.B explains that Justice Barrett and Wurman’s attempts to establish the MQD as a linguistic canon raise serious challenges to textualism. Justice Barrett’s arguments about “common sense” and “context” are so general that they threaten to undermine textualism’s commitment to enforcing the rule of law by privileging semantic content, even when unexpected applications are at issue. In turn, Wurman’s defense of the MQD necessarily involves a broad conception of “ambiguity.” This broad framing of ambiguity has been criticized by Justices Scalia and Kavanaugh and, like Justice Barrett’s arguments, would result in courts using “ambiguity” as a pretext to avoid the semantic meanings of statutes.

Finally, Section IV.C addresses broader implications for administrative law and regulation. We have reservations about any strategy to ground judicial interpretation in “ordinary people’s” understanding of ordinary examples, especially for a topic as technical as administrative law. Nevertheless, for the sake of argument, we consider where such an “ordinary” approach should take textualist interpreters. Empirical evidence about ordinary understanding of law and language suggests a dramatically different approach than what Justice Barrett suggests for the MQD. Ordinary people understand broad delegations to include a wide range of reasonable actions consistent with the delegation. Moreover, our findings reveal something we did not expect: ordinary people are fairly skeptical that underimplementation of delegated authority is consistent with facially broad delegations. These facts do not support the MQD, but they might support other linguistic canons—many of which have more in common with Chevron than the MQD—and they may counsel some rethinking of administrative law’s indifference to agency inaction.

A.  The Major Questions Doctrine Is Not a Valid Linguistic Canon

The most immediate question motivating our studies is whether there is a valid basis for considering the MQD as a linguistic canon of statutory interpretation. As discussed above, canons are traditionally distinguished according to whether they are justified by normative or legal principles (in which case they are substantive) or whether they help determine the linguistic meaning of statutory language (in which case they are linguistic).198The conventional understanding of canons takes these options to be mutually exclusive: the MQD is either a linguistic canon, a substantive canon, or neither—but it cannot be both. See, e.g., Biden v. Nebraska, 143 S. Ct. 2355, 2376 (2023) (Barrett, J., concurring) (proposing that the MQD is a linguistic canon and noting skepticism about (all) substantive canons). Although a canon can be both substantive and linguistic,199Slocum & Tobia, supra note 37, at 70 (arguing that a canon could have both a valid linguistic and substantive basis). the MQD’s defenders have emphasized the MQD’s supposed linguistic properties because of growing concerns among textualists about both substantive canons generally and the MQD in particular. The existing empirical evidence reviewed in Part II and original empirical studies in Part III suggest this is a false start: the linguistic properties identified by the MQD’s defenders do not find support in the intuitions (or “common sense”) of ordinary people. Consequently, at least in the absence of further empirical studies, the MQD cannot, and should not, be defended as a valid linguistic canon capturing how ordinary readers understand delegating statutes.

1.  The Evidence Does Not Support a “High-Stakes” Linguistic Major Questions Doctrine
i.  High Stakes and Knowledge

Start with the theory that the MQD is justified on the grounds that, for ordinary people, the stakes of an interpretive dispute impact the text’s clarity.200Wurman, supra note 8, at 957. This argument begins by appealing to analytic philosophy and legal theory that posits a relationship between stakes and knowledge claims.201See, e.g., DeRose, supra note 24, at 914–15; Doerfler, supra note 25, at 523. The central example is the bank case: when little depends on the bank being open on Saturday, we know that it is open; but, when the stakes of the Saturday deposit are higher, we do not know that it is open.

However, a large empirical literature reports this claim to be false,202See supra Part II. and the entire philosophical literature to be “founded on a myth” about people’s reactions to these cases.203Schaffer & Knobe, supra note 34, at 675. Many studies find that high stakes have no effect at all on knowledge. Moreover, most of these studies use the exact case (the bank case) to which defenders of the linguistic MQD appeal.

The comparatively fewer studies that find an effect on knowledge report a small effect. In those studies, high stakes reduce knowledge for around 10% of participants, but not for the vast majority.204See, e.g., Rose et al., supra note 34, at 233. This Article’s new large empirical study (N = 500) finds a similarly small effect on knowledge, only a 9% difference between the low- and high-stakes cases.205See supra Section III.A.

Textualists are not always clear about how to construct their “ordinary reader,” but it is difficult to see how even this small difference (95% of people in low stakes agree there is knowledge, and 86% of people in high stakes agree there is knowledge) is sufficient to conclude that “the ordinary reader” has less knowledge in high-stakes contexts. For the vast majority of ordinary participants, high stakes have no impact on knowledge; the foundational premise in the “high-stakes” MQD seems to reflect an unordinary epistemology.

ii.  High Stakes and Clarity

The “high-stakes” argument for the linguistic MQD uses this (false) premise about knowledge as a theoretical foundation to support a technically distinct, and to date untested, claim that ordinary people follow the same epistemological pattern when making judgements about the clarity of statutory language. Assuming that people do this, the argument concludes that a high-stakes situation can render otherwise clear statutory language unclear.

The recent “high-stakes” legal interpretation literature seems to assume that statutory interpretation essentially involves a kind of knowledge claim, such that high stakes’ impact on knowledge necessarily carries over into the interpretive context.206See Wurman, supra note 8, at 957; Doerfler, supra note 25, at 523. Conceptually, we disagree with this literature’s equation of knowledge about a text’s meaning and textual clarity: language can be clear (in the relevant sense) even if laypeople do not have knowledge of its meaning. Consider books that report statements like: “The Art Nouveau movement preceded the Art Deco movement,” or “The Sarbanes-Oxley Act established the Public Company Accounting Oversight Board.” Even if a layperson does not have full knowledge about what these statements mean (that is, cannot accurately assess the statements’ truth or falsity, or explain what they mean to someone in reasonable detail), this does not imply that the statements are in any way unclear (in the sense of appearing ambiguous or indeterminate) to that layperson. Our data (the only we are aware of on this point) is not consistent with this transitive logic.207See supra Section III.B. We found that high stakes have a small effect on knowledge, but no effect at all on textual clarity. This finding supports the conclusion that ordinary judgments of knowledge do not rise and fall consistently with ordinary judgments of textual clarity.

More importantly, we find that high stakes have no effect on clarity for texts of varied levels of baseline ambiguity. High stakes did not reduce ordinary people’s sense of clarity for a fairly clear text or even for texts that were initially more ambiguous.208See supra Section III.B. This finding challenges the more critical premise in the “high-stakes” MQD defense (concerning clarity, not knowledge).

Together, these two problems count against the “high-stakes” linguistic defense of the MQD. High stakes have (at best) a small impact on knowledge and no impact on clarity. We have also noted various other theoretical issues with the “high-stakes” linguistic argument. For example, even if high stakes had the hypothesized effects, it is not clear why reduced knowledge or textual clarity puts more weight on judges’ readings of the statutes or implies anti-agency interpretation rather than putting more weight on agency interpretations of the statutes.209See Wurman, supra note 8, at 954–55 and accompanying text.

2.  The Evidence Does Not Support an “Anti-Literalist” Linguistic Major Questions Doctrine
i.  The Data Do Not Support the Stronger Claim Necessary to the “Anti-Literal” Linguistic Major Questions Doctrine

The previously discussed considerations about anti-literalism210See supra Section II.B. are insufficient to support a strong conclusion about the MQD. Just because people sometimes interpret non-literally and display context sensitivity does not imply that courts should interpret general delegating language to authorize only a small subset of agency actions that fall under the text’s meaning. One could easily agree that (1) delegations should not always be interpreted literally, while also holding that (2) anti-literalism does not lead to the MQD.

In Section III.B, we reconstructed Justice Barrett’s argument in sufficient detail to deliver the MQD conclusion. We understood her key empirical claim to be the following: absent additional context, ordinary people understand rules that grant authority to an agent to have significant contextual limitations against all “major” actions; such a rule’s communicative content is limited to authorizing only the set of most reasonable actions. Here, an action is “major” if readers understand it, absent additional context, as not among the set of most reasonable ways to follow the rule. While this is a much stronger premise than mere anti-literalism, an even stronger premise is necessary to conclude that in MQD cases, absent additional context, judges should interpret delegations to exclude all major actions.

Our empirical study tested this claim about ordinary understanding of grants of authority.211See supra Section III.B. Here, we again sought to minimize researcher degrees of freedom and chose cases that have been offered by advocates of the linguistic MQD. In Study 2, we examined Justice Barrett’s “babysitter case.” We found that most ordinary people do not take the babysitter’s actions, that is, taking children on a multi-day trip to an amusement park, to be unauthorized by the parent’s instruction to use the parent’s credit card to ensure that the kids have fun over the weekend. To the contrary, 92% of respondents took the babysitter’s actions to be consistent with the rule/instruction. When we looked at a more extreme hypothetical—bringing a zookeeper to the house to entertain the kids with a live alligator—respondents judged the babysitter’s actions less reasonable but virtually just as authorized by the parent’s instruction to “make sure the kids have fun.”

However, our respondents did not simply think anything followed the rule. Fully 85% of them thought that the babysitter’s decision to use the credit card for something other than the children’s entertainment violated the instruction, and 49% believed that it was a violation of the instruction to entertain the children too little.

Importantly, these different actions varied in their perceived reasonableness. Participants agreed that it is more reasonable to respond to the parent’s instruction by buying the kids pizza, and less reasonable to take the kids to an amusement park or hire an animal entertainer. Nevertheless, participants judged that these latter actions—while not part of the most reasonable set of responses—are fully consistent with the rule.

Ultimately, these findings suggest that even if Justice Barrett is right that context matters for interpreting grants of authority to administrative agencies, that fact alone does not justify the strong MQD. To point to “common sense” and “context” may be entirely reasonable for a judge—we will have more to say about this in the next Section—but referring to them does not rule out “major” or less reasonable agency actions, at least in the minds of ordinary readers.

3.  Limits of the Evidence, and the Bottom Line

Our two studies test the central examples that have been offered by proponents of the MQD as a linguistic canon. Both of those arguments appeal centrally to claims about how ordinary readers understand language; neither of those claims is supported by the studies conducted here. Of course, this Article’s focus is on the linguistic arguments, not the many other defenses of the MQD.212See, e.g., Randolph J. May & Andrew Magloughlin, NFIB v. OSHA: A Unified Separation of Powers Doctrine and Chevron’s No Show, 74 S.C. L. Rev. 265, 289–91 (2022) (discussing the MQD as a separation of powers principle); Nathan Richardson, Keeping Big Cases from Making Bad Law: The Resurgent “Major Questions” Doctrine, 49 Conn. L. Rev. 355, 359 (2016) (discussing the MQD as a safety valve for Chevron deference). And concerning the linguistic case, we are open to future arguments and empirical studies: some future revision of a linguistic defense of the MQD could possibly succeed. In this Section, we briefly highlight some of the limits of our studies and the doors they leave open for proponents of the MQD. We also summarize our “bottom line” about the MQD.

i.  Substantive Arguments for the Major Questions Doctrine

First, and perhaps most obviously, our studies do not foreclose a substantive basis for the MQD. That is, rather than grounding the doctrine in how text is understood, proponents of the MQD might point to constitutional or normative values that should lead judges to depart from the best reading of statutory language when agencies take major actions. The fact that none of the other Supreme Court justices joined Justice Barrett’s concurrence might suggest that at least five justices are comfortable with the idea that the MQD is solely substantive rather than partly or entirely linguistic.

So far, the Court has not clearly articulated the substantive basis of this canon: for Justice Gorsuch, the source of normative substance appears to be the nondelegation doctrine; for Chief Justice Roberts, the source is general separation of powers principles. But this lack of clarity about from where the justices are drawing the MQD’s substantive content does not mean that the MQD might eventually come, through an incremental process, to coalesce around some common narrative that would suffice to justify the MQD as a substantive canon alongside the many other substantive canons that our legal system recognizes. Given the growing textualist skepticism of substantive canons, as well as the contestable premises of the nondelegation doctrine and separation of powers, we doubt that such a defense would be uncontroversial,213See Walters, supra note 12, at 521 (discussing the limits of the argument in favor of the MQD as simply another substantive canon). but this is a topic that falls outside the scope of this Article.

ii.  Linguistic but Non-Ordinary Arguments for the Major Questions Doctrine

Second, our studies focus on linguistic defenses that tie themselves explicitly to appeals to the construct of the “ordinary reader.” While we think this focus is defensible, given the larger textualist commitment to the ordinary reader as the anchor for interpretation,214See Barrett, supra note 17, at 2194. it is also possible to defend a linguistic MQD on the grounds that it represents some kind of generalization about how Congress likely intends delegating statutes to be interpreted. The move here is to ground the MQD in what Beau Baumann calls the “descriptive case”: that is, an empirical assertion about the ordinary context of delegating statutes and the way Congress operates when it passes delegating statutes.215Beau J. Baumann, The Major Questions Doctrine Fiction 11–12 (Mar. 14, 2023) (unpublished manuscript) (on file with authors).

Indeed, the Court in West Virginia v. EPA said as much when it cited a “practical understanding of legislative intent” as a basis for the MQD;216West Virginia v. EPA, 142 S. Ct. 2587, 2609 (2022). both Wurman and Justice Barrett nod to this possibility as well.217As Wurman, supra note 8, at 955–56 puts it:

Deliberate ambiguity benefits both parties when it comes to issues that are not sufficiently important as a general matter to scuttle an entire piece of legislation. But whether to tackle climate change through CO2 regulation, or to regulate cigarettes, or to allow a public health agency to prohibit evictions, are probably not the kinds of things legislators leave to strategic ambiguity; they are the kinds of things that one side wins and the other loses.
On Wurman’s account, it makes sense as a linguistic matter to bake this contextual evidence of how Congress treats important questions into our reading of delegating statutes—that is, to interpret ambiguous statutes as not intended to delegate important matters. Justice Barrett’s concurrence in Biden v. Nebraska makes a similar move. After noting that all interpreters seek to “situate[] text in context,” Justice Barrett posits that “[b]ackground legal conventions . . . are part of the statute’s context.”218Biden v. Nebraska, 143 S. Ct. 2355, 2378 (2023) (Barrett, J., concurring). In a principal-agent relationship, “ ‘the context in which the principal and agent interact,’ including their ‘prior dealings,’ industry ‘customs and usages,’ and the ‘nature of the principal’s business or the principal’s personal situation’ ” help form the background legal conventions that govern delegation.219Id. at 2379. From there, Justice Barrett argues that we know from the context of how Congress usually delegates to agencies that Congress is “more likely to have focused upon, and answered, major questions, while leaving interstitial matters [for agencies] to answer themselves in the course of a statute’s daily administration.”220Id. at 2380.

These kinds of arguments based on the “descriptive case” run into persistent empirical problems—namely, there is ample evidence that Congress often does intend to delegate major questions to agencies through vague language, and only weak and contested evidence that Congress does not so intend.221See, e.g., Blake Emerson, “Policy” in the Administrative Procedure Act: Implications for Delegation, Deference, and Democracy, 97 Chi.-Kent L. Rev. 113, 113 (2022); Alison Gocke, Chevron’s Next Chapter: A Fig Leaf for the Nondelegation Doctrine, 55 U.C. Davis L. Rev. 955, 970–71 (2021); Heinzerling, supra note 5, at 1933–34. Both Wurman and Barrett make much of a study of congressional staffers conducted by Abbe Gluck and Lisa Schultz Bressman that found that over 60% of staffers thought that drafters typically intend for Congress, not agencies, to decide important policy questions. See Wurman, supra note 8, at 951, 954–56 (citing Abbe R. Gluck & Lisa Schultz Bressman, Statutory Interpretation from the Inside—An Empirical Study of Congressional Drafting, Delegation, and the Canons: Part I, 65 Stan. L. Rev. 901, 1003–06 (2013)); Biden v. Nebraska, 143 S. Ct. at 2380 (same). However, the Gluck and Bressman study is at best weak support for the proposition that Congress intends to reserve major questions for itself. See Walters, supra note 12, at 533–34; Ronald M. Levin, The Major Questions Doctrine: Unfounded, Unbounded, and Confounded, 112 Calif. L. Rev. (forthcoming 2024) (manuscript at 145–47), https://papers.ssrn.com/sol3/papers.cfm?abstract_id=4304404 [https://perma.cc/W3X3-5GXM]. These kinds of arguments are also in significant tension with textualism, which generally eschews evidence of legislative intent except insofar as it is “objectified” in statutory language. However, given the evidence presented in this Article, these arguments may still be more promising for proponents of the MQD than a linguistic defense premised on ordinary meaning.

On the whole, then, it does not seem like the doors that are left open by our study are ones that would be attractive to the textualist justices who have given us the MQD. But we cannot deny another possibility: that textualism itself may evolve (or dissolve?) in ways that accommodate the MQD on these other grounds. We turn to that topic in the next Section, but before doing that, we would reiterate that the ordinary-meaning defense of the MQD is, by all appearances, a total dead end. Textualists would be hard-pressed to continue to defend the MQD on this theory of the case and this record of decision.

iii.  The Bottom Line

This Section has briefly noted some limitations of the Article. We make no claims about other (non-linguistic) defenses of the MQD. And we are, of course, open to the possibility that some future argument or evidence could rehabilitate the linguistic defense of the MQD.

However, it is important to emphasize that we endorse a firm conclusion about the current state of affairs for the linguistic MQD and textualists’ use of the canon. The two extant linguistic defenses of the MQD depend on empirical claims about specific hypotheticals (for example, the bank case) that are not supported by empirical studies of ordinary Americans. Proponents of the linguistic MQD may offer new, more workable arguments, with different thought experiments, or different empirical support. But until then, there is no basis to employ it as a linguistic canon, and there is now significant evidence counting against core claims of the two publicly stated linguistic arguments.

Second, even for textualist judges with no interest in the linguistic defense, the empirical data about ordinary readers counts against the MQD’s consistency with ordinary language. Given ordinary readers’ understanding of language, there is more evidence in favor of treating the MQD as an anti-linguistic canon than a linguistic canon.222For example, it appears false that people intuitively understand delegations to be limited to the most reasonable set of actions consistent with the language’s literal meaning. With further empirical study, one could imagine refining a canon that captures ordinary judgment about delegation. Most plausible candidates are at odds with the MQD. We discuss this idea further in Section IV.C, infra. And as Justice Kagan remarked, judges who appeal to such non-principles over linguistic interpretation are not really textualists.223West Virginia v. EPA, 142 S. Ct. 2587, 2641 (2022) (Kagan, J., dissenting).

B.  Broader Implications for Modern Textualism

Justice Barrett and Wurman’s arguments have implications for textualism beyond the narrow (but hugely important) issue of whether the MQD is a linguistic canon. Textualism’s claim to distinctiveness centers on a commitment to interpretation according to a text’s linguistic meaning, thereby promoting rule of law values.224See William N. Eskridge, Jr., Brian G. Slocum & Kevin Tobia, Textualism’s Defining Moment, 123 Colum. L. Rev. 1611, 1613 (2023) (explaining how textualism claims to promote the rule of law). Textualism thus abjures judicial discretion to depart from that linguistic meaning.225Tara Leigh Grove, Comment, Which Textualism?, 134 Harv. L. Rev. 265, 269 (2020) (advocating for formalistic textualism). As Justice Scalia emphasized, judges should not exercise an unbounded “personal discretion to do justice.”226See Antonin Scalia, The Rule of Law as a Law of Rules, 56 U. Chi. L. Rev. 1175, 1176 (1989). Instead, judges should be restrained even when some results may have been unanticipated by the legislature.227See John F. Manning, The Absurdity Doctrine, 116 Harv. L. Rev. 2387, 2392–95 (2003).

Justice Barrett’s expansive view of “context,” “common sense,” and non-literal interpretation expands, but also challenges, these foundations of textualism. Justice Barrett admirably argues for a sophisticated version of textualism that rejects literalism and recognizes implied terms.228See supra Section I.C; see also Kevin Tobia, Brian G. Slocum & Victoria Nourse, Progressive Textualism, 110 Geo. L.J. 1437, 1475 (2022) (arguing that textualism should more willingly acknowledge that linguistic meaning can often include implied terms). Even so, existing interpretive canons that recognize implied terms are narrow, and thus do not undermine textualism’s commitment to linguistic meaning.229See supra Section I.C. In contrast, Justice Barrett’s “common sense” interpretive canon is unbounded, granting judges considerable discretion to claim that a wide range of actions fall outside of the text’s meaning (or “reasonable meaning”).

Wurman’s arguments also have implications that threaten to expand, if not unravel, textualism. Recall that Wurman, unlike Justice Barrett, frames the MQD as a tiebreaker canon that resolves statutory ambiguity.230See supra Section I.B. Wurman is correct that the Court has referenced “ambiguity” in MQD cases. This framing of the MQD, however, requires a broad view of ambiguity that would make its determination even more discretionary, and likely more pretextual.

1.  Justice Barrett’s Theory of Non-Literal Interpretation

Justice Barrett’s general appeals to context and non-literal interpretation are consistent with modern textualist scholarship and thinking. Justice Kavanaugh has also repeatedly emphasized the distinction between literal and ordinary meaning and has insisted that courts should avoid overly literalist meanings.231See Bostock v. Clayton County, 140 S. Ct. 1731, 1825 (2020) (Kavanaugh, J., dissenting) (“[C]ourts must follow ordinary meaning, not literal meaning. And courts must adhere to the ordinary meaning of phrases, not just the meaning of the words in a phrase.”). Similarly, John Manning argues that “the literal or dictionary definitions of words will often fail to account for settled nuances or background conventions that qualify the literal meaning of language and, in particular, of legal language.”232Manning, supra note 227, at 2393.

Textualism, though, purports to privilege semantic meaning, thereby giving a relatively limited role to non-literal meanings informed by context and pragmatics. Thus, while Manning endorses some non-literal interpretation, his “background conventions” are narrow ones relevant to the “relevant linguistic community” subject to the law, such as common law criminal defenses.233Id. at 2466–67. Besides these limited examples, according to Manning, judges “have a duty to enforce clearly worded statutes as written, even if there is reason to believe that the text may not perfectly capture the background aims or purposes that inspired their enactment.”234See John F. Manning, Second-Generation Textualism, 98 Calif. L. Rev. 1287, 1290 (2010). Doing so ensures “Congress’s ability to use semantic meaning to express and record its agreed-upon outcomes.”235Id.

A coherent textualism would thus recognize a narrow role for implied terms. Crucially, an implied term must be one that would be obvious to the discourse participants, rather than one imposed by the interpreter for other reasons. An implied term must therefore reflect a presupposition about meaning that is warranted in the circumstances.236See Emiel Krahmer, Presupposition and Anaphora 3 (1998); Alan Cruse, A Glossary of Semantics and Pragmatics 139 (2006) (explaining that presuppositions are a ubiquitous aspect of language).

Statutes are often drafted at a high level of generality, and Justice Barrett is correct that readers of those rules understand that sometimes the rules expressed are not meant to be taken literally in all respects. Crucially though, the relevant existing interpretive canons are implicated in narrow circumstances and provide relatively specific rules for limiting literal meaning.237See Tobia et al., From the Outside, supra note 18, at 281–87 (providing examples of textual canons that narrow literal meaning); Slocum & Tobia, supra note 37, at 75 (providing examples of substantive canons that are also linguistic and which serve to narrow literal meaning). Furthermore, empirical evidence supports these narrow rules as linguistic and thus consistent with how ordinary people interpret legal texts.238See Slocum & Tobia, supra note 37, at 75.

Justice Barrett’s view of implied terms as governed by “common sense” and “context” is similar to Richard Fallon’s approach. Fallon argues that “[o]rdinary principles of conversational interpretation call for us to ascribe a reasonable meaning to prescriptions and other utterances unless something about the context indicates otherwise.”239Richard H. Fallon, Jr., The Meaning of Legal “Meaning” and Its Implications for Theories of Legal Interpretation, 82 U. Chi. L. Rev. 1235, 1260–61 (2015). Fallon reasons that “[i]n ordinary conversation, we do not waste time and breath offering elaborations and qualifications of our utterances that ought to be obvious to any reasonable person.”240Id. at 1261. Instead, a “reasonable person” understands that “[t]he moral reasonableness of a particular ascribed meaning possesses a distinctive importance.”241Id. at 1261–62. Both Fallon and Justice Barrett draw on principles of conversational communication and context, and while Fallon references “reasonable meaning” and Justice Barrett “common sense,” the two are essentially the same idea. In fact, Justice Barrett uses the word “reasonable” in relation to interpretation eleven times in her Biden v. Nebraska opinion (e.g., “reasonable understanding,” “reasonable view,” “reasonable interpreter”).242Biden v. Nebraska, 143 S. Ct. 2355, 2376–84 (2023) (Barrett, J., concurring). Furthermore, her appeal to “common sense” and “reasonable” interpretations has, like Fallon’s view, room for moral and normative beliefs to motivate non-literal interpretations.

The similarities between the interpretive approaches of Justice Barrett and Fallon should be surprising and troubling to textualists. Fallon’s interpretive principle is in furtherance of his decidedly anti-textualist view of interpretation.243See generally Richard H. Fallon, Jr., The Statutory Interpretation Muddle, 114 Nw. U. L. Rev. 269 (2019) (arguing against the idea that statutes have determinate linguistic meanings). Justice Barrett’s principle of “common sense,” guided by “context,” is supposedly in furtherance of textualism, but it raises questions that do not have easy textualist answers. Can the principle always defeat the literal meaning of a statute? How can “common sense” even be defined? Even if “common sense” could be defined, do judges share the same “common sense” as ordinary people, or do judges speak with what Eskridge and Nourse refer to as an “upper-class accent?”244See William N. Eskridge, Jr. & Victoria F. Nourse, Textual Gerrymandering: The Eclipse of Republican Government in an Era of Statutory Populism, 96 N.Y.U. L. Rev. 1718, 1811 (2021). The ability for judges to appeal, with little restraint, to “common sense” and “context,” calls to mind Scalia’s fears about non-textualist judging: “personal discretion to do justice” as the judges saw fit.245Scalia, supra note 226, at 1176.

2.  The Anti-Textualist Broad View of Ambiguity

An additional threat to textualism is posed by a broad view of “ambiguity.” Recall that Wurman argues that the MQD is a linguistic canon that resolves statutory ambiguity.246See supra Section I.B. In support of this claim, Wurman quotes from MQD decisions where the Court argues that the relevant statutes are “ambiguous.”247See Wurman, supra note 8, at 915. This defense of the MQD is unsurprising. Textualism is much more permissive about available arguments and interpretive sources when a provision has been deemed “ambiguous.”

There are two key drawbacks in viewing the MQD as serving a tiebreaking role in resolving ambiguity. First, doing so understates the MQD’s role in the Court’s precedents. The MQD has not merely resolved “ties” between meanings; it has caused the Court to choose meanings it would not otherwise have selected. Second, Wurman’s view requires a definition of ambiguity that should be especially troubling to textualists, and the significance of the issue extends beyond the MQD.

Wurman’s argument raises an essential question: On what basis can a provision be deemed “ambiguous”? Wurman suggests that a provision can be “ambiguous” even when a court can determine the provision’s “best reading.”248Id. Thus, crucially, the question of ambiguity does not require that a provision be indeterminate. In other words, the semantic meaning of the provision’s terms could be clear (even if broad) but still “ambiguous,” based on non-textual considerations like the novelty and importance of an agency’s actions.

Use of the “ambiguity” label often obscures rather than clarifies linguistic issues. Specifically, it glosses over the distinctive linguistic features of the prototypical statute involved in MQD cases, which is a statute with broad but semantically clear terms. These features—broad but semantically clear—should represent for textualists a prima facie case against the MQD. After all, textualists assert that courts should focus on the semantic meaning of statutes.

Outside of MQD cases, some textualists have recognized the potential dangers associated with a judicial focus on “ambiguity.” Most significantly, Justice Kavanaugh has criticized “ambiguity” as an interpretive doctrine because its identification is standardless and subjective.249See Brett M. Kavanaugh, Fixing Statutory Interpretation, 129 Harv. L. Rev. 2118, 2121 (2016) (reviewing Robert A. Katzmann, Judging Statutes (2014)). Its discretionary identification and legitimizing power, however, make “ambiguity” an especially attractive interpretive tool for judges. “Ambiguity” is extremely useful because it gives a court cover to interpret a statute narrowly or broadly on the basis of normative concerns. For instance, an explicit announcement of ambiguity allowed the Court in King v. Burwell to “avoid the type of calamitous result that Congress plainly meant to avoid” and gave it justification for “interpret[ing] the Act in a way that” improves health insurance markets and does not destroy them.250King v. Burwell, 576 U.S. 473, 498 (2015).

“Ambiguity’s” legitimizing power explains why the Court in MQD (and other) cases is motivated to label a provision as “ambiguous” without much consideration about whether it is applying a coherent definition of ambiguity. It may be activist to interpret a clear statute narrowly because doing so would be in tension with the provision’s linguistic meaning. In contrast, resolving statutory “ambiguity” is necessary to decide the interpretive dispute, and choosing the narrower interpretation does not conflict with the provision’s linguistic meaning. Thus, if a provision is problematically broad, labeling it as “ambiguous” does not require the Court to explicitly reject its literal meaning.

If a provision can be “ambiguous” even when a court can nevertheless determine its “best reading,” “ambiguity” would mean something like “any uncertainty about the meaning of a provision.” But this sort of definition would make ambiguity ubiquitous and is inconsistent with how it is used in Chevron and other tiebreaker canons like the rule of lenity.251See Eskridge, Slocum & Tobia, supra note 224, at 1656 (discussing how the Court’s textualists determine ambiguity). If instead “ambiguity” means that a provision must actually be indeterminate, there is no “best reading” of a provision, but merely possible competing meanings.

The question of ambiguity thus hinges on whether “ambiguity” is synonymous with “indeterminacy.” Even if the terms are synonymous, framing the MQD in terms of “ambiguity” should be unappealing to textualists. The MQD would still be a matter of judgment that depends on how one weighs semantic and pragmatic evidence. In other words, a combination of meaning and context makes a provision clear or, conversely, ambiguous. Univocal semantics and univocal pragmatics may uncontroversially result in a clear provision, and multivocal semantics and multivocal pragmatics in an ambiguous provision, but other combinations are contestable and subject to normative resolution via highly discretionary judgments.

The choice is thus between a narrow definition of “ambiguity” that would require the semantic meaning of the statutory text be indeterminate in some way, and a broad definition that would allow even semantically clear language to be deemed “ambiguous” based on non-language concerns like statutory purpose. Justice Scalia argued that the broad view of ambiguity is “judge-empowering” and mocked the idea that “[w]hatever has improbably broad, deeply serious, and apparently unnecessary consequences . . . is ambiguous!”252Bond v. United States, 572 U.S. 844, 870 (2014) (Scalia, J., concurring in the judgement). A broad definition of ambiguity would allow the label to be used at any time by emphasizing any number of pragmatic considerations, such as the problematically broad semantic meaning of terms or the “novelty” of an agency’s interpretation. If instead, as Justice Scalia argues, pragmatic evidence can only clarify semantically indeterminate text, ambiguity would therefore require indeterminate semantic meaning and be a narrower, less discretionary doctrine.253See id.

Textualists in MQD cases should be honest about their use of “ambiguity.” If they use the term broadly, they should explain why Justice Scalia’s critique of the broad definition is mistaken. If they instead agree with Justice Scalia, the MQD cases involving clear (but broad) semantic meaning should thus be viewed by textualists as similar to situations not involving ambiguity. In such cases, if the Court wishes to narrow the literal meaning of the language, it should state so explicitly, giving reasons for why such narrowing is consistent with the judicial function.

C.  Broader Implications for Administrative Law

This Article has taken textualists’ defenses of the MQD at face value. But some harbor a more realist or critical take on the MQD. Perhaps the MQD is animated by neither constitutional values nor language, but rather by the aim of limiting the administrative state’s power. And perhaps leaving questions about the MQD’s legitimacy unresolved allows strategic ambiguity, which is better for this purpose.254See Sohoni, supra note 6, at 266; see also Patrick J. Sobkowski, Of Major Questions and Nondelegation, Yale J. on Reg.: Notice & Comment (July 3, 2023), https://www.yalejreg.com/nc/of-major-questions-and-nondelegation-by-patrick-j-sobkowski [https://perma.cc/23GL-D2G6] (noting that the MQD is currently marked by “strategic ambiguity” that “allows the Justices to strike down or uphold policies without being criticized by other actors for judicial activism and aggrandizement”). Some go even further to argue that the justices are engaged in a form of constitutional hardball, seeking to aggrandize themselves vis-à-vis the other branches of government.255See, e.g., Josh Chafetz, The New Judicial Power Grab, 67 St. Louis U. L.J. 635, 635 (2023); Allen C. Sumrall & Beau J. Baumann, Clarifying Judicial Aggrandizement, 172 U. Pa. L. Rev. Online 24, 24 (2023); Mark A. Lemley, The Imperial Supreme Court, 136 Harv. L. Rev. F. 97, 97 (2022). It is certainly difficult to overlook the hostility that many of the justices express toward modern administrative government and the legislative acts that authorized it.256See generally Beau J. Baumann, Americana Administrative Law, 111 Geo. L.J. 465 (2023) (discussing examples including National Federation of Independent Businesses v. Department of Labor, Occupational Safety and Health Administration, 142 S. Ct. 661, 669 (Gorsuch, J., concurring); Gundy v. United States, 139 S. Ct. 2116, 2134–35 (2019) (Gorsuch, J., dissenting); and City of Arlington v. FCC, 569 U.S. 290, 327 (2013) (Roberts, C.J., dissenting)).

Yet, turning our attention away from the Supreme Court and toward the broader legal community, our findings about how ordinary people understand delegations of authority have significant implications for administrative law well beyond the MQD. While we acknowledge that there are good reasons to be skeptical about outsourcing questions of administrative law to laypeople, insofar as textualist principles animate the statutory interpretation questions at the heart of administrative law, it is worth asking where ordinary people’s intuitions lead.257Indeed, an emerging literature does just this, often using survey experiments to investigate questions important to administrative law and the administrative state. See generally Brian D. Feinstein, Legitimizing Agencies, 91 U. Chi. L. Rev. 919 (2024); Edward Stiglitz, The Reasoning State (2022). Below, we highlight a couple takeaways from this exercise. An irony of textualist’s turn to “ordinary people” to support the MQD may be that it actually supports a significantly cabined judicial role in controlling delegation of authority to the administrative state. Far from endorsing a kind of “libertarian administrative law” that treats delegations of authority to administrative agencies with suspicion and seeks almost perfunctorily to narrow them,258See Sunstein & Vermeule, supra note 197, at 410. ordinary people appear to take general ordinary delegations to license a range of reasonable actions.

To be sure, we considered ordinary judgments of an ordinary, private delegation (that is, the babysitter), but critics of the administrative state have made that ordinary context relevant by insisting that general principles of private agency and/or ordinary delegations law should inform public law delegation.259Biden v. Nebraska, 143 S. Ct. 2355, 2379–80 (2023) (Barrett, J., concurring); Phillip Hamburger, Is Administrative Law Unlawful? 386 (2014); Gary Lawson & Guy Seidman, “A Great Power of Attorney”: Understanding the Fiduciary Constitution 104 (2017). We are also skeptical that there is an easy way to study the “ordinary person’s” view of specific cases. As prior research has shown, interpreters’ values affect their interpretation.260Ward Farnsworth, Dustin F. Guzior & Anup Malani, Ambiguity About Ambiguity: An Empirical Inquiry into Legal Interpretation, 2 J. Legal Analysis 257, 259 (2010). Asking ordinary people whether the EPA has authority to issue broad climate change regulations under the Clean Air Act is likely to tell us more about people’s values and politics than their understanding of language. Thus, the implications we spell out depend on the validity of this ordinary analogy—the one made by the linguistic MQD’s defenders (recall the “high stakes” appeal to the ordinary bank case and the “common sense” appeal to the ordinary babysitter case).

To start, our study of the babysitter hypothetical revealed a surprising result about what ordinary people would think of the amusement park hypothetical. Taking the children to the amusement park might not be the most reasonable response to the instruction to “use this credit card to make sure the kids have fun this weekend,” but it certainly does not violate it (after all, an amusement park is “fun”). The study also revealed that the vast majority of ordinary people believe that the parent’s instruction extends to the even more unusual action of bringing a live alligator to the house. This surprising finding suggests that people do not limit delegations to only the most reasonable actions or the ones most consistent with the rule’s purpose.

Ordinary readers approached the limits of broad delegations through a textual and purposive lens. Compared with the amusement park, alligator, and movie scenarios, respondents were far more likely to say that the babysitter violated the instruction when the babysitter failed to achieve the purpose of the instruction (as in the case of not using the credit card and potentially shortchanging the children’s fun) and when the babysitter actively undermined it (by using the credit card for the babysitter’s own enjoyment). This finding is difficult to understand unless ordinary readers understand delegations in large part as remedial—that is, as seeking to empower the agent to solve a problem or achieve some goal—rather than exclusively delimiting—that is, as setting out the scope of the agent’s power.261This explanation is largely consistent with Brian Feinstein’s discovery that ordinary people are prompted to increase their trust in government when they believe it is being undertaken by an agent with expertise to fulfill social functions. See Feinstein, supra note 257, at 919. In both Feinstein’s studies and ours, delegations are understood by ordinary people to be about problem solving.

The modern textualist commitment to ordinary people’s understanding as a basis for interpretation262See, e.g., Barrett, supra note 17, at 2194. and linguistic canons263See, e.g., Wurman, supra note 8, at 909. opens the door to uncovering a linguistic basis for other canons, including new canons.264See Tobia et al., From the Outside, supra note 18, at 288–90. As a hypothetical, imagine if a textualist were to carefully consider evidence about ordinary people’s understanding of delegating language (e.g., in the babysitter case) and attempt to “canonize” those intuitions into administrative law doctrine. The result would probably be a fundamental recalibration of the field—but not in the way the MQD imagines. Were one to follow the evidence, it seems to instead support canonizing a sort of “counter-MQD” that presumes that general delegations should be interpreted broadly (or at least not as restrictively as Justice Barrett’s argument claims), significantly curtailing judicial power to limit Congress’s attempts to empower administrative agencies.

In addition, and relatedly, our findings are in some tension with administrative law’s traditional approach to questions of underimplementation of statutory delegations. A variety of administrative law doctrines insulate agency discretion to decline to enforce the law: for instance, Heckler v. Chaney provides that agency nonenforcement decisions are almost never reviewable by courts,265Heckler v. Chaney, 470 U.S. 821, 821 (1985). and Norton v. Southern Utah Wilderness Alliance makes it impossible for challengers to force agency action unless they can point to a discrete duty (rather than a more general failure to pursue broad policy goals of a statute).266Norton v. S. Utah Wilderness All., 542 U.S. 55, 55 (2004). These doctrines insulate agency underuse of delegated regulatory authority from judicial scrutiny. Yet our findings suggest that ordinary readers may be more troubled by delegated authority’s underuse than uses that fit with the language but exceed an observer’s sense of reasonableness.267Both using the credit card for only the babysitter’s needs (“misuse”) and bringing an alligator to the house for entertainment (“extreme”) were judged as “unreasonable,” while failing to use the card and entertaining the children with card games (“minor”) was judged as “reasonable.” But rule violation judgments did not rise and fall with these evaluations of reasonableness. The extreme action was more consistent with the rule than the minor action, and both were more consistent than the misuse action. On the flip side, when agencies do take action pursuant to their delegations, judges often artificially narrow those delegations.268Cass R. Sunstein, Nondelegation Canons, 67 U. Chi. L. Rev. 315, 315–16 (2000). Canons that might theoretically push in the opposite direction—toward liberally construing “remedial” statutes, for instance—have fallen into disrepute.269Dir., Off. of Workers’ Comp. Programs, Dep’t of Lab. v. Newport News Shipbuilding & Dry Dock Co., 514 U.S. 122, 135 (1995) (noting that the remedial canon is the “last redoubt of losing causes”). This basic asymmetry in the treatment of delegations to agencies—deep skepticism of exercises of delegated authority coupled with indifference toward failures to exercise delegated authority at all270See Daniel E. Walters, Symmetry’s Mandate: Constraining the Politicization of American Administrative Law, 119 Mich. L. Rev. 455, 455–56 (2020).—may be exactly backwards if ordinary people’s intuitions are to be the guide.

Again, we do not endorse any particular changes to administrative law here. There are many good reasons, such as the institutional constraints under which agencies operate, to disfavor outsourcing administrative law into ordinary people’s linguistic or legal intuitions (whatever those may be).271Eric Biber, The Importance of Resource Allocation in Administrative Law, 60 Admin. L. Rev. 1, 1–2 (2008). There are also many countervailing concerns, such as fair notice and due process, that may justify curtailing expansive ordinary readings of delegating statutes.272See, e.g., Jennifer Lee Koh, Crimmigration and the Void for Vagueness Doctrine, 2016 Wis. L. Rev. 1127, 1153–59 (2016). But we also believe that for those inclined to remake administrative law through the eyes of the ordinary reader, it is worth grappling with facts rather than judicial hypotheticals about those ordinary readers. People are far more comfortable with broader interpretation of general-language delegations than many textualists have assumed, and they appear to be disproportionately uncomfortable with violations through underuse of delegated authority.

CONCLUSION

The MQD is the most influential interpretive development at the modern Supreme Court.273See supra notes 2–4 and accompanying text. Yet it lacks a compelling theoretical basis and a satisfactory explanation of its consistency with textualism, the interpretive theory held by the MQD’s advocates. The new “linguistic MQD” purports to solve both problems: because the MQD reflects ordinary understanding of language, it is a valid linguistic canon and thus consistent with textualism.

This Article has taken this linguistic defense on its own terms and studied the two central ordinary examples offered by its advocates. We find that ordinary people do not understand language as textualists have assumed. High stakes do not undermine knowledge or impact textual clarity, and people do not understand general delegations to be limited to only the most reasonable set of actions. These results challenge the essential empirical claims at the heart of the arguments for the linguistic MQD. While scholarly debate should continue, judges must take stock of the evidence and decide whether to employ the canon—and whether to do so in the name of linguistics and ordinary people. In our view, there is insufficient empirical support and theoretical clarity to cast the MQD as a valid linguistic canon. Arguably, the linguistic defense is the only viable theory for textualists to consistently employ the MQD. Thus, unless they offer a successful alternative, the results here support the broader conclusion that consistent textualists should not employ the MQD.

97 S. Cal. L. Rev. 1153

Download

* Associate Professor of Law, Georgetown University Law Center.

† Associate Professor of Law, Texas A&M University School of Law.

‡ Stearns Weaver Miller Professor, Florida State University College of Law. For helpful comments and/or discussion, we thank Cary Coglianese, Anuj Desai, Ryan Doerfler, Rebecca Kysar, Edouard Machery, Ángel Pinillos, Larry Solum, Ilya Somin, Ilan Wurman, and audiences at the American Association of Law Schools Annual Conference, and at Cornell, Georgetown, NYU, and USC law schools. Thanks to the Southern California Law Review for excellent editorial assistance. We also thank Kirsten Worden and Michael Cooper for their research assistance.

Poking a Sleeping Bear: Cultural Landscapes in the 1906 Antiquities Act

The American Antiquities Act of 1906 permits a president to designate “objects of historic and scientific interest”—and the federal lands associated with them—as national monuments. The Act is foundational cultural heritage preservation legislation and has been used by presidents for over a century to protect everything from burial grounds to marine landscapes. Through an overview of the statute and its history, this Note argues that a proper reading of the Antiquities Act includes cultural landscapes, or networks of natural and constructed places that people interact with and add meaning to. Indigenous communities, archaeologists, and heritage professionals have long recognized cultural landscapes. I survey recent monument declarations that explicitly protect Indigenous cultural landscapes and provide for Indigenous co-management and argue that the Antiquities Act is a helpful tool for Tribes to protect culturally significant areas. One such cultural landscape is Bears Ears National Monument, located in Southeastern Utah. Bears Ears—and the Antiquities Act—is currently under fire from opponents that wish to limit the scope of the President’s declaration authority. Given over a century of use of and challenge to the Act, I argue that Bears Ears is well within the purview of the Act and authority it grants.

INTRODUCTION

In southeastern Utah, a bear’s stone ears rise from the horizon. The twin mesas crown a pristine piece of land that suggests an “earlier eon.”1Proclamation No. 9558, 3 C.F.R. § 9558 (2017). Red sandstone spires soar to cloudless blue skies. The landscape is absent any man-made sound; at night, our galaxy illuminates near-absolute darkness. Canyons cut by the San Juan River trace the terrain. The area features distinct high desert and lowland microclimates, innumerable endemic species of flora and fauna, and some of the most robust geological and paleontological resources in North America, including one of the best “continuous rock records of the Triassic-Jurassic transition” in the world.2Id. This is Bears Ears National Monument: an extraordinary natural landscape in every way imaginable.

Bears Ears occupies an important place in the collective consciousness of the region’s Indigenous communities. It is a landscape covered with pottery shards, rock art, burial sites, and stone dwellings built into the cliffsides, left by ancestral Puebloans, Fremont, and other “precursor societies” to—and ancestors of—the region’s Indigenous peoples and modern Tribal communities.3Id.

However, Bears Ears holds cultural value beyond these natural features or material objects: the landscape itself is directly connected to Native history and heritage. “It is not easy to explain the meaning of Bears Ears to non-Native people,” explains Woody Lee, Executive Director of Utah Diné Bikéyah, an all-Native American nonprofit that developed a proposal to designate Bears Ears a national monument.4Gavin Noyes, Utah Diné Bikéyah: Celebrating the Ten Year Healing Journey of Bears Ears 1 (2022), https://utahdinebikeyah.org/wp-content/uploads/2022/01/UDB-Celebrating-10-Years-EC.pdf [https://perma.cc/S927-QE2L]. To Indigenous communities, “Bears Ears is a ‘who,’ not a ‘what.’ ”5Id. In the languages of all people who have ancestral connections to the land, she is known as “Bears Ears”: Hoon’Naqvut, Shash Jáa, Kwiyagatu Nukavachi, and Ansh An Lashokdiwe.63 C.F.R. § 9558 (2017). Bears Ears is a place where “culture, language, and religion were born.”7Noyes, supra note 4, at 1. The area includes the place the Diné (Navajo) emerged from the earth, and where “epic battles and the fate of humankind [were] determined”8Id.—lofty spires that touch the sky are Diné warriors turned to stone.93 C.F.R. § 9558. Cliffs scored by the San Juan River tell stories of creation and healing.10Id. Cultural and spiritual traditions of many distinct communities continue at Bears Ears, as they have for millennia: Native peoples continue “hunting, fishing, gathering, and wood cutting;” collect “medicinal and ceremonial plants, edible herbs, and materials for crafting items like baskets and footwear;”11Id. and conduct ceremony.12Noyes, supra note 4. Bears Ears is a landscape of living cultural heritage that continues to shape Native history, identity, and expression, and tells the stories of ancestors. She is, therefore, not only a natural landscape, but also a cultural landscape.

In the early 2000s, mining and drilling activities were on the horizon in Southeastern Utah.13See Bears Ears Inter-Tribal Coalition, Proposal to President Barack Obama for the Creation of Bears Ears National Monument 34–35 (2015) [hereinafter Bears Ears Proposal], https://www.bearsearscoalition.org/wp-content/uploads/2015/10/Bears-Ears-Inter-Tribal-Coalition-Proposal-10-15-15.pdf [https://perma.cc/4G8H-A4YU]. After the approval of a state public lands bill, Washington County disposed of previously protected lands for real estate development.14Id.; Suzanne Struglinski & Nancy Perkins, Bennett Pushes Southern Utah Land Bill, Deseret News (July 12, 2006, 9:25 AM), https://www.deseret.com/2006/7/12/19963273/bennett-pushes-southern-utah-land-bill [https://perma.cc/VYB4-BM63]. Then, in 2009, a bombshell hit San Juan County. Sixteen residents of the small town of Blanding were arrested for looting Native objects from nearby public lands, including Bears Ears, and selling them. A raid of their homes that followed—the result of a two-year sting operation—uncovered more than 40,000 archaeological objects.15Howard Berkes, Artifacts Sting Stuns Utah Town, Nat’l Pub. Radio (July 1, 2009, 12:34 AM), https://www.npr.org/2009/07/01/106091937/artifacts-sting-stuns-utah-town [https://perma.cc/P7XQ-46ME]; Kyle Swenson, Pilfered Artifacts, Three Suicides and the Struggle Over Federal Land in Utah, Wash. Post (Dec. 5, 2017, 6:53 AM), https://www.washingtonpost.com/news/morning-mix/wp/2017/12/05/pilfered-artifacts-three-suicides-and-the-struggle-over-federal-land-in-utah [https://perma.cc/6LGU-K4MJ]; Noyes, supra note 4. In response, a coalition of sovereign Tribal nations, including the Navajo Nation, Hopi Tribe, Uintah and Ouray Ute Tribe, Ute Mountain Ute Tribe, and Pueblo of Zuni—the Bears Ears Inter-Tribal Coalition—joined together, petitioning President Obama and Congress to designate the landscape a national monument.16Bears Ears Proposal, supra note 13. The Inter-Tribal Coalition’s petition marked the first time that any Tribe, let alone five Tribes in a political alliance, had asked the federal government to designate a national monument.17Press Release, Bears Ears Inter-Tribal Coalition, Five Tribes Formally Petition President Obama and Congress to Create Tribally Co-Managed Bears Ears National Monument in Utah (Oct. 15, 2015), bearsearscoalition.org/five-tribes-formally-petition-president-obama-and-congress-to-create-tribally-co-managed-bears-ears-national-monument-in-utah [https://perma.cc/W653-5QAQ].

The concerns of environmentalists, to this day, regularly take the main stage when advocating for public lands protections, designations, and management strategies.18See Jedediah Purdy, Environmentalism’s Racist History, The New Yorker (Aug. 13, 2015), newyorker.com/news/news-desk/environmentalisms-racist-history [https://perma.cc/QNX2-4EGJ]; Association for Environmental Studies and Sciences & Antioch University, Race and the Environmental Movement: History and Legacies, YouTube (June 4, 2020), https://youtu.be/L8PIQVbJBE8 [https://perma.cc/42J8-CKB2]. Two mistaken assumptions should be avoided: (1) that individual, sovereign Tribes are a monolith with the same interests when Tribal political interests and cultures differ substantially and (2) that Tribal and “environmental” interests are one and the same, when they may align, but often conflict. See Lauren Sloss, Clean Energy, Cherished Waters and a Sacred California Rock Caught in the Middle, N.Y. Times (Oct. 24, 2023), https://www.nytimes.com/2023/10/24/travel/chumash-marine-sanctuary-morro-bay-california.html [https://perma.cc/897P-DEKS] (providing an example of a Tribal-led collaborative land management model thrown off-course by a renewable energy project); Morgan Conley, Tribe Says FERC Ignoring Dam Project’s Cultural Site Impacts, Law360 (Aug. 13, 2021, 8:02 PM), https://www.law360.com/articles/1412713 [https://perma.cc/55V8-YTPE] (demonstrating how projects with environmental advantages may ignore cultural site impacts). Environmentalism, however, is movement historically fostered by white men who championed “untouched,” exclusionary wilderness and frequently removed Native communities from their ancestral lands.19See Purdy, supra note 18; see also Ward v. Race Horse, 163 U.S. 504, 510 (1896) (stating that legislation establishing Yellowstone National Park was the legal foundation for efforts to keep Native peoples off public lands), abrogated by Herrera v. Wyoming, 139 S. Ct. 1686 (2019). By contrast, Tribal concerns and the interests of Native peoples led the way for the protection of Bears Ears.

The efforts of the Bears Ears Inter-Tribal Coalition were successful. In December 2016, President Obama designated Bears Ears National Monument, exercising executive authority granted under the American Antiquities Act of 1906.20Proclamation No. 9558, 3 C.F.R. § 9558 (2017). The Antiquities Act is foundational cultural heritage legislation in the United States, used 291 times by Presidents to establish, expand, and redesignate over 100 active national monuments that contain exemplary “objects of historic or scientific interest.”2154 U.S.C. § 320301. This figure does not reflect uses of the Act to diminish or reduce monuments or uses of the Act by Congress; in total, Antiquities Act authority has been utilized 291 times. This figure was last verified by author as of January 13, 2024. National Monument Facts and Figures, Nat’l Park Serv. (Feb. 8, 2024) [hereinafter Monument Data], https://www.nps.gov/subjects/archeology/national-monument-facts-and-figures.htm [https://perma.cc/8JXL-KJ4K]. Unlike other legislation in the United States’ cultural heritage preservation regime, the Antiquities Act protects not only antiquities themselves, but also reserves the lands around them, including historic sites, landmarks, and landscapes. The Bears Ears National Monument designation restricted opportunities for commercial activity on and under the parcel, including future mining and drilling, and established protections for its heritage sites and objects.22John C. Ruple, Robert B. Keiter & Andrew Ognibene, National Monuments and National Conservation Areas: A Comparison in Light of the Bears Ears Proposal 12–13 (2016), https://papers.ssrn.com/sol3/papers.cfm?abstract_id=2836986 [https://perma.cc/EKJ4-WBFS]; Brent J. Hartman, Extending the Scope of the Antiquities Act, 32 Pub. Land & Res. L. Rev. 153, 161 (2011); see Bears Ears Proposal, supra note 13, at 34–35. Among others, these protections provide for Tribal consultation and co-management; limit access to areas otherwise open to off-road vehicles; direct and designate foot traffic; and include funding for conservation efforts, enforcement, and visitor education.23Ruple et al., supra note 22, at 3, 12–13.

Protection of Bears Ears, however, did not last. One year later, she returned to the national rostrum. Protests erupted at the Utah State Capitol against then-President Trump’s proposal to rescind the prior administration’s reservation of both Bears Ears National Monument and her sister monument, Grand Staircase-Escalante National Monument,24Benjamin Wood, Monument Supporters Rally Against Trump’s Plans to Shrink Bear Ears, Grand Staircase-Escalante, Salt Lake Trib. (Dec. 5, 2017, 4:16 PM), https://www.sltrib.com/news/2017/12/02/monument-supporters-rally-against-trumps-plans-to-shrink-bear-ears-grand-staircase-escalante [https://perma.cc/5VDU-QHLC]. designated by President Clinton in 1996.25Proclamation No. 6920, 3 C.F.R § 6920 (1997). In December 2017, President Trump slashed the acreage of Bears Ears by eighty-five percent and Grand Staircase-Escalante by fifty percent.26Julie Turkewitz, Trump Slashes Size of Bears Ears and Grand Staircase Monuments, N.Y. Times (Dec. 4, 2017), https://www.nytimes.com/2017/12/04/us/trump-bears-ears.html [https://perma.cc/Z2EM-AFKA]; Proclamation No. 9681, 3 C.F.R § 9681 (2018). President Trump’s cuts to the monument eliminated key cultural areas within the Bears Ears and broader Cedar Mesa region, reducing the area protected to separate, distinct, and “non-contiguous parcels.”273 C.F.R § 9681; see also Archaeological Organizations’ Brief as Amici Curiae in Support of Plaintiffs at 15, Hopi Tribe v. Trump, No. 17-cv-02590, 2019 U.S. Dist. LEXIS 106244 (D.D.C. Mar. 20, 2019) [hereinafter Archaeological Amicus Brief].

The Bears Ears National Monument size reduction triggered a chain of three lawsuits against the Trump Administration—led by the five Coalition Tribes, joined by environmental groups including the Natural Resources Defense Council, and supported as amici by professional archaeological organizations.28See Archaeological Amicus Brief, supra note 27. These federal lawsuits, later consolidated, challenged President Trump’s use of the Antiquities Act to reduce national monument parcel reservations and revoke national monument status.29Two lawsuits specifically challenging the modification of Grand Staircase-Escalante were also filed. Courtney Tanner, Here’s a Breakdown of the 5 Lawsuits Filed Against Trump That Challenge His Cuts to 2 Utah National Monuments, Salt Lake Trib. (Dec. 10, 2017, 6:12 PM), https://www.sltrib.com/news/politics/2017/12/11/heres-a-breakdown-of-the-5-lawsuits-filed-against-trump-challenging-his-cuts-to-two-utah-national-monuments [https://perma.cc/Y5A7-CKLK]; Complaint for Injunctive & Declaratory Relief at 1, Hopi Tribe v. Trump, No. 17-cv-02590, 2019 U.S. Dist. LEXIS 106244 (D.D.C. Mar. 20, 2019); Complaint for Injunctive & Declaratory Relief at 3–4, Nat. Res. Def. Council v. Trump, No. 17-cv-02606 (D.D.C. Dec. 7, 2017).

On October 8, 2021, President Biden reestablished the Obama-era boundaries of Bears Ears and Grand Staircase-Escalante National Monuments by presidential proclamation, effectively ending the pending litigation.30Proclamation No. 10285, 3 C.F.R. § 10285 (2022). After a motion to stay in the lead case, the consolidated cases were administratively closed. Order, Hopi v. Trump, No 17-cv-2590 (D.D.C. Mar. 8, 2021). This, however, was not the ultimate salvation of Bears Ears. In August 2022, the state of Utah, joined by Garfield and Kane counties, filed a lawsuit in Utah District Court, once again attempting to diminish protection of Bears Ears.31Complaint for Declaratory & Injunctive Relief at 51, Garfield Cnty v. Biden, No. 22-cv-00059, 2023 U.S. Dist. LEXIS 142044 (D. Utah Aug. 11, 2023) [hereinafter Utah Complaint]. Garfield and Kane Counties are those adjacent to (to the west of) Bears Ears National Monument. Notably, the state of Utah was not joined by San Juan County, the county that encompasses Bears Ears. Id. at 1. Dismissed by the Utah District Court, the case is now on appeal before the Tenth Circuit and set for oral argument in September 2024.32Cassidy Wixom, Utah Leaders Appeal Dismissal of Bears Ears, Grand Staircase-Escalante Monuments Lawsuit, KSL.com (Nov. 1, 2023, 6:21 AM), https://www.ksl.com/article/50771434/utah-leaders-appeal-dismissal-of-bears-ears-grand-staircase-escalante-monuments-lawsuit [https://perma.cc/C8CG-X7DM]. Unlike prior litigation, in which the primary issue was executive authority to reduce a designated monument, the state of Utah presents a more fundamental challenge in the case at bar: the scope of the 1906 Antiquities Act itself, a President’s authority to designate national monuments thereunder, and what “objects of historic and scientific interest” may be protected.33Utah Complaint, supra note 31, at 51–52. The state of Utah intends to undermine the Antiquities Act in its entirety, accepting Chief Justice Roberts’s recent invitation to challenge the Antiquities Act on its merits in the Supreme Court.34Statement of Chief Justice Roberts Respecting the Denial of Certiorari at 1, Mass. Lobstermen’s Ass’n v. Raimondo, 141 S. Ct. 979 (Mar. 22, 2021); see Jeff Parrott & Jacob Scholl, Federal Judge Tosses Utah Lawsuit Seeking to Shrink Bears Ears and Grand Staircase-Escalante Monuments, Salt Lake Trib. (Aug. 11, 2023, 3:20 PM), https://www.sltrib.com/news/politics/2023/08/11/federal-judge-tosses-utah-lawsuit [https://perma.cc/SV3T-2LGF] (“ ‘We will appeal the dismissal in order to stand up against President Biden’s egregious abuse of the Antiquities Act.’ ”).

Much has been written about Bears Ears. The emphasis typically rests on conservation and environmental principles, separation of powers issues, and inherent executive authority.35Prior modifications and executive authority to revoke or modify national monuments are beyond the scope of this Note. See generally John C. Ruple, The Trump Administration and Lessons Not Learned from Prior National Monument Modifications, 43 Harv. Env’t L. Rev. 1 (2019). What has been lost along the way is the purpose of the Antiquities Act—heritage preservation law. This Note proposes a more faithful interpretation of the language of the Antiquities Act: that the Act was intended to and has served to protect cultural landscapes such as Bears Ears. The Act should be characterized, as legislation intended, to encompass not only discrete objects and sites, but also the archaeological, natural, and cultural heritage landscapes that inform them. A reading of the statute in its entirety—historical context, ordinary meaning, drafting history, past practice, management obligations, and existing case law—support this interpretation and a conclusion that the scope and size of Bears Ears National Monument is proper and within the boundaries of executive authority granted by the Act. A cultural landscape-based interpretation of the Antiquities Act should be applied in the state of Utah’s lawsuit and presents a unique opportunity for Tribes to utilize existing legislation to protect and manage culturally significant lands.

Part I describes the purpose, fit, and distinct characteristics of the Antiquities Act, defining the term “cultural landscape” and discussing historical context, drafting considerations, and other heritage protection mechanisms in the United States.

Part II discusses past use of the Act and foundational, unsuccessful challenges in the more than 100 years since its enactment.

Part III applies a cultural landscape-based understanding of the Act and principles derived from the relevant case law to the state of Utah’s pending lawsuit, demonstrating that Bears Ears National Monument—a fundamental example of a cultural landscape—is well within the confines of presidential proclamation authority.

Finally, Part IV explains how a cultural landscape perspective presents the Act as an attractive, feasible heritage protection mechanism for Tribes, and recent instances in which Tribes have, fittingly, made use of it.

I.  HISTORY AND PURPOSE OF THE ANTIQUITIES ACT: THE DAWN OF MODERN ARCHAEOLOGY

A.  Historical Context and Drafting Considerations

The American Antiquities Act of 1906 was signed into law by President Theodore Roosevelt on June 8, 1906. In relevant part, it reads:

[T]he President of the United States is hereby authorized, in his discretion, to declare by public proclamation historic landmarks, historic and prehistoric structures, and other objects of historic or scientific interest that are situated upon the lands owned or controlled by the Government of the United States to be national monuments, and may reserve as a part thereof parcels of land, the limits of which in all cases shall be confined to the smallest area compatible with the proper care and management of the objects to be protected.3654 U.S.C. § 320301.

The Antiquities Act was conceived as cultural heritage legislation. It is widely known as the first United States law to provide legal protection for cultural resources on federal lands and has been described as “the nation’s first archaeological preservation law.”37Archaeological Amicus Brief, supra note 27, at 12. See generally Ronald F. Lee, Nat’l Park Serv., The Antiquities Act of 1906 (1970), http://npshistory.com/publications/antiquities-act-1906.pdf [https://perma.cc/V2EN-K5SF] (describing the historical context of American archaeology and the efforts of the American Archaeological Association and Archaeological Institute of America, joined by the Smithsonian Institution, to draft and advocate for passage of the Antiquities Act). The central motive for its enaction was the preservation of archaeological sites. In the late nineteenth and early twentieth centuries, movement to the Western United States and general interest in the history and archeology of the Southwest—including a growing public mythos surrounding Indigenous communities—led to substantial demand for artifacts.38Archaeological Amicus Brief, supra note 27, at 12. A resulting increase in vandalism of archeological sites and historic structures, in addition to looting of potsherds, arrowheads, and other archeological and cultural resources, began to concern the Archaeological Institute of America (“AIA”),39Id. along with other archaeological societies that had begun exploring and recording the American Southwest in the 1880s.40Id. The AIA is the oldest professional archeological organization in North America, founded in 1879 for the purpose of “furthering and directing archeological and artistic investigation and research.”41History: Our Story, Archaeological Inst. Am., https://www.archaeological.org/about/history [https://perma.cc/RM76-EYHX].

Archaeological practice itself underwent radical change in the late nineteenth century. “Gentlemen” tomb raiders had made way for a growing canon of archaeological practice that emphasized recording obligations, systematic scientific analysis and conclusions, and greater cultural context, including recognition of the ethics surrounding archaeological practice and its relationship with existing peoples.42See Dennis Harding, Rewriting History: Changing Perceptions of the Past 17 (2019). To this day, archaeologists work to repair the trust their predecessors broke and to further develop strong standards of archaeological ethics. See Lynn Meskell, The Intersections of Identity and Politics in Archaeology, 31 Ann. Rev. Anthropology 279, 279 (2002) (discussing a growing recognition of the ethical role archaeology plays in nationalism and the importance of cultural heritage to contemporary communities).

In 1899, the AIA and the American Association for the Advancement of Science formed a committee, intending to draft a bill to protect archaeological and historical objects.43Archaeological Amicus Brief, supra note 27, at 12. The impetus for this action was not simply the preservation of “isolated structures or objects,” but rather the “impacts that ‘indiscriminate digging’ and vandalism were having on the integrity of archaeological sites in the Southwest” as a whole.44Id. Amateur excavators and thieves were damaging site context and disposing of items as they saw fit, which caused substantial, “irretrievable loss of scientific knowledge” about the peoples and history of the region.45Id.; Mark Squillace, The Monumental Legacy of the Antiquities Act of 1906, 37 Ga. L. Rev. 473, 477–78 (2003). Archaeologist T. Mitchell Prudden, in an article published shortly before the Act’s passage, said,

[I]t is now evident that to gather or exhume specimens—even though these be destined to grace a World’s Fair or a noted museum—without at the same time carefully, systematically, and completely studying the ruins from which they are derived, with full records, measurements, and photographs, is to risk the permanent loss of much valuable data and to sacrifice science for the sake of plunder.46T. Mitchell Prudden, The Prehistoric Ruins of the San Juan Watershed in Utah, Arizona, Colorado, and New Mexico, 5 Am. Anthropologist 224, 288 (1903). The landscape Prudden describes includes Bears Ears and Grand Staircase-Escalante National Monuments. See id. Moreover, this trend of looting was devastating for, and continues to devastate, Indigenous communities: without context or knowledge about objects, many institutions cannot identify the communities to which they should return funerary objects and human remains subject to the Native American Graves Protection and Repatriation Act. Stolen ancestors continue to languish in collections without proper ceremony—and in some cases are removed from rest to be used as teaching tools. See Logan Jaffe, Mary Hudetz, Ash Ngu & Graham Lee Brewer, America’s Biggest Museums Fail to Return Native American Human Remains, ProPublica (Jan. 11, 2023, 5:00 AM), https://www.propublica.org/article/repatriation-nagpra-museums-human-remains [https://perma.cc/8PMR-XL22]  (“[T]he American Museum of Natural History has not returned some human remains taken from the Southwest, arguing that they are too old to determine which tribes—among dozens in the region—would be the correct ones to repatriate to.”); Mary Hudetz & Graham Lee Brewer, A Top UC Berkeley Professor Taught with Remains That May Include Dozens of Native Americans, ProPublica (Mar. 5, 2023, 8:00 AM), https://www.propublica.org/article/berkeley-professor-taught-suspected-native-american-remains-repatriation [https://perma.cc/5PN7-NEBG].

Drafters of and advocates for the Act shared a desire to protect Indigenous American artefacts, sites, and the scientific information their context provides.47Archaeological Amicus Brief, supra note 27, at 13. The AIA drafted several early competing iterations of the Antiquities Act, three of which were heavily debated in Congress.48Freddie Wolf, Addressing the Deficiencies of the Antiquities Act: Can a President Modify or Revoke a Designated National Monument?, 11 Geo. Wash. J. Energy & Env’t L. 55, 56 (2020); Archaeological Inst. Am., supra note 41. Then-Secretary of the Interior Ethan Hitchcock was “dissatisfied with the bills as he believed them to be too narrow . . . [t]hey were either too limited in the permissible reservation area, or merely criminalized harming an aboriginal antiquity.”49Wolf, supra note 48 (footnote omitted). Hitchcock and allied proponents of the Act wished for broader landscapes to be included within its purview.50Squillace, supra note 45, at 477. The resulting statute was brief, but serves two key purposes that reflect the desires of the Act’s drafters. First, it gives the President authority to declare national monuments to protect “objects of historic or scientific interest.”5154 U.S.C. § 320301(a). Second, it allows the President to reserve “parcels of land as part of the national monuments”52Id. § 320301(b).—the surrounding area that informs and contextualizes these objects. In other words, “under the plain text of the Act, the objects and the surrounding reserved land together comprise a monument.”53Archaeological Amicus Brief, supra note 27, at 13.

B.  Context Is Crucial: Archaeological and Cultural Landscapes Defined

Both professional and avocational archaeologists recognize that preservation of both objects and the context associated with them is crucial to systematic, holistic, and scientific study of the archaeological record.54Id.; Catherine Sease, Conservation and the Antiquities Trade, 36 J. Am. Inst. for Conservation 49, 51–52 (1997) (“Context is extremely important to the archaeologist; . . . artifacts are only of scientific value when their context is known.”). The archaeological significance of an object depends on careful recordkeeping and connections, including the “stratigraphic,” or soil layers, where the object rests, the broader geological context, and a network of sites, other objects, historical information, and cultural associations.55Archaeological Amicus Brief, supra note 27, at 13–14. The phrase that describes these relationships is the “archaeological landscape,” or the bounded area of land in which human behavior and its relationship with the natural environment over space and time can be viewed and understood through the study of the interconnected networks of sites, artifacts, and natural features that exist within it.56Tim Denham, Landscape Archaeology, in Encyclopedia of Geoarchaeology (Allan S. Gilbert ed., 2016), https://link.springer.com/referenceworkentry/10.1007/978-1-4020-4409-0_168 [https://perma.cc/Z8HZ-UGT5]. “Landscape-level analysis” means that findings at discrete sites are deprived of meaning if not viewed in the context of the broader archaeological and environmental landscape. In other words,

Artistic and utilitarian objects, faunal and floral remains, architectural features, human remains, and their original contextual relationship to each other are all equally essential in achieving an optimal understanding of the past. This full body of contextualized information is a destructible, nonrenewable cultural resource. Once it is destroyed, it cannot be regained.57Patty Gerstenblith, Controlling the International Market in Antiquities: Reducing the Harm, Preserving the Past, 8 Chi. J. Int’l L. 169, 171–72 (2007).

Moreover, since the professionalization of archaeology in the nineteenth century, even prior to the conception of the Antiquities Act, archaeologists working with Indigenous communities have recognized the value of “cultural landscapes,” defined as “networks of natural and constructed places perceived and made meaningful by particular human communities,” in shaping modern community identities and informing archaeological preservation and practice.58Severin Fowles, The Southwest School of Landscape Archaeology, 39 Ann. Rev. Anthropology 453, 455 (2010). As the AIA notes in its Amicus Brief in the prior Bears Ears National Monument litigation, “Fowles reviews the development of landscape archaeology in the American southwest, arguing that a ‘rigorous investigation of past landscapes must also seek to understand the way in which they were perceived and experienced on the ground by culturally situated individuals.’ ” Archaeological Amicus Brief, supra note 27, at 15 n.43. Cultural landscapes “are as critical to archaeological meaning as singular built structures,” discrete objects, and the adjacent findings and stratigraphic context that constitute the archaeological landscape.59Archaeological Amicus Brief, supra note 27, at 15. They draw no line between the natural, tangible, and cultural resources of a place; these are inseparable.60See Bears Ears Proposal, supra note 13, at 30. Cultural landscapes are not passive, nor are they frozen in the past. They are active heritage areas—living landscapes that communities continue to interact with.

Cultural landscapes are professionally recognized by archaeologists and embraced by many Indigenous communities, including those of the Bears Ears Inter-Tribal Coalition,61Id.; Cultural Resources from an Indigenous Perspective, Nat’l Oceanic & Atmospheric Admin.: Nat’l Marine Sanctuaries, https://sanctuaries.noaa.gov/tribal-landscapes/cultural-resources.html [https://perma.cc/45MA-TRBE]. acknowledged internationally by the United Nations Educational, Scientific and Cultural Organization (“UNESCO”), a United Nations agency which is recognized as a global cultural heritage authority,62Cultural Landscapes, UNESCO World Heritage Convention, https://whc.unesco.org/en/culturallandscape [https://perma.cc/ULT2-XTK4]. and which informs the practice of United States federal agencies.63Federal agencies provide their own definitions of cultural landscapes and describe the value of a cultural landscape approach to integrated resource management. Cultural Resources from an Indigenous Perspective, Nat’l Oceanic & Atmospheric Admin.: Nat’l Marine Sanctuaries, https://sanctuaries.noaa.gov/tribal-landscapes/cultural-resources.html [https://perma.cc/45MA-TRBE]. UNESCO defines cultural landscapes as “[c]ombined works of nature and humankind [that] express a long and intimate relationship between peoples and their natural environment.”64UNESCO World Heritage Convention, supra note 62. UNESCO has specifically designated 121 properties as cultural landscapes on its World Heritage List.65In 1992, the World Heritage Convention “became the first international legal instrument to recognise and protect cultural landscapes.” After this Convention, UNESCO began including cultural landscapes as part of the World Heritage List. Id. These properties

[O]ften reflect specific techniques of sustainable land-use, considering the characteristics and limits of the natural environment they are established in, and a specific spiritual relation to nature. Protection of cultural landscapes can contribute to modern techniques of sustainable land-use and can maintain or enhance natural values in the landscape.66Id.

Context and landscape have significant historic, natural, scientific, and cultural value. With this understanding in mind, the Antiquities Act is best understood as cultural heritage legislation intended to protect cultural landscapes. The Act was developed by archaeologists for the benefit of long-term archaeological practice and cultural preservation. It would have been neither drafted nor passed without the influence of American archaeological societies that helped raise awareness of the looting and destruction of American cultural heritage, especially in the Southwest, to the national stage and assisted Congress in developing the legislation. Preserving cultural landscapes, which includes preserving access for descendant communities, helps us better understand layers of human meaning and value in a specific place, today and throughout history and prehistory. With them, we can develop a historical, scientific, and cultural understanding of peoples and places that the objects alone, removed from their broader context, could not share. This is what the framers of the Antiquities Act had in mind.

C.  A Perfect Fit: Characteristics of National Monuments and the Practical Impact of National Monument Status as a Federal Land Use Designation

Given the drafting context of the Antiquities Act, the Act was clearly intended to protect cultural landscapes. Indeed, it is well-tailored to do so. As discussed above, the Act intentionally protects not only objects, but also the areas of land that surround them, as necessary for the “proper care and management” of the objects.6754 U.S.C. § 320301(b). Moreover, the Act’s designation criteria, including its wording, specific protections and prohibitions, and flexible management structures, demonstrate that cultural landscapes are, and have in practice been, the key purview of the Antiquities Act.

National monuments are distinct from national parks, which require Congressional authorization under the Organic Act, by contrast to sole presidential authorization.68NPS Organic Act, U.S. Dep’t Interior, https://doi.gov/ocl/nps-organic-act [https://perma.cc/D7N8-MGBD]. National parks are areas “set apart by Congress for the use of the people of the United States generally,” typically because of an outstanding “scenic feature or natural phenomen[on].”69Robert Sterling Yard, The National Parks Portfolio 4 (Isabelle F. Story ed., 6th ed. 1931). Under current National Park Service policies, parks must be “sufficiently large to yield to effective administration and broad use.”70Id. Qualities considered for national park designation are the “inspirational, educational, and recreational”71Id. values of the area, by contrast to national monument criteria, which include “historic landmarks, historic and prehistoric structures, and other objects of historic or scientific interest.”7254 U.S.C. § 320301(a). National parks and national monuments, therefore, were intended to serve different purposes, and do so in practice.73Hartman, supra note 22, at 160–61. National parks are defined as lands emphasizing educational and recreational interests; national monuments are areas set aside specifically for historic and scientific preservation.

Both the President and Congress may exercise authority to designate a national monument pursuant to the Antiquities Act.74See Carol Hardy Vincent, Cong. Rsch. Serv., R41330, National Monuments and the Antiquities Act 4 (2024), https://sgp.fas.org/crs/misc/R41330.pdf [https://perma.cc/3SNQ-RKYD]. Congress has rarely exercised its independent authority to designate national monuments.75See id. It did so primarily in the mid-1940s to 1960s, when Presidents were abstaining from use of the Act in the wake of President Franklin D. Roosevelt’s designation of Jackson Hole National Monument and the subsequent national outcry.76See id. at 2; The Proclamation of National Monuments Under the Antiquities Act, 1906-1970, Nat’l Park Serv. (Mar. 6, 2023) [hereinafter Proclamation of Monuments], https://www.nps.gov/articles/lee-story-proclamation.htm [https://perma.cc/SD95-KX24]. For a complete dataset of uses of the Antiquities Act, including which designations were made under executive by contrast to congressional authority, see Monument Data, supra note 21. Current monuments and their proclamation authority are also listed at 54 U.S.C. § 320301. Congress may also narrow the reach of the President and the Antiquities Act. See Vincent, supra note 74. In several cases, however, Congress has done the opposite, leaving the Antiquities Act untouched in pushes to modify it and, in some cases, intentionally reaffirming and preserving presidential authority. See The Antiquities Act and America’s National Monuments: A Timeline of Milestones, Pew (Mar. 8, 2019), https://www.pewtrusts.org/en/research-and-analysis/fact-sheets/2019/03/the-antiquities-act-and-americas-national-monuments [https://perma.cc/4V85-XT2D]. For example, in 1976, Congress passed the Federal Land Policy and Management Act, or FLPMA, which repealed the executive branch’s public lands withdrawal authority and prohibited the Secretary of the Interior from modifying or revoking any monuments created by executive action under the Antiquities Act. Id. Additionally, Congress may modify national monuments and, in some cases, has abolished them or converted them into different protective designations.77Vincent, supra note 74. Recognizing the different values of environmental and recreation law compared with cultural heritage preservation law and following the 1916 passage of the Organic Act, Congress has abolished some monuments to redesignate them as national parks. See id. at 2–3. Even if the use of national monument and national park status in practice has been blurred, the legislative purposes of their enacting statutes are unquestionably distinct. Hartman, supra note 22, at 160–161. National monument designations under the Act apply to lands owned or controlled by the federal government.78Vincent, supra note 74, at 6. The Antiquities Act also provides that if “objects” are on privately owned lands, the property “may be relinquished to the Federal Government.” 54 U.S.C. § 320301(c). There is no case law that elucidates whether nonfederal lands must be relinquished voluntarily (donated, purchased, or exchanged) or whether the President may convert private property to federal property; no President has yet converted private property, though some monuments include donated lands. Vincent, supra note 74, at 6–7.

The Antiquities Act does not specifically cap the size of monument designations. Language proposed to cap monument size, found in the legislative history of the Act, was removed, though it was clear that land should be reserved “only so much . . . as may be absolutely necessary,”79H.R. Rep. No. 59-2224, at 1 (1906). ultimately, confined to the “smallest area compatible with the proper care and management of the objects to be protected.”8054 U.S.C. § 320301(b). The acceptance of a more ambiguous act that included vague definitions and did not include specific size limits suggests that Congress intended the Act to have a broad purview. The President has discretion to determine the acreage necessary to ensure protection of the objects in question, which “can be a particular archaeological site or larger features or resources.”81Vincent, supra note 74, at 5. The Grand Canyon, for example, initially had national monument status, protecting an area of nearly one million acres.82Id. President Theodore Roosevelt, the first executive to employ the Act, determined that this size was necessary to protect the “object” in question: the canyon itself.83Id.

Though typically managed by the National Park Service, monuments may be managed by any federal agency, including the Bureau of Land Management (“BLM”), United States Forest Service (“Forest Service”), Department of Fish and Wildlife Service, and National Oceanic and Atmospheric Administration.84Id. at 7–8. Most monuments remain under the authority of the agency that managed the area prior to its new designation.85Id. Bears Ears National Monument, for example, is jointly managed by the BLM and Forest Service.86Bears Ears National Monument Management, U.S. Dep’t Interior: Bureau Land Mgmt. [hereinafter Bears Ears Management], https://www.blm.gov/programs/national-conservation-lands/utah/bears-ears-national-monument [https://perma.cc/2LK9-VAFP]. Tribal guidance is provided by a Bears Ears Commission of the five Inter-Tribal Coalition Tribes; volunteers represent other stakeholders, including local and environmental interests, on the Bears Ears Monument Advisory Committee.87Id. After President Biden’s redesignation of Bears Ears, the BLM, Forest Service, and five Inter-Tribal Coalition Tribes signed a first-of-its-kind Inter-Governmental Cooperative Agreement for coordinated land use planning and implementation, long-term resource management, and programmatic goal development, including Tribal outreach efforts, at Bears Ears.88Inter-Governmental Cooperative Agreement for the Cooperative Management of the Federal Lands and Resources of the Bears Ears National Monument, Bears Ears Commission-U.S. Department of the Interior, Bureau of Land Management and U.S. Department of Agriculture, Forest Service, June 18, 2022 [hereinafter Inter-Governmental Agreement], https://www.blm.gov/sites/default/files/docs/2022-06/BearsEarsNationalMonumentInter-GovernmentalAgreement2022.pdf [https://perma.cc/4RAB-UPMQ]; Amy Joi O’Donoghue, ‘One of a Kind’ Deal Means 5 Tribes Will Help Manage Bears Ears Area, Deseret News (June 21, 2022, 4:18 PM), https://www.deseret.com/utah/2022/6/21/23176953/bears-ears-national-monument-utah-biden-trump-native-american-tribes-public-lands-west-politics [https://perma.cc/4BCC-2TUL]. More recently, the Bears Ears Commission collaborated with the Bureau of Land Management and Forest Service to draft a Resource Management Plan for Bears Ears, another unprecedented example of tribal participation in co-management.89U.S. Dep’t Interior: Bureau Land Mgmt., Bears Ears National Monument Draft Resource Management Plan and Environmental Impact Statement (2024), https://eplanning.blm.gov/public_projects/2020347/200531796/20105487/251005487/BENM_DraftRMP-EIS_Vol1_508.pdf [https://perma.cc/C7M5-VBL2]. This is one example of the flexible, sovereign-to-sovereign, and creative management authority a national monument designation can provide.

Permitted uses of lands with national park or national monument status are also distinct. Monuments managed by the National Park Service are subject to the same use limitations as national parks.90Hartman, supra note 22. Permitted uses for monuments outside the park system vary, depending on both the supervising agency and management objectives specified in the proclamation.91Id. A “primary objection” to national monument designations is that the new status of the land would change management of the area and restrict or muddle various previously permitted uses, including development, off-road activity, and timber cutting. Vincent, supra note 74, at 9–10. The overriding goal, however, is to protect the objects described in each proclamation.92Vincent, supra note 74, at 9–10. Existing uses of the land not specifically precluded by the proclamation may continue, but the proclamation becomes the “dominant” reservation status when lands were previously reserved for other purposes.93Id.; Squillace, supra note 45, at 515. All national monuments prohibit new mineral leases and permanent development activities.94See id. at 516–17; Hartman, supra note 22. At Bears Ears, no mining and drilling is permitted, but the Department of the Interior may continue issuing cattle grazing leases.95Turkewitz, supra note 26.

The Antiquities Act is elastic—by design—to provide broad designation authority, flexible management priorities and structures, and to limit uses as necessary to steward objects in situ and the land that encompasses them, or the cultural landscape. Each new monument has distinct requirements because the objects and landscapes it protects are different.

The Antiquities Act is the only United States cultural heritage legislation that serves, in intent and in practice, the goal of protecting cultural landscapes writ large. Dominant federal statutes protecting cultural materials, themselves limited in number and authority,96The United States does not conceptualize cultural patrimony according to the national ownership model used in many European countries. Our cultural heritage protection options are therefore more limited and present unique challenges. For discussion, see generally William R. Ognibene, Lost to the Ages: International Patrimony and the Problem Faced by Foreign States in Establishing Ownership of Looted Antiquities, 84 Brook. L. Rev. 605 (2019). include: (1) the National Historic Preservation Act, emphasizing protection of historic sites and landmarks and creating the National Register of Historic Places, the list of National Historic Landmarks, and State and Tribal Historic Preservation Offices, in addition to implementing a required federal review procedure, known as Section 106 review, that requires federal agencies to consider the effects of projects they “carry out, approve, or fund” on historic properties included in, or eligible to be included in, the National Register;97Advisory Council on Historic Pres., Protecting Historic Properties: A Citizen’s Guide to Section 106 Review 4 https://www.achp.gov/sites/default/files/documents/2017-01/CitizenGuide.pdf [https://perma.cc/TP3M-ZFB3]. (2) the Archaeological Resources Protection Act, governing the excavation of archaeological sites on federal and Tribal lands, and the removal and disposition of objects from those sites;98Marina F. Rothberg, Indiana Jones and the Illicit Excavation and Trafficking of Antiquities: Refining Federal Statutes to Strengthen Cultural Heritage Protections, 63 B.C. L. Rev. 1555, 1564–67 (2022). and (3) the Native American Graves Protection and Repatriation Act, providing for the repatriation of Native American human remains, funerary objects, sacred objects, and objects of cultural patrimony from federal agencies and institutions that receive federal funds.99Native American Graves Protection and Repatriation Act, U.S. Dep’t Interior: Indian Affs., https://www.bia.gov/service/nagpra [https://perma.cc/37BK-FV2Y]. In December 2023, new implementation rules for NAGPRA were announced, including several substantial changes. For more, see Interior Department Announces Final Rule for Implementation of the Native American Graves Protection and Repatriation Act, U.S. Dep’t of the Interior (Dec. 6, 2023), https://www.doi.gov/pressreleases/interior-department-announces-final-rule-implementation-native-american-graves [https://perma.cc/5SRQ-EFCT]. Notably, these other cultural heritage laws, younger than the Antiquities Act by more than fifty years,100Rothberg, supra note 98, at 1564–67. intend to protect sites, buildings, objects, or their excavation, but never all of these or the context that surrounds them.

Moreover, given its goals and the ramifications of its use, including, for example, prohibitions on renewable energy development in addition to more extractive natural resource projects, the Antiquities Act is not in purpose or effect similar to environmental conservation laws or other public land designation statutes. Many of these statutes are exceptionally long, well-defined, and heavily regulated,101See generally Selected Environmental Law Statutes: 2022-2023 Educational Edition (2022) (compiled by Robin Kundis Craig). The Clean Air Act, as one example, is longer than the United States Tax Code. 42 U.S.C. §§ 7401–7671q. by contrast to the brevity and ambiguous language that the Antiquities Act employs.102Ruple et al., supra note 22, at 2–3. The Antiquities Act is not environmental or wilderness legislation; it is cultural preservation law, allowing Presidents to tailor protection of cultural landscapes to the specific needs of each monument.

II.  BROAD DISCRETION OVER A CENTURY OF USE AND CHALLENGE

From its ideation to its drafting to its results, the Antiquities Act has been understood as a cultural heritage preservation law intended to broadly encompass cultural landscapes, or distinct objects and their surrounding context. President Theodore Roosevelt was the first President to utilize the Act. Roosevelt proclaimed eighteen national monuments, including areas now part of Grand Canyon, Lassen Volcanic, Olympic, and Petrified Forest National Parks.103Monument Data, supra note 21. Eighteen of the twenty-one Presidents serving in office since the Act came into effect—including President Trump—have proclaimed a total of 163 monuments and utilized the Act’s grant of authority 291 times to establish, modify, and expand national monuments.104Id.; Vincent, supra note 74, at 7–8, 16.

National monuments range in size from 0.34 acres (Belmont-Paul Women’s Equality National Monument)105Proclamation No. 9423, 81 Fed. Reg. 22505 (Apr. 12, 2016). to approximately 372 million acres (Papahānaumokuākea Marine National Monument);106Papahānaumokuākea Marine National Monument, NOAA Fisheries (Sept. 25, 2018) https://www.fisheries.noaa.gov/pacific-islands/habitat-conservation/papahanaumokuakea-marine-national-monument [https://perma.cc/4KUM-N63S]. the latter more than three hundred times the size of Bears Ears, where current boundaries under President Biden’s redesignation total 1.36 million acres.107Bears Ears Management, supra note 86. While it is true that many of the larger monuments were created over the past half century, there are several examples of monument proclamations greater than one million acres in size throughout the history of the Antiquities Act, including Katmai National Monument, established in 1918 with 1.1 million acres, Glacier Bay National Monument, proclaimed in 1925 as 1.4 million acres, and Wrangell-St. Elias National Monument, designated in 1978 with 10.95 million acres.108Vincent, supra note 74, at 4–5. Of the 163 national monuments designated, burial grounds, individual houses, geological features, marine and continental landscapes, forts, cave systems, and battlegrounds are represented.109Id. at 16–24. Historical precedent reveals broad use of the Act to proclaim a great variety of monuments that differ in size by several volumes of magnitude, consistent with an intent to protect individual cultural landscapes, with different boundaries, that inform the monuments’ various “objects” and their management needs.

The Act has enjoyed over a century of use by nearly all Presidents, all of whom had substantially different policy objectives. When the Act has found itself in a courtroom, the President has been given latitude: broad discretion to use the Act has always been upheld. Not once has the Act’s breadth been proscribed, rather than expanded, by federal court mandate. Every test to date has been unsuccessful.

The first challenge to the Antiquities Act came shortly after President Theodore Roosevelt’s designation of Grand Canyon National Monument, nearly one million acres in size, in 1908, only two years after the Act became law. In Cameron v. United States, the plaintiff, owner of a lode mining claim that Roosevelt’s proclamation removed from the “operation of the public land laws and . . . of the mineral land law,” challenged the monument proclamation, arguing that there was “no authority for its creation” under the Antiquities Act.110Cameron v. United States, 252 U.S. 450, 454–55 (1920). On appeal from the Ninth Circuit, the United States Supreme Court—whose discussion of this issue lasted no more than a short paragraph—held that the Grand Canyon itself was an “object” of unusual scientific interest.111Id. at 455–56. In making this finding, the Court wrote that the Grand Canyon

[I]s the greatest eroded canyon in the United States, if not in the world, is over a mile in depth, has attracted wide attention among explorers and scientists, affords an unexampled field for geologic study, is regarded as one of the great natural wonders, and annually draws to its borders thousands of visitors.112Id. at 456.

These exemplary characteristics were sufficient to defer to the President’s authority, as empowered by the Act, to define an “object of historic or scientific interest.”113See id. at 455–56. Not only was a landscape as a “container” for objects consistent with the Act, but a landscape, too, could itself be an object protected by a monument.

The Antiquities Act was next impugned in the mid-twentieth century, following President Franklin D. Roosevelt’s proclamation of Jackson Hole National Monument in Wyoming. The proclamation sparked “tremendous and bitter opposition” in Wyoming, national outcry, and vehement debate in Congress;114Proclamation of Monuments, supra note 76. in the resulting lawsuit, the Wyoming District Court went so far as to assert that “propaganda [had] been circulated in forums and through the press of the Nation.”115State v. Franke, 58 F. Supp. 890, 896 (D. Wyo. 1945). Opponents characterized the proclamation as executive overreach that designated a wilderness with no historic sites, landmarks, or archaeological resources as a national monument.116See id. at 895. The plaintiff in State v. Franke, sought a construction of the Antiquities Act, under the Federal Declaratory Judgement Act, that would void Roosevelt’s proclamation, arguing that

The segregated area, by virtue of the Proclamation over which the defendant threatens management and control, is outside the scope and purpose of the Antiquities Act under which the Proclamation was issued in that such area contains no objects of an historic or scientific interest required by the Act; that the Proclamation is void and of no effect in that it is not confined to the smallest area compatible with the proper care and management of a National Monument; that by said Proclamation an attempt has been made to substitute, through the Antiquities Act, a National Monument for a National Park, the creation of which is within the sole province of the Congress, thereby becoming an evasion of the law governing the segregation of such areas . . . .117Id. at 892 (emphasis added). Note that this is the first case challenging the Act not solely on the grounds that it contains no qualifying “objects” of historic and scientific interest, but also specifically on the grounds that the size of the monument designation was too expansive.

The court, reluctantly, found that it had “limited jurisdiction to investigate and determine whether or not [Roosevelt’s] Proclamation [was] an arbitrary and capricious exercise of power under the Antiquities Act so as to be outside of the scope and purpose of that Act.”118Id. at 894. The plaintiff and the defendant presented conflicting evidence regarding the presence or absence of “objects of historic or scientific interest.”119Id. at 895. However, the court held, the President acted on evidence “of a substantial character” and therefore properly availed himself of the discretion duly granted to him by Congress through the Antiquities Act.120Id. at 895–96. The court distinguished between a “bare stretch of sage-brush prairie” that could contain no objects of historic and scientific interest, in which case the presidential monument proclamation would be arbitrary and capricious, and the monument at issue, for which experts provided some “substantial” evidence of such objects, including “trails and historic spots in connection with the early trapping and hunting of animals formulating the early fur industry of the West, structures of glacial formation and peculiar mineral deposits and plant life indigenous to the particular area.”121Id. at 895. Following the ruling, while Jackson Hole’s monument designation was upheld, Congress limited the President’s authority by “requiring congressional authorization for extensions or establishment of monuments in Wyoming, and by making withdrawals in Alaska exceeding 5,000 acres subject to congressional approval.” Vincent, supra note 74, at 1 (footnote omitted). Though it did not agree that testimony would support the President’s claim regarding the presence of objects of historic and scientific interest in Jackson Hole under a preponderance rule, the court was bound by his exercise of discretion.122State v. Franke, 58 F. Supp. 890, 896 (D. Wyo. 1945).

Decades passed before the Antiquities Act was contested again. In the cases that followed Franke, federal courts continued to recognize a President’s discretion and support interpretations of the Act that confirmed its substantial scope and latitude regarding both objects to be preserved and the size of parcels proclaimed. In Tulare County v. Bush, challenging President George W. Bush’s proclamation of Giant Sequoia National Monument, the D.C. Circuit Court of Appeals held that ecosystems and scenic vistas were appropriate “objects” for protection under the Act, which is “not limited to protecting only archeological sites.”123Tulare Cnty. v. Bush, 306 F.3d 1138, 1142 (2002) (emphasis added); see also Cappaert v. United States, 426 U.S. 128, 142 (1976). This case further cements the conclusion that biodiversity, ecosystems, and scenery are part and parcel of what the Act was intended to protect—archeological, scientific, and historic resources together with the natural and cultural context that informs them.

This trend continued in Utah Association of Counties v. Bush, an earlier challenge by the state of Utah to national monument designations within its borders.124Utah Ass’n of Cntys. v. Bush, 316 F. Supp. 2d 1172 (D. Utah 2004). This was the first time that Bears Ears’ sister monument, Grand Staircase-Escalante National Monument, was on the chopping block. The Utah District Court rejected plaintiff counties’ claims that Grand Staircase-Escalante, 1.87 million acres, exceeded the “smallest area compatible” with the protection of the objects at issue, holding that President Clinton lawfully exercised his discretion pursuant to the Antiquities Act as to both the nature of “objects” to be protected and the size of the parcel reserved.125Id. at 1183. The court also characterized the overriding purpose of the Antiquities Act as “identify[ing] and protect[ing] important scientific and historic objects and [] set[ting] aside the necessary surrounding land to insure their continued protection.” Id. at 1192. The court found it significant that the proclamations discussed in detail both the monument’s natural and archaeological resources, and why the designated area was the smallest consistent with the protection of those specific resources: this “clearly indicate[d] that the President considered the principles that Congress required him to consider.”126Id. at 1186. Again, this exercise of discretion was not subject to judicial review.127Id. at 1172. The court engaged in two other notable discussions: its conclusion that the Antiquities Act was not intended to limit protection to man-made objects, which appeared so obvious to the court that it was observed only in a footnote,128Id. at 1186 n.8. and a constitutional nondelegation argument.129Id. at 1190–91. The court settled the latter issue, holding that

The Antiquities Act sets forth clear standards and limitations. The Act describes the types of objects that can be included in national monuments and a limitation on the size of monuments. Although the standards are general, “Congress does not violate the Constitution merely because it legislates in broad terms, leaving a certain degree of discretion to executive or judicial actors.”130Id. at 1191 (citation omitted).

The most recent challenge to the Antiquities Act—other than the present challenge—is Massachusetts Lobstermen’s Association v. Ross, a 2019 D.C. Circuit Court of Appeals case dismissed for failure to state a claim that contested the designation of Northeast Canyons and Seamounts Marine National Monument.131Mass. Lobstermen’s Ass’n v. Ross, 945 F.3d 535, 545 (D.C. Cir. 2019). On the fishermen’s argument that the monument was not limited to the smallest area compatible with the care and preservation of the objects at issue, the court wrote

[The Fishermen] allege only that the Monument reserves large areas of submerged land beyond the canyons and seamounts. Although those allegations “might well have been sufficient if the President had identified only [the canyons and seamounts] for protection, . . . he did not.” Instead, the Monument protects not only “the canyons and seamounts themselves,” but also “the natural resources and ecosystems in and around them.”132Id. at 544 (emphasis added) (quoting Tulare Cnty. v. Bush, 317 F.3d 227, 227 (D.C. Cir. 2003) (per curiam)).

It was incumbent on the fishermen to allege that some part of the monument did not contain natural resources that the President sought to protect. They failed to do so.

Over nearly one hundred years of the Antiquities Act’s use, every test has come up short, despite substantive challenges to large monuments and courts that clearly expressed reluctance to uphold proclamations. Courts have not only considered the history and purpose of the Act and chosen, given this history, to interpret its brief language broadly,133See Utah Ass’n of Cntys. v. Bush, 316 F. Supp. 2d 1172, 1186 n.8. (D. Utah 2004). but have also substantially deferred to the President’s proclamations, refused to submit them as arbitrary and capricious,134See State v. Franke, 58 F. Supp. 890, 896 (D. Wyo. 1945). and declined to engage in judicial review of discretion Congress properly granted—with clear standards.135Utah Ass’n of Cntys., 316 F. Supp. 2d at 1183. What followed Massachusetts Lobstermen’s, however, was plaintiffs’ petition for a writ of certiorari to the United States Supreme Court. Chief Justice Roberts’s statements in his order denying certiorari serve as the cornerstone of the state of Utah’s current lawsuit. Chief Justice Roberts invites challenges to the Antiquities Act, making it clear that his denial of certiorari is on procedural grounds alone and indicating that the Supreme Court readies itself to reconsider the scope of the Antiquities Act on its merits:

While the Executive enjoys far greater flexibility in setting aside a monument under the Antiquities Act, that flexibility, as mentioned, carries with it a unique constraint: Any land reserved under the Act must be limited to the smallest area compatible with the care and management of the objects to be protected. Somewhere along the line, however, this restriction has ceased to pose any meaningful restraint . . . [and the Presidential power granted by the Act] has been transformed into a power without any discernible limit to set aside vast and amorphous expanses of terrain above and below the sea. . . . We have never considered how a monument of these proportions . . . can be justified under the Antiquities Act . . . [and] we have not explained how the Act’s corresponding “smallest area compatible” limitation interacts with the protection of such an imprecisely demarcated concept as an ecosystem . . . . Despite these concerns, this petition does not satisfy our usual criteria for granting certiorari. . . . We may be presented with other and better opportunities to consider this issue without the artificial constraint of the pleadings in this case.136Statement of Chief Justice Roberts Respecting the Denial of Certiorari, supra note 34, at 3–4 (citation omitted).

III.  UTAH POKES THE SLEEPING BEAR: THE PRESENT CHALLENGE TO THE ANTIQUITIES ACT

One might conclude that, given this precedent, the broad purview of the Antiquities Act and its protection of objects and the natural and cultural landscapes that encompass them have been put to bed. Following Chief Justice Roberts’ invitation, however, Utah decided to poke the sleeping bear. Joined by Garfield and Kane counties and, in a companion suit, the BlueRibbon coalition,137The BlueRibbon Coalition is a nonprofit organization that advocates on behalf of recreationalists and those seeking motorized access to public lands. About: History of the BlueRibbon Coalition/Sharetrails, BlueRibbon Coal., https://www.sharetrails.org/about [https://perma.cc/WDB2-H43B]. the state filed suit against the Biden Administration in Utah District Court in late 2022, objecting to President Biden’s redesignation of Bears Ears National Monument. Tribes, including the Navajo Nation and Hopi Tribe, later followed by the Ute Mountain Ute Tribe and Pueblo of Zuni, quickly sought to intervene; their motions were granted.138Proposed Intervenors’ Second Amended Rule 24 Motion to Intervene, Garfield Cnty. v. Biden, No. 22-cv-00059, 2023 U.S. Dist. LEXIS 142044 (D. Utah Aug. 11, 2023); Order Granting Movants Hopi Tribe, Navajo Nation, Pueblo of Zuni, & Ute Mountain Ute Tribe’s Amended Motion to Intervene, Garfield Cnty. v. Biden, No. 22-cv-00059, 2023 U.S. Dist. LEXIS 142044 (D. Utah Aug. 11, 2023). Similarly, in December 2022, a collection of professional archaeological societies, including the AIA, submitted a motion to intervene in the lawsuit (in the interest of disclosure, the Author was a declarant for this motion). Though their motion was denied, their motion speaks to the gravity of the lawsuit and an understanding that the Antiquities Act is law written by archaeologists for the purposes of historic and cultural preservation. Motion to Intervene as Defendants & Memorandum in Support, Garfield Cnty. v. Biden, No. 22-cv-00059, 2023 U.S. Dist. LEXIS 142044 (D. Utah Aug. 11, 2023); Memorandum Decision & Order on Proposed Intervenors’ Motions to Intervene, Garfield Cnty. v. Biden, No. 22-cv-00059, 2023 U.S. Dist. LEXIS 142044 (D. Utah Aug. 11, 2023).

In a challenge nearly identical to those that have come before, relying heavily on Chief Justice Roberts’ statement and concerns about the size of the monument, Utah argues that “[t]he Act does not authorize the president to draw boundaries around an enormous land area and then stitch together hundreds of items and features within those boundaries to try to reverse engineer a landscape-scale national monument.”139Complaint for Declaratory & Injunctive Relief at 2, Garfield Cnty. v. Biden, No. 22-cv-00059, 2023 U.S. Dist. LEXIS 142044 (D. Utah Aug. 11, 2023) . In their desire to exclude “landscape-scale” areas from the scope of the Antiquities Act, opponents of the Act and of Bears Ears National Monument misconstrue the term “landscape.” They miss the true meaning and intention of the Antiquities Act. A landscape is not simply an environment, microclimate, or collection of flora and fauna, but rather a collective set of sites, objects, geological features, natural elements, and the living things that interact with them. This web of understanding—this cultural landscape—is essential to historic and scientific study. It was clearly recognized by both the proponents of the Antiquities Act and by Congress, and the President is granted broad discretion to determine what objects are protected and the size of the area necessary for their care.

Bears Ears National Monument is a quintessential example of a cultural landscape given meaning by the peoples of the Colorado Plateau. Her objects epitomize relationships between ancient and modern Native peoples and the natural world: rock art from both ancestral peoples and modern Tribal communities, “[t]he remains of single family dwellings, granaries, kivas, towers, and large villages and roads linking them together[, which] reveal a complex cultural history.”140Proclamation No. 9558, 3 C.F.R. § 9558 (2017). “Moki steps,” or hand and toe holds carved into the walls used to access cliff dwellings that Native people still climb today.141Id. Flora and fauna are still used for subsistence, medicinal, and religious traditions.142Id. Understanding her landscape itself as an object also illuminates these relationships. Ceremonial practices are conducted in situ. Seeing “cliff faces suitable for granaries and rock art, alcoves for habitation, floodplains for growing crops, etc.[] and viewsheds when such viewsheds are relevant to human subsistence (i.e., observation points along big game migration corridors)” help us understand interactions between people and place.143Declaration of Jerry Spangler at 3 n.3, Garfield Cnty. v. Biden, No. 22-cv-00059, 2023 U.S. Dist. LEXIS 142044 (D. Utah Nov. 23, 2023). These are only some examples—of many.

President Obama’s proclamation recognized the scientific, historical, and cultural objects and values included within the original borders of Bears Ears National Monument. The second sentence of the Bears Ears proclamation describes the area as “one of the densest and most significant cultural landscapes in the United States.”144Id. The term “landscape” appears a dozen times.145Archaeological Amicus Brief, supra note 27. President Obama, and later President Biden, demonstrated careful research and presented “substantial” evidence in their proclamations that Bears Ears National Monument contains objects that the Act was intended to protect and that the area set aside, in its entirety, contains and informs these objects. To consider these “separate objects stitched together” is erroneous and simplistic. They are part of a whole.

BlueRibbon Coalition plaintiffs emphasize, more heavily than the state of Utah, an argument regarding the breadth of the term “object”: “The proclamation[] also designate[s] as ‘objects’ a variety of ‘imprecisely demarcated concept[s]’ scattered across those landscapes’ country-sized boundaries. These so-called ‘objects’ include entire ecosystems, habitats, and even animal species . . . .”146Complaint for Declaratory & Injunctive Relief at 2, Dalton v. Biden, No. 22-cv-00060 (D. Utah Aug. 25, 2022) (citation omitted).

Setting aside the volumes of case law that defer to the President’s discretion to define “object,” determine that canyons and ecosystems themselves fit within this term, and recognize that the standards Congress provided were broad, but not impermissibly so, an interpretation of the Antiquities Act that acknowledges cultural landscapes allows for a definition of “objects” per the term’s ordinary use: antiquities. Bears Ears unquestionably includes antiquities. Every acre is the context that necessarily informs them. Though we can, we need not say an ecosystem or habitat is an object. These do, however, comprise the cultural landscape necessary for the proper care and preservation of objects.

Moreover, preserving objects and their cultural landscapes, and thereby maintaining relationships to the living peoples around them, nurtures their historic and scientific value. Relationships between Indigenous people and the environment, “as personified and enriched by the Native experience at Bears Ears . . . ha[ve] every opportunity to lead to excellent public programs and outreach as well as outstanding opportunities for scientific, historical, and philosophical research by both Native and non-Native scholars and experts.”147Bears Ears Proposal, supra note 13, at 2. In other words,

The traditional . . . knowledge amassed by the Native Americans whose ancestors inhabited this region, passed down from generation to generation, offers critical insight into the historic and scientific significance of the area. Such knowledge is, itself, a resource to be protected and used in understanding and managing this landscape sustainably for generations to come.1483 C.F.R. § 9558.

As discussed, the objects protected are not “imprecisely demarcated concept[s]”—they are tangible. Nor is the land area set aside that encompasses the cultural landscape unbounded, infinite, or limitless. The breadth of a cultural landscape can be defined. UNESCO has done so, successfully, at 121 locations.149UNESCO World Heritage Convention, supra note 62. The Antiquities Act already limits its reach to federally-owned lands, and Presidents have demonstrated that they are loath to disturb the balance and test an open question regarding relinquishment of nonfederal lands.150Vincent, supra note 74, at 6–7. Moreover, the Obama-era, and now Biden-era, boundaries of Bears Ears National Monument were deliberately limited: the Inter-Tribal Coalition had proposed a far more expansive area of 1.9 million acres, not the 1.36 million acres that were ultimately designated.151See Bears Ears Proposal, supra note 13, at 20; 3 C.F.R. § 9558. This recognizes that the scope of a cultural landscape may be refined to allow for feasible care, management, and preservation activities.

Sites, landmarks, and “objects”—even if the term is understood as limited to its common meaning, despite the prior courts’ analyses—cannot be protected under the Act absent consideration and preservation of the cultural landscape in the lands that surround them. This care would be improper. It would decimate their historic and scientific value, in direct opposition to the standards Congress set forth in the Antiquities Act.

One need only examine the Trump-era demarcation of Bears Ears to witness how boundaries affect cultural landscapes and their constituent archaeological and scientific value. The Trump proclamation emphasized distinct sites, reducing Bears Ears from a complete cultural landscape to “a series of separate and disconnected objects”152Archaeological Amicus Brief, supra note 27. that divided the monument into “non-contiguous parcels of land.”153Proclamation No. 9681, 3 C.F.R § 9681 (2018). Areas removed were left open for development, motorized vehicles, and other destructive activities with the potential to significantly affect the ecosystem, sites and objects, and cultural values encompassed in the landscape.154Kate Groetzinger, Energy Developers and Uranium Miners Eye Land Near Bears Ears National Monument, KUER 90.1 (June 1, 2021, 5:31 PM), https://www.kuer.org/health-science-environment/2021-06-01/energy-developers-and-uranium-miners-eye-land-near-bears-ears-national-monument [https://perma.cc/AVQ4-S2AQ]. This fundamentally altered the nature of the monument: “Maintaining some sites . . . [did] not compensate for excluding other sites and fragmenting their associated cultural landscapes,” placing them at “greater risk of damage or destruction.”155Archaeological Amicus Brief, supra note 27. Trump’s boundaries were insufficient to be “compatible with the proper care and management” of the sites, landmarks, objects, and cultural heritage landscape that is Bears Ears.

The courts deferred to an exercise of presidential authority at Jackson Hole, a monument far less dense in archaeological and cultural history. They upheld Grand Staircase-Escalante’s larger acreage as sufficiently “small.” Bears Ears exemplifies the purpose of the Antiquities Act. She is not a replacement for a national park or solely an ecological resource. She is a distinct, definite, living cultural landscape. The Antiquities Act allows the President to proclaim areas that embody its purpose; the executive’s exercise of discretion is unreviewable.

In August 2023, the Utah District Court judge agreed: “President Biden’s judgment in drafting and issuing the Proclamations as he sees fit is not an action reviewable by a district court.”156Memorandum Decision & Order Granting Motion to Dismiss at 28, Garfield Cnty. v. Biden, No. 22-cv-00059, 2023 U.S. Dist. LEXIS 142044 (D. Utah Aug. 11, 2023). Federal Defendants’ and Tribal Nations’ motions to dismiss were granted with prejudice.157Id.; Parrott & Scholl, supra note 34. Utah quickly expressed its intent to appeal “immediately”; as of April 2024 the case is on appeal before the Tenth Circuit and set for oral argument in September 2024. Utah Governor Spencer Cox intends to push the case to the Supreme Court, saying that the District Court’s ruling “helps us get there even sooner.”158Opening Brief of Plaintiffs-Appellants, Garfield Cnty. v. Biden, No. 23-4106 (10th Cir. Oct. 30, 2023); Wixom, supra note 32. Tribes quickly responded, emphasizing that “Bears Ears remains an essential landscape that members of Tribal Nations regularly visit to practice their spirituality and connect with their history.”159Hopi Tribe, Navajo Nation, Pueblo of Zuni, & Ute Mountain Ute Tribe Response Brief at 14, Garfield Cnty. v. Biden, No. 23-4106 (10th Cir. Jan. 9, 2024).

As before, courts are bound to respect the President’s discretion. Even if, however, the Act was challenged on its merits, Bears Ears is a cultural landscape the Antiquities Act foresaw over a century ago. In the words of its Tribal advocates, “[a] region more worthy of protection under the Antiquities Act is hard to imagine.”160Id.

IV.  NOT SO ANTIQUE: A USEFUL, MODERN TOOL FOR TRIBES

Indigenous communities have long recognized intangible relationships between people and landscape: “Native people always have, and do now, conceive of and relate to the natural world in a different way than does the larger society.”161Bears Ears Proposal, supra note 13, at 2. In Bears Ears, “[w]e can still hear the songs and prayers of our ancestors on every mesa and in every canyon,” describes Malcolm Lehi of the Ute Mountain Ute.162Id. at 3. Other members of the Coalition Tribes describe similar connections between their contemporary experiences in Bears Ears and her relationship to their history, traditions, and cultural identities.163See id. at 3–4. The relationship between people, objects, and place is obvious—and precisely what the Antiquities Act was designed to and serves well to protect.

The Bears Ears Inter-Tribal Coalition was the first to recognize that the Antiquities Act could serve as a vehicle to protect Tribal cultural landscapes. In their petition, the Coalition wrote,

Our discussion here is not intended to catalogue all the many ways that this area holds significant geological, paleontological, archaeological, historical, cultural, and biological “objects” within the meaning of the Antiquities Act . . . . [W]e offer this section to highlight some of the main considerations that justify monument status for Bears Ears. This includes . . . critically, the multifaceted relationship between Native American people and this landscape that has developed over the course of eons.164Id. at 4–5 (emphasis added).

It is no surprise that the Inter-Tribal Coalition’s decision to petition for National Monument status—and their success—has sparked a movement of Tribes asking for national monument protections for sacred heritage landscapes.

In southern Nevada, Avi Kwa Ame, or Spirit Mountain, is the “mythical creation site for Yuman-speaking Tribes like the Fort Mojave, Cocopah, Quechan, and Hopi. Their stories place it “at the center of the universe.”165Alex Schechter, ‘The Place Where Shamans Dream’: Safeguarding Spirit Mountain, N.Y. Times (Jan. 24, 2023), https://www.nytimes.com/2023/01/24/travel/nevada-avi-kwa-ame-national-monument.html [https://perma.cc/G6DY-ZUTZ]. All but one of the member Tribes in the Inter-Tribal Council of Nevada, and all the Tribes of the Inter-Tribal Association of Arizona, adopted resolutions endorsing a national monument.166Dan Michalski, Biden Commits to Honoring Tribes by Protecting Public Lands in Nevada, Wash. Post (Nov. 30, 2022, 6:57 PM), https://www.washingtonpost.com/climate-environment/2022/11/30/avi-kwa-ame-monument-nevada [https://perma.cc/S7ZS-CXLN]. In March 2023, President Biden proclaimed Avi Kwa Ame National Monument.167Proclamation No. 10533, 88 Fed. Reg. 17987 (Mar. 21, 2023). The proclamation explicitly recognizes the significance of landscape (mentioned thirty-seven times): including, for example, place-based traditional songs that connect to landmarks and enable Tribal members to “navigate across the diverse terrain, find essential resources, and perform healing, funeral, and other rituals.”168Id. The cultural landscape informs our understanding of the “people [that] have lived, traveled, and worked in [Avi Kwa Ame] for more than 10,000 years,” as evidenced by projectile points, pictographs, potsherds, and other tangible archaeological objects illuminating Indigenous history.169Id.

In August 2023, decades after Tribes were forcibly removed from lands that later became Grand Canyon National Park, President Biden established the Baaj Nwaavjo I’tah Kukveni—Ancestral Footprints of the Grand Canyon National Monument,170Proclamation No. 10606, 88 Fed. Reg. 55331 (Aug. 8, 2023). another sacred cultural landscape. This, the Grand Canyon Tribal Coalition fought to protect.171Bobby McEnaney, At Long Last, the Vision of the Grand Canyon Tribal Coalition Is Realized, Nat. Res. Def. Council (Aug. 21, 2023), https://www.nrdc.org/bio/bobby-mcenaney/long-last-vision-grand-canyon-tribal-coalition-realized [https://perma.cc/5JUA-8VTJ]. Not only are features of the landscape sacred components of the origin and histories of many Tribes, but “Tribes note that their ancestors are buried here and refer to these areas as their eternal home, a place of healing, and a source of spiritual sustenance.”17288 Fed. Reg. 55331. The “natural and cultural objects of the [Grand Canyon] lands” do indeed “have historic and scientific value that is unique, rich, and well-documented”173Id. —they were documented, and upheld, in the first major challenge to the Antiquities Act, 115 years prior.174See Cameron v. United States, 252 U.S. 450, 455 (1920).

Tribal Nations have seen success. Now, they have greater capacity to petition for monument status and more examples to draw from. Three national monuments that cover significant cultural landscapes were championed by Tribal advocates and incorporate Indigenous co-management.175McEnaney, supra note 171. Despite the ongoing challenge to Bears Ears and the fate of the Antiquities Act in limbo, and although the Antiquities Act does not reserve lands for the special or exclusive use of Tribes, it is nonetheless well-suited for one purpose: cultural heritage preservation. Tribes are choosing to utilize a law intended for cultural heritage, not environmental or recreational protection.176The designation of Avi Kwa Ame, in fact, clashed with a desire for renewable energy projects in the area, now off-limits. Prior to designation, the area comprising the monument was a wilderness area, which has substantially different prohibitions on development. Schechter, supra note 165. Again, the interests of Tribes, heritage protection advocates, and the environmental community are not always aligned. This is a natural and welcome outgrowth of the intent and flexibility of the Antiquities Act—areas that embody cultural and historic value are protected because of their connection to this nation’s first peoples.

Proper care and management of objects, as mandated by the Act, requires Tribal consultation and continued access to cultural landscapes. Indigenous knowledge and robust Tribal consultation—for planning and decision making—can and should be prioritized. Consider the Inter-Governmental Cooperative Agreement at Bears Ears, a vast recognition of the value of Tribal support and guidance in public lands management, especially for culturally significant landscapes.177Inter-Governmental Agreement, supra note 88. In the words of Bears Ears Commission Co-Chair and Lieutenant Governor of Zuni Pueblo Carleton Bowekaty:

[I]nstead of being removed from a landscape to make way for a public park, we are being invited back to our ancestral homelands to help repair them and plan for a resilient future. We are being asked to apply our traditional knowledge to both the natural and human-caused ecological challenges, drought, erosion, visitation, etc. . . . What can be a better avenue of restorative justice than giving tribes the opportunity to participate in the management of lands their ancestors were removed from?178O’Donoghue, supra note 88.

This manner of intergovernmental co-management is desirable for all parties. In addition to serving a restorative justice role, recognizing the resilience and continuing knowledge of Native communities, respecting a sovereign-to-sovereign relationship, and expiating hundreds of years of removal and genocide, it is well recognized that Indigenous traditional land management yields collective benefits in many regions of the world, including supporting biological diversity and conservation goals.179Kevin K. Washburn, Facilitating Tribal Co-Management of Federal Public Lands, 2022 Wis. L. Rev. 263 (2022); UNESCO World Heritage Convention, supra note 62.

Additionally, collective stewardship of public lands, preserved for future generations and contemplated by a land use designation like national monument status, is a better cultural fit for Native communities. Indigenous perspectives are often misaligned with the concept of private property and extractive use.180Talia Boyd, Native Perspectives: Land Ownership, Grand Canyon Tr. (June 29, 2021), https://www.grandcanyontrust.org/blog/native-perspectives-land-ownership [https://perma.cc/BWQ8-VTQ7]. For culturally significant areas important to multiple sovereign nations, national monument status and the collective management models it allows may encourage cooperation. At the least, coming together to advocate for monument status encourages political alliances between Tribal Nations and identification of key sites and priorities for an area.

Tribal use of the Antiquities Act is not without its challenges or its faults. Too often, protecting Tribal cultural heritage requires sharing information about sensitive sacred sites. This is no different. Though Presidents can strategically word proclamations to limit information disclosed, while still including sufficient detail to ensure the right areas are protected, this is a delicate balance to strike.181In the Avi Kwa Ame proclamation, this is explicitly recognized: “Some of the objects are also sacred to Tribal Nations; are sensitive, rare, or vulnerable to vandalism and theft; or are dangerous to visit and, therefore, revealing their specific names and locations could pose a danger to the objects or the public.” Proclamation No. 10533, 88 Fed. Reg. 17987 (Mar. 21, 2023). Moreover, public lands are public lands. They are not exclusively governed by Tribal Nations. Though Tribes in management roles can work to develop strategies that reduce impacts to key sites, preserve Native access for contemporary practice, and educate visitors, national monuments encourage tourism and result in greater visitation. This involves risk; there is always the question of whether the ends justify the means. However, for cases in which using the Act to preserve a landscape is “less a choice, and more a necessity,”182Noyes, supra note 4. A point is well-taken that monument designation depends entirely on how friendly a Presidential administration is toward tribal entities and public lands. Here, at least, there is no substantial barrier of Congressional approval. it is a feasible and effective solution.

CONCLUSION

In his denial of certiorari for Massachusetts Lobstermen’s, Chief Justice Roberts states: “The Northeast Canyons and Seamounts Marine National Monument at issue in this case demonstrates how far we have come from indigenous pottery.”183Statement of Chief Justice Roberts Respecting the Denial of Certiorari, supra note 34, at 3. But in setting Indigenous pottery as the starting line, Chief Justice Roberts demonstrates a fundamental misunderstanding of the history, ends, and means of the 1906 Antiquities Act. The Act was intended by its advocates and drafters to capture not only potsherds—the status of which, Blanding’s sting operation demonstrates, is still at risk—but also to secure the land around them, or the living cultural context that informs these objects.

The Act has served its purpose well in the more than one hundred years since its enactment. Only three of twenty-one Presidents since the signature of the Act have chosen not to use it. Legal challenges to the Act consistently fail, with federal courts interpreting its language broadly. Chief Justice Roberts decries that the “objects” to be protected might include “canyons and seamounts themselves.” He fails to mention that the very first challenge to the Act, right after it was signed, held that the Grand Canyon was an “object.” Historical context, statutory purpose, and past practice all support the conclusion that the President has broad authority to use the Antiquities Act to protect cultural landscapes at large, not just “objects” in the narrowest sense of the word.

The Act is not only unique in what it protects, but is also crucial to the fragile and limited American cultural heritage management scheme. Without it, archaeological and cultural resources on our public lands could face irreparable and irreversible harm, and communities who depend on cultural heritage for their economic survival through tourism revenue might lose care, education, and access.184See Nate Hegyi, Tourism Worries and Few Takers as More Utah Land Offered for Drilling, Mining, Nat’l Pub. Radio (Feb. 9, 2020), https://www.npr.org/2020/02/09/804232481/tourism-worries-and-few-takers-as-more-utah-land-offered-for-drilling-mining [https://perma.cc/GVH6-DRTT]. The Indigenous communities to whom these landscapes matter most are not to be forgotten—they would be severely impacted by the desecration of their cultural practices and connections to their ancestors.

Curiously, even large national monuments designated around the same time as Bears Ears—such as Mojave Desert, of greater acreage, also designated in 2016 by President Obama185Mojave Desert encompasses 1.6 million as compared with Bears Ears’ 1.32 million acres. Proclamation No. 9395, 81 Fed. Reg. 8371 (Feb. 12, 2016).—have not faced the same wave of repeated challenges as Bears Ears. If opponents of Bears Ears can demonstrate that she does not fall within the scope of the Antiquities Act, no cultural landscape will. Bears Ears is a fundamental example of the meaning and importance of cultural landscapes. Those who target Bears Ears do so because if you can prove in this context—in the American Southwest, the cradle of and impetus for the Antiquities Act—that the Act is limited to structures and objects alone, and does not encompass cultural landscapes, you can do so anywhere.

The battle to preserve the cultural landscape that is Bears Ears National Monument continues. The Antiquities Act is not a piece of legislation that has been molded, bent out of shape, or expanded to include Hoon’Naqvut, Shash Jáa, Kwiyagatu Nukavachi, and Ansh An Lashokdiwe. It was born in this place and intended, from the beginning, to protect her.

97 S. Cal. L. Rev. 1119

Download

* Executive Senior Editor, Southern California Law Review, Volume 97 (2023 Scribes Award Winner for Excellence in Note Writing); J.D. 2024, University of Southern California Gould School of Law; B.A. Political Science 2019, Stanford University. This Note was written on the unceded sacred and traditional lands of the Tongva (Gabrielino), the first native peoples. The Tongva are the original stewards of this land and continue in this role today. This land has been used to share knowledge for generations. I hope this publication continues the Tongva tradition of teaching respect for and connection to land and landscape. Thanks to Amber Madole for her endless support of USC Gould’s Native American Law and Law Students Association and tireless advocacy for Tribal and Native representation at Gould and in legal scholarship. Thank you to Professor Robin Craig, for serving as my mentor and advisor, to Fran Ferrance, and to my parents, for their support and pride. Finally—thanks to Dr. SB, who brainstormed titles for publication with me next to the Nafplion harbor. Though I served as a declarant for the Archaeological Institute of America in its motion to intervene in Garfield County et. al v. Biden, views expressed are my own and not those of any litigating organizations or their attorneys. Information regarding the status of the pending lawsuit and the number of national monument designations is current as of the dates specified and is subject to change.

Defining the Relationship: California’s Noncompete Laws and Exclusivity in the Acting Industry Leading Up to the 2023 SAG-AFTRA Strike

California is firm in its stance against post-term noncompete clauses. This Note examines early Hollywood and the historical and economic context in which talent contracts arose. It analyzes the shift in talent contracts from the harsher terms of the 1930s studio system to the modern terms which give more control to actors.

Exclusivity exists in both the film and television industry. However, the in-term and post-term treatment of exclusivity provisions and noncompetes has received conflicting treatment by California and Ninth Circuit Courts, suggesting that perhaps the California Supreme Court should weigh in on the matter as they did in 2008 with Edwards v. Arthur Andersen and articulate whether Section 16600 can apply to in-term noncompete and exclusivity provisions. While it is widely held that Section 16600 does not apply to in-term noncompetes, the holding in ITN Flix, LLC v. Hinojosa suggests that certain situations in the acting industry may trigger its application and deem an in-term noncompete invalid if unduly harsh.

Regardless, the ability of actors and unions to negotiate with studios for mutually beneficial terms has allowed common practices in entertainment contracts to shift over time without much recent legislation. This suggests that, while the applicable law will provide one side with bargaining power, negotiations and collective-bargaining agreements will largely continue to set the standards for common entertainment contract practices.

INTRODUCTION

From 2005 to 2010, American actress Katherine Heigl appeared in the hit ABC television medical drama, Grey’s Anatomy, as the supporting role of Dr. Izzie Stevens.1Katherine Heigl, IMDb, https://www.imdb.com/name/nm0001337 [https://perma.cc/K8VD-9CVP]. Simultaneous to her work on the television series, Heigl starred in films that would soon become classics of the 2000s, such as Knocked Up (2007), 27 Dresses (2008), and The Ugly Truth (2009).2Id. By 2008, Heigl had been deemed the new “It Girl” of Hollywood by Vanity Fair as a result of her acting endeavors.3Leslie Bennetts, Heigl’s Anatomy, Vanity Fair (Jan. 1, 2008), https://www.vanityfair.com/news/2008/01/heigl200801 [https://perma.cc/YG3J-MH7L]. Knocked Up had been a box office hit, and Heigl had taken home the 2007 Primetime Emmy Award for Outstanding Supporting Actress in a Drama Series for her television work on Grey’s Anatomy.4Katherine Heigl, Television Academy, https://www.emmys.com/bios/katherine-heigl [https://perma.cc/3LKF-AKBV]. Heigl had pursued a career in film while still appearing as a series regular on Grey’s Anatomy, rendering her a household name across the United States. Meanwhile, the lead of Grey’s Anatomy—Ellen Pompeo—had not appeared in a role outside of Dr. Meredith Grey since the series inception in 2005.5Pompeo has not appeared in other roles since Grey’s Anatomy was first released, apart from a cameo in a Taylor Swift music video and a “meow” in a children’s animated series voiceover. Dan Clarendon, A Recap of Ellen Pompeo’s Roles Outside of ‘Grey’s Anatomy’, TV Insider (Aug. 13, 2022, 12:00 PM), https://www.tvinsider.com/1055398/ellen-pompeo-tv-movie-roles-greys-anatomy [https://perma.cc/V9S6-5VXZ]. While this could be attributed to personal choices, it may instead be the result of contractual differences between a series’s lead actress and a supporting actress, stemming from varying exclusivity terms. It is possible that Ellen Pompeo, as the lead role of Meredith Grey, was “contractually forbidden from acting elsewhere.”6Id. In 2018, Pompeo told The Hollywood Reporter, “I don’t get to do anything else, and that’s frustrating for me creatively. I make 24 episodes of TV a year, and as part of this deal, I cannot appear anywhere else.”7Id. While exclusivity terms can be frustrating to an actor,8“Actor” is used throughout this Note to capture both actors and actresses. as demonstrated by Pompeo’s statement to The Hollywood Reporter, many studios view exclusivity as vital to production. Studios use exclusivity to coordinate production schedules and ensure talent’s availability as they invest time and money in the creation of a particular character.

In 2022, exclusivity terms could be freely negotiated in the entertainment industry when union actors were paid above an “exclusivity money break.”9SAG-AFTRA Netflix Agreement, SAG-AFTRA (Aug. 10, 2022), https://www.sagaftra.org/files/sa_documents/SAG-AFTRA_Netflix_2022.pdf [https://perma.cc/2MSC-6T7V]. The exclusivity money break is a minimum salary bargained for by SAG-AFTRA,10SAG-AFTRA is a prominent actors union, discussed in detail in later sections. Because the vast majority of Hollywood actors are union members of SAG-AFTRA, nonunion actors are not the focus of this Note. above which producers and talent can negotiate exclusivity freely. This would often result in producers paying substantial salaries for the exclusivity of top talent.11Charles Rivkin, A New Threat to California Film, Television and Streaming Jobs (Opinion), Variety (Aug. 1, 2022, 9:38 AM), https://variety.com/2022/film/news/charles-rivkin-mpa-california-bill-ab-437-television-streaming-jobs-1235330603 [https://perma.cc/VS9A-KQE5]. Meanwhile, union actors paid below the exclusivity money break (non-star talent) were restricted in their ability to grant exclusivity.12SAG-AFTRA, supra note 9; Rivkin, supra note 11. While these restrictions are discussed later in detail, actors in this latter non-star category who worked on a streaming show were often able to appear in feature films, as guests on other shows, and in commercials, yet they sometimes had to receive permission from studios in order to do so.13Rivkin, supra note 11. Practical difficulties sometimes arose in the process of coordinating outside roles, as the actor’s main show would take scheduling precedence and restrict the actor’s ability to alter their appearance.14SAG-AFTRA, supra note 9. This tension between actors, who often want the freedom to appear in additional roles, and studios, who face the challenges of coordinating production schedules and may have invested in an actor’s image, has led to debate in Hollywood surrounding the use of exclusivity provisions in talent contracts. Exclusivity, along with other issues concerning residuals and the use of Artificial Intelligence, contributed to SAG-AFTRA’s 2023 strike against Hollywood’s major studios, lasting 118 days from July to November of 2024 while new contract boundaries were negotiated.15SAG-AFTRA Slams ‘Bullying Tactics’ as Strike Talks Break Down With Studios, TIME (Oct. 12, 2023, 12:31 PM), https://time.com/6323071/actors-strike-talks-suspended [https://perma.cc/RQ46-RH8M]; Gene Maddaus, SAG-AFTRA Approves Deal to End Historic Strike, Variety (Nov. 8, 2023, 4:40 PM), https://variety.com/2023/biz/news/sag-aftra-tentative-deal-historic-strike-1235771894 [https://perma.cc/6QNZ-5M3T]. This Note will examine the contractual exclusivity terms leading up to the 2023 strike. The rise of online streaming has increased the demands placed on actors, as shorter series seasons contribute to more idle time for actors. The landscape is changing rapidly, resulting in the frequent renegotiation of terms and resulting standstills, exemplified by the 2023 SAG-AFTRA strike.

Exclusivity provisions are contractual terms that prevent an employee from working elsewhere during the term of their employment, resulting in the employee giving their undivided effort to the employer. Noncompete clauses are another set of contractual terms that also appear in talent contracts and typically restrict an employee from working for certain third-parties. Noncompetes may last for only the term of the employment, and thus be in-term restrictions, or they may last after the employee no longer works for the current employer, which is a post-term restriction.16Ananya Nair, What Is the Difference Between a Non-Compete Clause and an Exclusivity Clause?, LinkedIn (Oct. 17, 2021), https://www.linkedin.com/pulse/what-difference-between-non-compete-clause-exclusivity-ananya-nair [https://perma.cc/865C-UTQL].

Arguments exist on both sides of the general debate around exclusivity provisions and noncompetes in the talent industry. Opponents argue that they hinder competition by restricting the free movement of talent and ideas.17See Lindsey Schmidt, A More Reasonable Approach to Noncompete Employment Agreements in California, 48 J. Legis. 145, 155 (2021). In the talent industry specifically, SAG-AFTRA has argued that exclusivity terms in actors’ personal service agreements are often used to “hold series regulars off the market and unable to work for unreasonably long periods of time”18See David Robb, SAG-AFTRA Board Overwhelmingly Approves Deal with AMPTP That Sharply Limits Exclusivity in TV Actors’ Personal Service Agreements, Deadline (Aug. 20, 2022, 5:36 PM), https://deadline.com/2022/08/sag-aftra-board-approves-deal-amptp-limits-exclusivity-tv-actors-personal-service-agreements [https://perma.cc/S63E-FATW]. while a show is in an off period.19An “off period” in a television series is a period during which the show is not actively filming, but the actor is still under contract as the next season of filming is approaching. SAG-AFTRA has noted that these exclusivity provisions were less burdensome when talent worked three quarters of the year on twenty episode seasons, but that the rise of online streaming platforms, year-round production cycles, and short seasons of thirteen episodes (or sometimes fewer) have left lower and mid-level actors short of work.20Joe Otterson, How Exclusive Contracts Leave Writers and Actors Scrambling to Navigate Supercharged Job Market, Variety (Dec. 23, 2021, 10:15 AM), https://variety.com/2021/tv/entertainment-industry/exclusive-deals-streamers-prestige-television-1235142536 [https://perma.cc/Y892-CNW7]. Some actors are “not being paid while on hold between seasons, but they’re also not allowed to accept other paying jobs. These contracts mean that actors often find themselves collecting unemployment, struggling to pay their bills and unable to build a career.”21Duncan Crabtree-Ireland, Forced Exclusivity Terms in Actor Contracts Add a Dark Side to Hollywood’s Golden Age (Opinion), Variety (Aug. 5, 2022, 10:00 AM), https://variety.com/2022/tv/news/sag-aftra-duncan-crabtree-ireland-exclusivity-law-act-1235333015 [https://perma.cc/FPL7-MXWS]. The arguments against exclusivity and noncompetes have received national attention.22Mitch Danzig & Paul Huston, How FTC Could Regulate Noncompetes After Biden’s Order, Law360 (July 15, 2021, 3:14 PM), https://www.law360.com/articles/1403236/how-ftc-could-regulate-noncompetes-after-biden-s-order [https://perma.cc/FPL7-MXWS]. In 2021, President Biden issued an executive order attempting to limit the effect of noncompetes, authorizing the Federal Trade Commission (“FTC”) to “interpret noncompetition agreements as an unfair method of competition, and thereby declare them unlawful.”23Id. SAG-AFTRA was “ ‘thrilled to see President Biden take steps to curtail the use of unfair non-compete clauses, which are a major problem for . . . actors . . . .’ ”24See SAG-AFTRA Applauds President Biden’s Effort to Address Anti-Competitive Employment Practices, SAG-AFTRA (July 9, 2021), https://www.sagaftra.org/sag-aftra-applauds-president-bidens-effort-address-anti-competitive-employment-practices [https://perma.cc/FP4V-3H7S]; see also SAG-AFTRA Celebrates 10th Anniversary of Merger of Screen Actors Guild and American Federation of Television and Radio Artists, SAG-AFTRA (Mar. 30, 2022), https://www.sagaftra.org/sag-aftra-celebrates-10th-anniversary-merger-screen-actors-guild-and-american-federation-television [https://perma.cc/E27A-K8RC].

Conversely, proponents of exclusivity provisions and noncompetes argue that without them, employers have little incentive to invest in the development of their employees.25See Schmidt, supra note 17. In the acting industry specifically, production studios have concerns regarding the coordination of “complex production schedules involving hundreds—or at times even thousands—of people [with the] talent’s availability.”26See Rivkin, supra note 11. The California Chamber of Commerce argues that without exclusivity, talent contracts are far “less valuable, which will lead to a reduction in wages paid to actors.”27Tom Tapp, California’s AB 437, Which Would Limit Exclusivity in TV Stars’ Deals, Faces Crucial Vote, Deadline (Aug. 1, 2022, 1:42 PM), https://deadline.com/2022/08/californias-ab-437-exclusivity-tv-stars-deals-sag  [https://perma.cc/DA2F-K5MK]. Charles Rivkin, Chairman and CEO of the Motion Picture Association, views exclusive employment agreements as the backbone of scheduling for film, television, and streaming productions: they “provide the certainty necessary for producers to finance, insure, plan for and complete major feature film, television and streaming projects, particularly those involving long-term story arcs. They assure writers and showrunners that characters developed in one season can be brought back for subsequent storylines.”28Rivkin, supra note 11. Further, studios note an interest in preserving the investment of “millions [of dollars] in developing and promoting a show [and its actors, and] don’t want to see the lead actor show up in another series on a rival network.”29Gene Maddaus, California Considers Bill That Would Free Actors from Exclusivity Deals, Variety (Aug. 1, 2022, 8:40 AM), https://variety.com/2022/tv/news/california-exclusivity-legislation-1235329639 [https://perma.cc/HV73-QZ2A]. A studio may seek to control the image of a particular actor to protect its franchise from actions which could negatively impact the performance of the franchise, and thus, the studio itself.

Section 16600 of the California Business & Professions Code (“Code”) acts as a per se ban on noncompete agreements in California, as it “does not permit non-compete clauses, even if they are reasonable in scope and purpose.”30The Validity of California Non-Compete Clauses, The Nourmand Law Firm, APC (Mar. 11, 2021), https://www.nourmandlawfirm.com/blog/the-validity-of-california-non-compete-clauses [https://perma.cc/2UX6-N6TD]. While many states operate under reasonableness standards, enforcing noncompetes when they are reasonable in scope and duration,31For example, Massachusetts will enforce noncompete agreements “if they: [1] are reasonable in duration, geographic area, and scope, [2] are necessary to protect a legitimate business interest, [3] are consonant with public policy, and [4] contain a ‘garden leave’ clause.” Non-Compete Agreements—When Are They Enforceable?, Katz Law Group, P.C., https://www.katzlawgroup.com/non-compete-agreements [https://perma.cc/DH64-P5SH]. the California Supreme Court case Edwards v. Arthur Andersen is strong in both its language and policy rationale against the enforcement of post-term employment restrictions. Following the 2008 decision in Edwards, discussed later in this Note, any post-term employment restriction is likely to fail. Discussion around the Code has focused on Silicon Valley, where the ban on noncompetes has often allowed technology companies and start-ups to innovate rapidly as employees move from company to company. However, little attention has been paid to the application of Section 16600 in Hollywood, particularly to its recent extension by the Ninth Circuit to reach in-term agreements in the acting industry in ITN Flix, LLC v. Hinojosa.32ITN Flix, LLC v. Hinojosa, 686 F. App’x 441, 444 (9th Cir. 2017).

This Note investigates noncompete agreements and exclusivity in the entertainment industry through the lens of Section 16600 and critically analyzes recent decisions involving the extension of Edwards to in-term noncompetes and exclusivity agreements in talent contracts. Historically, case law involving Section 16600’s ban on noncompetes has been limited to post-term employment restrictions. In the acting context, post-term noncompetes are those which restrict an actor’s employment options after they no longer work with a particular studio. However, a recent Ninth Circuit case (ITN) broadens the scope of Section 16600 to potentially invalidate in-term noncompete contracts that restrict an actor’s work, even if only for the duration of the employment contract. This Note discusses the Ninth Circuit’s decision, weighing the tension between exclusivity proponents and opponents to explore the extension of Section 16600 to in-term noncompetes and exclusivity clauses in the talent context.

Part I examines early Hollywood and the historical and economic context in which talent contracts arose. It analyzes the shift in talent contracts from the harsher terms of the 1930s studio system to the modern terms which give more control to actors. It provides a summary of common industry practices prior to the 2023 SAG-AFTRA strike. The agreements between SAG-AFTRA and major studios that led up to the strike are also explored, highlighting the prevalence of exclusivity while weighing the tension between its proponents and opponents.

Part II discusses Section 16600 and the significant 2008 California Supreme Court decision, Edwards v. Arthur Andersen, after which any post-term restriction on employment in California will likely fail.

Part III analyzes the possible extension of Edwards to in-term noncompete agreements and the changes that this extension may bring to exclusivity in the acting industry. To do so, it touches again on common exclusivity practices as well as in-term and post-term noncompete practices in the acting industry, while critically analyzing case law to explore how Section 16600’s extension to in-term noncompete and exclusivity provisions may bring unintended results. It concludes with a suggested theoretical legal standard that would consider an actor’s fame when analyzing exclusivity and noncompetes.

Part IV summarizes the case law findings and asks the California Supreme Court to weigh in on the conflicting lower court precedent, and the conclusion summarizes the findings from this Note.

I.  HISTORY OF THE ENTERTAINMENT INDUSTRY

[T]he legal framework . . . in which all entertainment and media businesses operate is constantly challenged and in need of regular review and adjustment.

—Harold L. Vogel33Harold L. Vogel, Entertainment Industry Economics: A Guide for Financial Analysis 55 (Cambridge Univ. Press, 10th ed. 2020).

A.  The Economics of the 1930s Studio System

Shortly after film was introduced to the United States in the early 1900s, the major film studios realized there was immense potential for vertical integration and cost minimization in the film industry.34Id. They quickly began to operate almost every stage of film production, from “production [and] distribution [to] exhibition.”35Studios with control of all three stages of production were dubbed “the Big Five”: Warner Brothers, RKO, Twentieth Century Fox, Paramount, and GM. Smaller companies had trouble competing, although Universal and Columbia shadowed the Big Five with control over production and distribution, but not exhibition. Id. at 91–92. See generally Thomas Schatz, The Genius of the System (Metropolitan Books, 1988). The West Coast (and Southern California in particular) emerged as the heart of this new and emerging studio system, as Hollywood was “far for the Trust enforcers to reach [and] . . . provide[d] low-cost nonunion labor and an advantageous climate and geography for filming.”36Vogel, supra note 35, at 91–92. After the Great Depression, it was only “the companies with the most vertical integration . . . that survived,” further concentrating the control over the movie industry.37Id. An incredibly “productive [and] efficient” synergy emerged as the major companies cooperated in “a ‘mature oligopoly’ ” with a significant share of the Hollywood market.38Schatz, supra note 35, at 18–20.

Vertical integration characterized this era of film production, with the major companies exerting control over large parts of the industry.39Id. By the 1930s, this studio system led to stars signing “long-term contracts.”40Brent Lang, How Olivia de Havilland Took on the Studio System and Won, Variety (July 27, 2020, 12:59 PM), https://variety.com/2020/film/news/olivia-de-havilland-lawsuit-gone-with-the-wind-warner-bros-1234717146 [https://perma.cc/8KB2-T4KF]. While accompanied by cost minimization and efficiency, these contracts were often harsh and demanding, leaving little autonomy to actors.41Id. This early coordination of “studio operations [with] marketing strategies” brought “substantial [cost] savings [to] the studio [system.]”42Schatz, supra note 35, at 49. Exclusive contracts between stars and studios could last up to seven years, and in practice, even longer. 43Aljean Harmetz, Hollywood, the Marriage of Studios and Stars Is Back, N.Y. Times (Jan. 8, 1984), https://www.nytimes.com/1984/01/08/arts/hollywood-the-marriage-of-studios-and-stars-is-back.html [https://perma.cc/2JAN-PGF9]; see also Star System, Film Reference, http://www.filmreference.com/encyclopedia/romantic-comedy-yugoslavia/star-system-the-studio-system-and-stars.html [https://perma.cc/E88Z-YMP3]. Actors such as “Bette Davis and James Cagney were constantly suspended without pay by Warner Bros. for refusing roles.”44Id. Actors who were suspended, such as Davis and Cagney, due to their “refus[al] to be loaned out to another studio or declin[ing of] a role . . . could be suspended without pay[, with the] length of the suspension . . . added to that of the contract,” extending contracts beyond seven years.45Lang, supra note 40; see also Harmetz, supra note 43.

Early challenges to these contractual practices were unsuccessful, as demonstrated by Bette Davis’s 1937 lawsuit against Warner Brothers.46Davis sought to be released from her contract with Warner Brothers after being cast in a series of unfavorable roles; she wanted to pursue films in England that she believed would be a better fit. Her lawsuit, alleging that the contract was unenforceable due to its inequitable suspension and extension clauses that added time to the contract for “suspension periods incurred during the contract term,” was unsuccessful, and Davis was required to return to Warner Brothers and fulfill her term contract. John M. Broderick, Warner Bros. v. Nelson: A Prelude to the De Havilland Law, 41 Loy. L.A. Ent. L. Rev. 111, 111 (2021); see also Richard Brody, The Clippings File: Bette Davis and the System, The New Yorker (Sept. 6, 2012), https://www.newyorker.com/culture/richard-brody/the-clippings-file-bette-davis-and-the-system [https://perma.cc/2H4G-8RTL]. The same year as Davis’s unsuccessful lawsuit, California (home of Hollywood and longtime proponent of employee rights) enacted Section 2855 of the Code to limit the indefinite employment contracts often abused by studios.47Krishna Parekh & Brandon Anand, The “Seven Year Rule”: CA Labor Code § 2855 & The Entertainment Industry / 7 Year Rule, Anand Law, https://www.anandlaw.com/the-seven-year-rule-california-labor-code [https://perma.cc/CC8B-EFHE]. Commonly referred to as the “Seven Year Rule,” Section 2855 “limits the term of personal service employment to seven years,” rendering any personal-service contract unenforceable past the seven-year mark.48Id. This Section was tested in 1943, when Olivia de Havilland sued Warner Brothers.49Lang, supra note 40. The studio had refused to release De Havilland from her seven-year contract (despite the seven years lapsing) and claimed that her refusal to accept certain roles over the years had resulted in the addition of six months to her contract.50Id. This practice of adding time to contracts was common in Hollywood, and these “suspension/extension” provisions (previously upheld in Bette Davis’s case) “could double the term of an actor’s contract.”51Broderick, supra note 46, at 111. De Havilland successfully argued that Warner Brothers was breaching its contract—as the contract was for seven years regardless of her refusal of certain roles—and violating labor law in doing so, as California had a statutory limit of seven calendar years on the enforcement of employment contracts.52Id. This marked the beginning of a new era of bargaining power for employees. This monumental decision applied “to more than just Hollywood[, as it] applied to every employee in California.”53Lang, supra note 40. While the extension provisions of De Havilland’s contract were deemed illegal in 1943, her concern around being held off the market is still shared by many series regulars today in the debate around exclusivity. An actor’s desire to pursue additional roles may conflict with a studio’s desire to coordinate production schedules.

Until the late 1940s, the major studios had maintained almost complete vertical integration of the film production process, evading various antitrust charges through government deals.54Among the antitrust charges was block booking, which is “illegally conspiring to restrain trade by . . . causing an exhibitor who wanted any of a distributor’s pictures to take all of them.” Vogel, supra note 33, at 92. But in 1948, Paramount was found guilty of price-fixing by the Supreme Court in an antitrust lawsuit.55Erin Blakemore, How TV Killed Hollywood’s Golden Age, History (June 1, 2023), https://www.history.com/news/how-tv-killed-hollywoods-golden-age [https://perma.cc/ULW7-JD2K]. This case, widely known as the beginning of the end of Hollywood’s Golden Age, forced film studios to break up their vertically integrated practices.56Star System, supra note 43. A decree was signed by the major studios which “separated production and distribution from exhibition.”57Vogel, supra note 33, at 92.

This separation of exhibition from other links in the production chain played a transformative role in fundamentally shifting entertainment industry practices, replacing the long-term contracts of the 1930s with the disintegrated model of the 1950s.58Star System, supra note 43. With the collapse of the vertically integrated studio system, long-term contracts and standard seven-year exclusivity provisions were phased out and replaced by the freelance model that is still in place today.59Id.

Later, the emergence of television caused movie theater attendance to decline, leading studios to limit film production.60Id.; see also Blakemore, supra note 55. “[C]ontracted stars . . .  became a hugely expensive overhead,” moving the industry into a freelance model as studios looked to cut costs.61Star System, supra note 43. The relationship between studios and talent shifted as stars were given more freedom to choose their roles. Still, studios often incorporated exclusivity terms into deals as “series regular actors were busy working almost the entire year, with long production periods and short hiatuses that made their employment similar to other full-time jobs.”62Crabtree-Ireland, supra note 21. As film and series productions have grown in size and scale, studios argue that the exclusivity of actors involved in a production is essential to the coordination of various schedules and logistics. The ability to contract for the exclusivity of certain well-known actors may offer large incentives for a studio to invest in a production.63The early 2000s marked another shift as cable television transitioned into online streaming, creating new opponents to exclusivity provisions as series actors were cast for shorter seasons with more off time. Netflix emerged in 2007, followed by the introduction of Disney Plus (Disney’s online streaming service) in 2019, Warner Media’s HBO Max in 2020, and NBCUniversal’s Peacock streaming service in 2020. “[T]he Big Three entertainment companies launch[ing] their video platforms” solidified the substitution of traditional media with online entertainment. “[S]treaming services [are ordering] fewer episodes and cancel[ing] series after shorter runs, [thus employees] are having to switch jobs more frequently” to stay working. See Brooks Barnes, The Streaming Era Has Finally Arrived. Everything Is About to Change., N.Y. Times (Nov. 19, 2019), https://www.nytimes.com/2019/11/18/business/media/streaming-hollywood-revolution.html [https://perma.cc/DPR6-839M]. New arguments against exclusivity criticize the forced idle time it leads to as series regulars have shorter production schedules and are left unable to work during breaks. See also Crabtree-Ireland, supra note 21.

B.  Modern Talent Contracts

Currently, Hollywood does not operate by the onerous long-term contracts that once existed, as modern talent contracts are no longer set at seven-year terms of exclusive work as they were in the 1930s. Instead, talent is cast specifically from project to project, often incorporating exclusivity clauses in both the film and television industry. Special contract terms, such as option contracts and pay or play contracts, raise similar issues to in-term exclusivity surrounding an actor’s ability to pursue other roles. The following paragraphs state the entertainment industry terms as they existed prior to the 2023 SAG-AFTRA strike.

1.  Film

Film production begins and ends on (more or less) defined dates.64Jill L. Smith, Perk Points, L.A. Law., May 2015, at 18, https://www.kleinberglange.com/wp-content/uploads/2015/05/Jill_Smith_Los_Angeles_Lawyer.pdf [https://perma.cc/WBH9-5FXD]. As a result, exclusivity terms in movie deals are common and are rarely a source of extreme debate. An actor’s film contract will often explicitly include the dates for “consecutive exclusive preproduction services, a specified number of weeks for shooting, and a maximum number of days for postproduction services.”65Id. at 18–20 (emphasis added). In-term exclusivity provisions tend to accompany the preproduction and production period,66Preproduction often includes rehearsals and costume fittings, while the production period largely revolves around actual filming. Id. as exclusivity is often used to coordinate scheduling among large casts and crews, and actors are left with little idle time during film rehearsals and shooting. Conversely, postproduction requests67Postproduction requests may include press tour appearances or reshoots of particular scenes. Id. are generally subject to an actor’s availability, as an actor’s work will typically be completed and any post-term restriction preventing the actor from accepting other jobs would likely be invalidated by Section 16600.68Id.

It may be possible, however, for talent contracts to include postproduction restrictions preventing actors from working on certain projects for a certain amount of time even after a movie has completed filming. One can imagine this being the case for actors in the Marvel Universe.69Dean Ravenola & Brian Boone, Rules Actors Have to Follow When Joining the MCU, Looper (Jan. 31, 2023, 7:59 AM EST), https://www.looper.com/139571/rules-actors-have-to-follow-when-joining-the-mcu [https://perma.cc/U2SY-6Z87]. For example, Chris Hemsworth—popular for his role as superhero Thor in the Marvel Comics (“Marvel”) film Thor as well as The Avengers—could theoretically be unable to appear in films by Marvel’s direct competitor, DC Comics (“DC”).70Id.; see also Edward Nigma, Chris Hemsworth Confirms That Marvel Actors Aren’t Allowed to Be in DC Movies, Fortress of Solitude (June 19, 2017), https://www.fortressofsolitude.co.za/marvel-actors-arent-allowed-dc-movies. Marvel and DC are both immensely popular comic-book publishers that have transformed their comic-book characters into big-screen franchises. If these contractual provisions exist, they may be legally vulnerable, as Section 16600 invalidates post-term restrictions on an employee’s work.

It is also possible that these terms, which (on face value) appear to be post-term, are actually in-term restrictions, and thus valid under Section 16600. The acting industry has a unique gray area between in-term and post-term restrictions when an actor is no longer actively filming but may be called back for a reshoot, or–for example–when an actor is no longer filming the first Thor movie but is still under an exclusive contract for the second movie. While this may be in-term contractually, it has post-term implications as an actor’s work is restricted while they wait for the next production cycle to begin. This area of talent contracts seems to be the most legally vulnerable, especially under the Ninth Circuit’s extension of Section 16600 in ITN to invalidate in-term noncompete provisions.

2.  Series and Short Form (Television)

While not heavily debated in film contracts, exclusivity terms are a highly contentious subject of debate in television and series contracts. Television seasons may be short, and actors may find themselves wanting to solicit intermittent work, leading them to seek additional roles while still under contract with another show. Unlike films, there is not a set beginning and end date in television series production, as shows are in “a relatively constant state of production and postproduction during which there will be stretches of time when an actor’s services are not needed.”71Smith, supra note 64, at 21. Exclusivity terms can vary greatly depending on the contractual terms negotiated: a series actor who wants to render outside services may either be free to do so, may need special permission from the studio, or may be prohibited from doing so.72Id. at 21–22.

When a television talent contract does allow an actor to pursue additional roles, there are practical limits to an actor’s ability to do so, as studios prefer their talent to be somewhat exclusive to their shows.73Id. at 22. For example, scheduling work on a feature film is difficult, as television series production is demanding and leaves only a few days off at a time (apart from true offseasons).74Id. at 21–22. Further, many deals preclude an actor from appearing on another television series, apart from “a limited number of guest spots, appearance in foreign commercials and services in nonidentified voice-over commercials.”75Id. at 22. Standard series agreement deals (as of December 2021) allowed networks and streaming services to enter into exclusivity deals with talent for “anywhere from nine months to more than a year in some cases,” making the process of rendering outside services difficult during this period.76Otterson, supra note 20.

Noncompete and exclusivity terms for a series contract generally fall in the category of in-term restrictions, as they apply to the actor while they are still in a contract for their current series. They are therefore not legally vulnerable under the historical interpretation of Section 16600, which has traditionally applied only to post-term restrictions. If, however, the Ninth Circuit’s extension of Section 16600 in ITN to invalidate in-term employment restrictions is valid, then these common industry practices may be legally vulnerable.77ITN Flix, LLC v. Hinojosa, 686 F. App’x 441, 441 (9th Cir. 2017).

3.  Option Contracts

When a film or television series is part of a larger, ongoing story and multiple production periods are likely (such as classic blockbuster films like Wonder Woman and Spider-Man for which sequels can be anticipated), option contracts are often used. An option clause in a talent contract “gives the producer or studio the sole right, or ‘option,’ to extend a contract for an additional period of time [and] commits the actor to working on the subsequent television or new media season.”78What Are Options and Exclusivity Clauses?, Service SAG-AFTRA, https://servicesagaftra.custhelp.com/app/answers/detail/a_id/2188 [https://perma.cc/CBS8-QTKB]. The option period can last anywhere from months to years.79Jan Breslauer, What You Need to Know About Entertainment Contracts: Part Deux, Breslauer L. (Nov. 8, 2014), https://www.breslauerlaw.com/what-you-need-to-know-about-entertainment-contracts-part-deux [https://perma.cc/6UGV-4AHM]. While option clauses ensure that characters an audience has come to know and love will be returning in the same role, they have the potential to prevent an actor from accepting additional work when paired with exclusivity terms, as options can be exercised even if no start date has been set for the next project.80What Are Options and Exclusivity Clauses?, supra note 78. This raises similar issues to exclusivity and noncompetes, as actors may be kept off the market for unreasonably long periods of time, yet studios have an interest in ensuring well-known characters such as Wonder Woman and Spider-Man are able to return for a sequel.

4.  Pay or Play

A pay or play term in a contract guarantees that an actor will be paid for their role in a production, regardless of whether they are used or whether the production gets made.81Dominique Saint Malo, Pay or Play Contract—How Does It Affect Your Production?, StudioBinder (Feb. 20, 2022), https://www.studiobinder.com/blog/pay-or-play-contract [https://perma.cc/5U5M-L6UD]. These terms can typically only be negotiated by top talent, as they require the studio to pay the actors their full salaries “even if they are terminated before rendering all of their services.”82‘Pay or Play’ Contracts: Behind the Scenes of Johnny Depp’s Fantastic Beasts Exit, Harbottle & Lewis (Nov. 24, 2020), https://viewpoints.harbottle.com/post/102hbg6/pay-or-play-contracts-behind-the-scenes-of-johnny-depps-fantastic-beasts-exit [https://perma.cc/XJ9D-PAFR]. This compensates the talent for rejecting other “lucrative” roles with similar time lines due to the expectation of exclusivity surrounding their involvement in the pay or play production.83Id. Pay or play terms therefore symbolize the acknowledgement by studios that exclusivity is highly valuable, and as such, studios are willing to compensate top talent highly for the opportunity cost of rejecting other roles. Since these terms are often accompanied by exclusivity terms, they may be subject to challenges if Section 16600 is extended to invalidate in-term exclusivity.

C.  Industry Practices

The Screen Actors Guild (“SAG”) was founded in 1933 as a union representing actors in “film, television, and digital media.”84Matt Crawford, What Is SAG-AFTRA? History, Origins & How To Get Membership, Filmmaking Lifestyle, https://filmlifestyle.com/what-is-sag-aftra [https://perma.cc/4WSP-25UU]; see also The History of the Unions During the 1930s, SAG-AFTRA, https://www.sagaftra.org/about/our-history/1930s [https://perma.cc/2TFY-DL52]. In 2012, the Screen Actors Guild merged with the American Federation of Television and Radio Artists (“AFTRA”) to form SAG-AFTRA, a powerful union representing “approximately 160,000 actors, announcers, broadcast journalists, dancers, DJs, news writers, news editors, program hosts, puppeteers, recording artists, singers, stunt performers, voiceover artists and other media professionals.”85About, SAG-AFTRA, https://www.sagaftra.org/about [https://perma.cc/R2MK-B8JX]. The union is frequently in talks with major production studios to exercise its collective-bargaining power and achieve favorable deals for its members, going on strike in 2023 to do so.

In-term exclusivity is highly contested in the acting industry as a constant battle between studios, who view the terms as essential to production, and actors and unions, who generally oppose them. In August of 2022, SAG-AFTRA86See SAG-AFTRA Celebrates 10th Anniversary of Merger of Screen Actors Guild and American Federation of Television and Radio Artists, SAG-AFTRA, https://www.sagaftra.org/sag-aftra-celebrates-10th-anniversary-merger-screen-actors-guild-and-american-federation-television [https://perma.cc/W2TH-WHAQ]. reached an agreement with Netflix (“Agreement”) limiting the use of exclusivity provisions for series regulars.87SAG-AFTRA, supra note 9, at 1–2. A similar agreement was reached between SAG-AFTRA and the Alliance of Motion Picture and Television Producers (“AMPTP”)88The Alliance of Motion Picture and Television Producers (“AMPTP”) acts as the collective-bargaining representative for “over 350 motion picture and television producers” such as “Paramount Pictures, . . . Twentieth Century Fox, Universal Pictures, Walt Disney Pictures and Warner Bros. Pictures [and] ABC, CBS, FOX, and NBC.” As Netflix has joined the AMPTP in 2021, all future negotiations on behalf of Netflix will take place with those of the AMPTP. Bruce Bisbey, What Is the Alliance of Motion Picture and Television Producers? (In the Entertainment Industry.), LinkedIn (Apr. 20, 2019), https://www.linkedin.com/pulse/what-alliance-motion-picture-television-producers-industry-bisbey [https://perma.cc/P2F9-BUTA]; see Welcome, AMPTP, https://www.amptp.org [https://perma.cc/5B8M-57GJ]; see also SAG-AFTRA Netflix Agreement, supra note 9, at 1. limiting exclusivity in series actors’ employment agreements.89Robb, supra note 18. SAG-AFTRA’s 2023 strike ended in November of 2023 (the Writers Guild of America also went on strike on May 2, 2023 and ultimately reached a deal with AMPTP on September 27, 2023).90Mandalit del Barco, Hollywood Writers Return to Work, After a Nearly Five Month Strike, NPR (Sept. 27, 2023, 11:27 AM EST), https://www.npr.org/2023/09/26/1201936449/writers-strike-end-vote-wga-leadership [https://perma.cc/763M-374S]. This Note will focus on the terms impacting SAG-AFTRA as they existed prior to the 2023 strike.

An important distinction exists between stars and other talent in the entertainment industry when considering the practical relevance of the prior Agreement’s minimum terms. For those paid above a minimum salary, known as the “exclusivity money break,” the minimum terms of the collective-bargaining agreement do not apply.91SAG-AFTRA, supra note 9, at 1. This means that stars and top-tier talent paid above this amount are not bound by the terms, limiting the effect of the Agreement to non-star talent. The 2022 SAG-AFTRA and Netflix Agreement increased this exclusivity money break (above which exclusivity can be freely negotiated) from $40,000 per episode or per week in 2019 to “$65,000 for a half-hour program and $70,000 for an hour program.”92Id. at 2.

For non-star talent paid less than the exclusivity money break, “the minimum terms of the collective bargaining agreement . . . require that a series regular retain the right to do certain other work in addition to working on the series on which they are a regular.”93Id. at 1. The Agreement grants them the ability to take on a second position as a series regular or miniseries lead and removes the condition that a guest appearance94A “guest appearance” is a brief role on another show. may not be on a competing platform.95SAG-AFTRA, supra note 9, at 2. Netflix still “must approve the [guest] [a]ppearance and the series regular must confirm availability and scheduling with Netflix before accepting it.”96Id. Netflix retains the ability to deny a guest appearance if the guest role is too similar to the actor’s Netflix role, and actors cannot make irreversible changes to their appearance (such as haircuts).97Id. . A minimum three-month “conflict free window” after each season, “during which the series regular may accept a [guest] [a]ppearance without first having to confirm availability or schedule with Netflix” has been established.98Id. at 3. This conflict free window means that a series regular will not be held off the market during offseasons, even in an in-term exclusive talent contract. The guest appearance, however, must be completed during the conflict free window or all remaining work will be second to Netflix’s scheduling, reflecting the concern of studios regarding the coordination of many crew and cast member schedules.99Id. Failure on Netflix’s end to provide this window would result in Netflix paying the series regular their episodic fee for the prior season during the window the actor is not able to compete.100Id.

Concessions on behalf of Netflix in the above Agreement were made in exchange for SAG-AGTRA’s withdrawal of a California bill that it had supported, AB-437, known as the Let Actors Work (LAW) Act.101Id. at 4. AB-437 would have sharply limited exclusivity in television deals in favor of allowing actors to work on competing networks “as long as ‘there is no material conflict of interest with their original employer,’ [and] . . . it [did] not conflict with the original show’s schedule.”102Tapp, supra note 27, at 27. At the time AB-437 was drafted, exclusivity terms in contracts often prohibited stars from appearing on competing networks, even during production breaks.103Id. While “AB-437 passed the [California] Senate Judiciary Committee in a 9-1 vote,”104Id. it was withdrawn prior to the close of the August 2022 Legislative session as a result of the SAG-AFTRA agreements with Netflix and AMPTP.105Kristina M. Launey & Scott P. Mallery, Final Round: Employment Bills Making the Cut to the Governor, Seyfarth: Cal. Peculiarties Emp. L. Blog (Sept. 1, 2022), https://www.calpeculiarities.com/2022/09/01/final-round-employment-bills-making-the-cut-to-the-governor [https://perma.cc/CZP3-5JMX].

The 2022 Agreement between Netflix and SAG-AFTRA exemplifies the arguments both for and against exclusivity and the resulting tension, as concessions were made on each side of the bargaining table. As a result of the Agreement, the demands of an actor’s “exclusivity” during an offseason were reduced. If exclusivity terms for actors paid below the exclusivity money break must now allow a conflict free window during which an actor can accept other roles, these terms may—in practice—act more like noncompete agreements than exclusivity agreements, since even “exclusive” actors are able to work on other shows. This blurs the lines between exclusivity and in-term noncompete agreements. Simultaneously, the importance of ensuring talent’s availability has been acknowledged and protected with provisions granting Netflix first position rights for scheduling.

II.  SECTION 16600 AND EDWARDS

A.  Section 16600

Some states allow contractual noncompete agreements, provided they satisfy a certain reasonableness standard. California, however, takes a strong stance against the enforcement of noncompetes in favor of employee mobility. Section 16600 of the Code states, “Except as provided in this chapter, every contract by which anyone is restrained from engaging in a lawful profession, trade, or business of any kind is to that extent void.”106Cal Bus. & Prof. Code § 16600(a). Section 16600 carves out an exception for noncompetes in the sale or dissolution of corporations, partnerships, and LLCs, allowing covenants not to compete “where a person sells the goodwill of a business and where a partner agrees not to compete in anticipation of the dissolution of a partnership.”107Kelton v. Stravinski, 41 Cal. Rptr. 3d 877, 881 (Ct. App. 2006); see also Edwards v. Arthur Andersen LLP, 189 P.3d 285, 290–92 (Cal. 2008). This exception ensures that those who purchase a business do not immediately face competition from the seller.

B.  Edwards v. Arthur Andersen

Prior to 2008, a small number of cases (mainly federal) allowed narrow restraints on competition in California if they passed a “reasonableness” standard.108See Schmidt, supra note 17, at 147–48. In 2008, the California Supreme Court rejected the Ninth Circuit’s narrow restraint approach to Section 16600 and articulated a single standard for noncompetes in Edwards v. Arthur Andersen LLP.109See Edwards, 189 P.3d at 288, 293.

In 1997, Raymond Edwards II was hired as an accountant by the Los Angeles office of Arthur Andersen LLP, contingent on his signing a noncompete agreement that all managers were required to sign.110Id. at 288; see also Edwards v. Arthur Andersen, Stan. L. Sch., https://scocal.stanford.edu/opinion/edwards-v-arthur-andersen-33130 [https://perma.cc/EW7W-BS2A]. The noncompete prohibited Edwards from performing similar services to any clients he had worked with in the eighteen months prior to his departure for another eighteen months after his release or resignation.111Edwards, 189 P.3d at 288. While Edwards was subject to a non-solicitation provision, he was not prohibited from accepting employment with clients.112Id.

In 2003, shortly after Edwards’s employment at the firm was terminated, Edwards filed a complaint alleging that Andersen’s noncompete agreement violated Section 16600 and was thus unlawful.113Id. at 289. The trial court held that, since the noncompete was “narrowly tailored” and “did not deprive Edwards of his right to pursue his profession,” it did not violate Section 16600.114Id.

The California Court of Appeals reversed, finding that the noncompete was invalid under Section 16600.115Id. at 290. The California Supreme Court agreed and explicitly rejected the “narrow-restraint” exception to Section 16600 used by the Ninth Circuit, stating that “California courts have not embraced the Ninth Circuit’s narrow-restraint exception.”116Id. at 293. The court discussed its policy rationale in favor of protecting Californians and ensuring that “every citizen shall retain the right to pursue any lawful employment and enterprise of their choice.”117Id. at 291 (citing Metro Traffic Control Inc v. Shadow Traffic Network, 27 Cal. Rptr. 2d 573, 577 (Ct. App. 1994)). This means that even a narrow restriction on employment in a specific industry will be invalid under California law.118Edwards, 189 P.3d at 297; see also Daniel Joshua Salinas, Amy Abeloff & Robert B. Milligan, California Court Gives Two Thumbs Down and Voids Non-Compete in Actor’s Agreement, Seyfarth: Trading Secrets (Apr. 20, 2016), https://www.tradesecretslaw.com/2016/04/articles/trade-secrets/california-court-gives-two-thumbs-down-and-voids-non-compete-in-actors-agreement [https://perma.cc/5BFH-35RX]. As noted in Edwards, the California Legislature did not include language to narrow the application of Section 16600 to only overbroad or unreasonable restraints on competition, thus the court will not add those limitations unless expressly indicated by the legislature.119Edwards, 189 P.3d at 293.

While the text and notes of the Code do not specify whether Section 16600 should apply to current as well as former employees, there are nearly a hundred years of case law interpreting the Code in the context of post-term employment restrictions—during which no California state cases have applied the Code to in-term restraints on employment such as exclusivity provisions.120The notes to Section 16600 state that former employees have the “right to engage in competitive business . . . and to enter into competition with [their] former employer, even for business of those who were formerly customers of [their] former employer, provided such competition is fairly and legally conducted,” implying that as long as rules surrounding confidentiality and trade secrets are not violated, the Code applies to former employees to ban noncompetes. Cal Bus. & Prof. Code § 16600 note (citing Fortna v. Martin, 323 P.2d 146, 148 (Cal. Ct. App. 1958)). Accordingly, exclusivity provisions are used often in the talent industry to coordinate schedules among individuals involved in production.121See Rivkin, supra note 11; see also Tapp, supra note 27. However, a recent Ninth Circuit case, ITN, may extend application of Section 16600 (California’s ban on noncompetes) to exclusivity in the acting industry.

III.  CASE ANALYSIS

A.  Exclusivity Analyzed Through the Lens of Section 16600

Since Edwards, Section 16600 “often operates as a per se rule against noncompete clauses in contracts,” prohibiting noncompete agreements in California regardless of whether they are narrowly tailored in favor of promoting open competition and employee mobility.122Thomas D. Nevins, Is an Exclusive Dealing Contract an Unlawful Covenant Not to Compete?, Casetext (Apr. 13, 2009), https://casetext.com/analysis/is-an-exclusive-dealing-contract-an-unlawful-covenant-not-to-compete [https://perma.cc/388Z-VZVE]. As noted earlier, the vast majority of cases applying Section 16600 have been restricted to the post-term noncompete context. Therefore, in-term exclusivity agreements and noncompetes will typically be allowed if narrowly and fairly drafted, as parties often use in-term exclusivity to ensure loyalty and investment in employee development. In the acting industry, in-term exclusivity may be used to coordinate production schedules, make talent contracts more valuable, and prevent actors from simultaneously appearing in rival network platforms (assuming they are paid above the exclusivity money break). For films, these exclusivity provisions include services such as “preproduction (rehearsal, costume fittings, etc.), production (i.e. principal photography), postproduction (which may include special effects work, dubbing, and reshoots), and publicity for the film.”123Smith, supra note 64.

Nevertheless, at least one recent Ninth Circuit decision (ITN) applying California law has extended Section 16600 to invalidate in-term noncompete agreements as well. This extension impacts not only in-term noncompetes, which limit an employee from working in a certain area, but also exclusivity provisions, which restrict an actor’s ability to work in other productions entirely. As in-term noncompetes are less restrictive than exclusivity provisions, the policy reasons in support of the Code’s extension to capture in-term noncompetes may capture exclusivity as well.

B.  Section 16600 Application to In-Term Provisions

Following years of consistent judicial application by California courts, Section 16600 prohibits most post-term noncompete agreements. The more difficult inquiry is whether Section 16600 does or should apply to in-term exclusivity and noncompete agreements for actors. In 2021, the court for the Southern District of California summarized the precedent set by California courts that Section 16600 applies only to bars on post-employment, not in-term employment, competition in Youngevity Int’l, Corp. v. Smith: “Section ‘16600 does not apply to restrictions on a person’s ability to engage in a lawful business while that person is employed by the company to which he or she promised loyalty. . . . Rather, § 16600 targets restrictions on post-employment activity.’ ”124Youngevity Int’l, Corp. v. Smith, No. 3:16-cv-704-BTM-JLB, 2021 U.S. Dist. LEXIS 53456, at *35 (S.D. Cal. Feb. 3, 2021) (emphasis added) (citation omitted). In-term prohibitions on competition have allowed employers to rely on an employee’s loyalty and commitment while employed.125Techno Lite, Inc. v. Emcod, LLC, 257 Cal. Rptr. 3d 643, 651 (Ct. App. 2020). Further, in Techno Lite, Inc. v Emcod, LLC (2020), the California Court of Appeals notes that “[a]ppellants do not cite—and we have not found—a single case in which Section 16600 was held to invalidate an agreement not to compete with one’s current employer while employed by that employer,” rejecting an argument that Section 16600 could apply to restrictions on employees while currently employed.126Id.

However, in 2017, the Ninth Circuit Court of Appeals applied California state law in ITN Flix, LLC v. Hinojosa to hold that Section 16600 does in fact apply to invalidate “in-term” noncompete clauses lasting only for the term of employment set by the contract.127ITN Flix, LLC v. Hinojosa, 686 F. App’x 441, 444 (9th Cir. 2017). If Section 16600 is extended to prohibit in-term noncompete and exclusivity terms (as is suggested by ITN), many existing practices in the entertainment industry could be legally vulnerable. In ITN, an actor’s Master License Agreement (“MLA”) and Acting Agreement (“AA”) were found to be void as unlawful restraints on trade since they limited the actor’s right to pursue lawful employment.128Id. at 443–44. The actor had entered into the MLA and AA contracts after starring in a film franchise built around his “vigilante character” role.129Salinas et al., supra note 118. The contracts limited the actor’s ability to play “vigilante characters” in other films, as well as his ability to appear in similar films from 2006 to 2013 (a term of seven years brushing against the outer limit of California’s “Seven Year Rule” for personal-service contracts).130Id.; see also ITN Flix, LLC v. Hinojosa, casetext, https://casetext.com/case/itn-flix-llc-v-hinojosa-2 [https://perma.cc/2XEZ-26BM]. The film was a box office flop.131Salinas et al., supra note 118. The actor then starred in a later film as a “vigilante character,” which was a commercial success.132Id. The producer of the original film, Medina, sued for the actor’s breach of contract and argued that the MLA and AA were valid contracts not to compete, as Section 16600 “does not apply to ‘in-term’ non-compete clauses that last only for the term of employment set by the contract.”133ITN, 686 F. App’x at 444.

The court disagreed and said that “[u]nder Cal. Bus. & Prof. Code [Section] 16600, both the MLA and AA are void as unlawful restraints on trade because they limit the right of [the actor] to pursue lawful employment.”134Id. at 443–44. In rejecting Medina’s argument that Section 16600 applies only to post-term noncompetes, the court stated—in no soft terms—that “[b]oth California courts and the Ninth Circuit have rejected [that] argument,” citing two cases in support of their bold statement that Section 16600 applies to invalidate in-term noncompetes: (1) Kelton v. Stravinski (a 2006 California Court of Appeals case) and (2) Comedy Club Inc. v Improv West Associates (a 2009 Ninth Circuit case).135Id. Both of these cases, however, discuss noncompete agreements in contexts outside of the employment context—first the franchise context and later in partnerships—raising the question as to whether the court in ITN was stretching to find support for its policy stance.136Kelton v. Stravinski, 41 Cal. Rptr. 3d 877, 882 (Ct. App. 2006); Comedy Club, Inc. v. Improv W. Assocs., 553 F.3d 1277, 1291–92 (9th Cir. 2009). California courts have explicitly stated that the “reasoning [in certain cases] is tied to the franchise context,” meaning a case involving a franchisor and franchisee is not directly analogous to a case involving an actor and their employer.137Kelton, 41 Cal. Rptr. 3d at 882. Thus, the extension of Section 16600 to in-term noncompetes does not seem to be supported by existing laws or cases.

First, Kelton involved two partners who developed industrial warehouses and thus had a partnership relationship as opposed to that of an employee and employer.138Id. at 877. The partners had agreed to a covenant not to compete which prohibited them from building warehouses independently.139Id. After one partner allegedly breached the covenant, the California Fifth District Court of Appeals held that the covenant was invalid under Section 16600, as it did not fall under any exceptions to the Code and “[i]n the partnership context, an ongoing business relationship [between the parties] does not validate the covenant [not to compete],” or create a Section 16600 exception.140Id. at 879 (emphasis added). Further, the partners in Kelton limited the fiduciary duties owed to one another to only those rising out of the Partnership’s property, explicitly stating that they had no “obligation to refer to the Partnership or to the other Partner any business opportunity,” and that “each partner could ‘engage in other real estate activities, . . . competitive with the Partnership or otherwise.’ ”141Id. This is significant, as a large policy reason for enforcing in-term noncompete covenants is the expectation of loyalty that accompanies them. Here, the partners expressly limited both their fiduciary duties and any expectations of loyalty regarding real-estate developments.142Id. This makes the facts in Kelton distinguishable from those in ITN, and, while cited as a supporting case for the application of Section 16600 to in-term noncompetes, support for ITN from Kelton is not strong due to the factual differences between a business partner and an employee.

The second case cited in ITN supporting the ban on in-term noncompetes in the employment context is Comedy Club Inc. v. Improv West Associates.143Comedy Club, Inc. v. Improv W. Assocs., 553 F.3d 1277, 1292 (9th Cir. 2009). Comedy Club involved two businesses that agreed to an exclusive Trademark License Agreement, which was later breached.144Id. The Ninth Circuit Court of Appeals stated that “an in-term covenant not to compete in a franchise-like agreement will be void if it ‘foreclose[s] competition in a substantial share’ of a business, trade, or market.”145Id. (citing Dayton Time Lock Serv., Inc. v. Silent Watchman Corp., 124 Cal. Rptr. 678, 682 (Ct. App. 1975)). However, the MLA and AA in ITN did not resemble a franchise agreement. While Comedy Club held the franchise’s in-term noncompete was invalid, Comedy Club involved two businesses in a franchise agreement—not an employee and an employer like in ITN.146Comedy Club, 553 F.3d at 1292. Further, the Ninth Circuit in Comedy Club refused to void the entire in-term covenant.147Id. at 1293. Instead, it weighed the interests of the plaintiff in operating its business against those of the defendant seeking to protect its trade name and goodwill, creating a compromise which allowed the plaintiff to operate in certain areas in which the defendant did not already operate.148Id.

The Comedy Club court does note that “California courts are less willing to approve in-term covenants not to compete outside a franchise context because there is not a need ‘to protect and maintain [the franchisor’s] trademark, trade name and goodwill.’ ”149Id. at 1292 (citing Kelton v. Stravinski, 41 Cal. Rptr. 3d 877, 882 (Ct. App. 2006)). This suggests that in-term exclusivity provisions may be subject to some challenges if they are not drafted with appropriate terms.

Lastly, Medina argued that Section 16600 should not be applied to the entertainment industry, as it “would be unworkable because personal services contracts are so often needed to ensure the availability of celebrities.”150ITN Flix, LLC v. Hinojosa, 686 F. App’x 441, 444 (9th Cir. 2017). While the court was not persuaded by this argument, maintaining its stance that this noncompete was illegal regardless of scheduling implications, Medina’s argument touches on some of the most important issues that would stem from an extension of Section 16600 to in-term noncompetes.151Id. Scheduling work on large productions would be more difficult, potentially raising costs and slowing the pace of production. Further, studios have an interest in ensuring their stars do not accept similar roles in the same time frame during which their films are being released, as it could lower viewership and performance. It is also important to note that exclusive personal-service contracts today are the product of a freelance entertainment industry in which actors are cast for specific roles, as well as extensive collective-bargaining negotiations in which both actors and studios are represented. Actors need protection from exploitation, and unions (such as SAG-AFTRA) will go on strike to ensure actors’ interests are adequately represented in collective-bargaining negotiations. The resulting contracts are far less demanding than those that existed in the 1930s, and interference by the courts with the established system and the agreements that have resulted from it may raise more problems than solutions. It may be best to allow unions and studios to reach their desired outcomes without judicially imposed boundaries on in-term noncompete agreements.

One way to reconcile the outcome of ITN with the overwhelming enforcement of in-term noncompetes is by treating the MLA and AA as post-term noncompete agreements. While the court said the actor’s contract was an in-term prohibition, it is possible that it actually categorized the MLA and AA as post-term noncompete contracts and treated them as such, since both restricted the actor’s work after the film was released. Thus, the contracts may have been post-term prohibitions on competition and invalid for that reason, despite the courts “in-term” language. This would allow the result in ITN to be accurate while maintaining the concept of exclusivity. Further, the importance of this distinction highlights the nuances and gray areas that exist in an actor’s contract. A contract may be “in-term” if it applies for a set number of years or seasons of a show, while also operating as “post-term” if it continues to limit the actor after filming has wrapped and an actor’s services are no longer actively needed.

Another possibility is that the Ninth Circuit in ITN simply incorrectly overapplied Section 16600 in an effort to show its recognition of California case law as distinguished from the more lenient noncompete laws of other states in the Ninth Circuit. In the past, the Ninth Circuit has issued certified questions to the California Supreme Court regarding noncompetes, as it did in Ixchel Pharma v. Biogen, asking how broad Section 16600 is in its reach.152Robert B. Milligan, Lauren Leibovitch & Miguel Ramirez, Ninth Circuit Seeks Guidance from California Supreme Court on Business to Business Non-Competes, Casetext (Mar. 23, 2020), https://casetext.com/analysis/ninth-circuit-seeks-guidance-from-california-supreme-court-on-business-to-business-non-competes [https://perma.cc/9ZSP-5CQ8]. Further, in 2008, the California Supreme Court had explicitly rejected the “narrow-restraint” exception previously used by the Ninth Circuit.153Edwards v. Arthur Andersen LLP, 189 P.3d 285, 291 (Cal. 2008). The Ninth Circuit therefore may have improperly applied Section 16600 due to confusion regarding the scope of the Section or in an effort to show its recognition of California law as distinguished from other states in the Ninth Circuit.

ITN may also suggest that certain situations in the acting industry can trigger the application of Section 16600 to hold an in-term noncompete invalid if it is unduly harsh. The actor’s contracts, while technically within the seven years allowed for a personal-service contract in California, were at the outer limits as they lasted for a full seven years. Perhaps a shorter period, such as three or four years, would have led the court to reach a different conclusion.

Despite the plausible explanations above reconciling ITN with existing California case law, ITN is likely an outlier on the treatment of in-term noncompetes in California. In Edwards, the Supreme Court of California invalidated a noncompete agreement that forbade a former employee from working with certain clients and soliciting other employees for periods of twelve to eighteen months after his employment terminated.154Id. The Supreme Court “did not address—much less invalidate—agreements by employees not to undermine their employer’s business by surreptitiously competing with it while being paid by the employer.”155Techno Lite, Inc. v. Emcod, LLC, 257 Cal. Rptr. 3d 643, 650 (Ct. App. 2020). This suggests that if an employee is still being paid, in-term noncompetes are entirely valid. As the California Supreme Court has not weighed in on the treatment of in-term noncompete agreements, its deferential stance in Edwards to legislative intent signals that it may be waiting for clarification from the lawmaking branches of the California government before extending Section 16600 to in-term covenants. Further, the transition away from the studio system of the 1930s (in which actors were held off the market for long periods of time) to the freelance model of talent contracts today, accompanied with the introduction of the “Seven Year Rule” in California, has put in place protections for actors that seem to absolve the need for any total ban on exclusivity or in-term noncompete agreements.

C.  Section 16600 Application to Post-Term Noncompete Provisions

It is generally accepted that Section 16600 prohibits post-term noncompete provisions in California.156Youngevity Int’l, Corp. v. Smith, No. 3:16-cv-704-BTM-JLB, 2021 U.S. Dist. LEXIS 53456, at *34–35 (S.D. Cal. 2021). Post-term noncompetes prevent former employees from working for a competitor or soliciting clients for a certain amount of time.157Id. Most cases interpreting Section 16600 under California law fall in this post-employment context, as the statute has consistently invalidated covenants not to compete that interfere with an employee’s ability to compete after they cut ties with a former employer.158Comedy Club, Inc. v. Improv W. Assocs., 553 F.3d 1277, 1290 (9th Cir. 2009).

Post-term noncompete clauses in the entertainment industry are not common but might include provisions forbidding an actor from working in a production associated with a rival television network or film studio, even after all work has been completed for the current role. Terms with similar effects, however, may be included in the contracts of megastars preventing them from accepting roles with competing studios.159Nigma, supra note 70. This represents the unique gray area in talent contracts, in which it appears an actor has completed a term of their contract (as filming for the movie is done), yet the actor may have a three-picture contract bringing these terms within the scope of in-term exclusivity.

Interestingly, the LA County Superior Court seems to take the stance that post-term noncompete agreements are valid in the narrow fixed-term employment context where an employee leaves in the 2020 case Viacom v. Netflix.160Viacom Int’l v. Netflix, Inc., No. 18STCV00496, 2020 Cal. Super. LEXIS 4442, at *7–10 (Cal. Super. Ct. 2020). This is particularly relevant in the acting industry, in which talent contracts are generally for a set term as opposed to at will. The holding of Viacom, applied to talent contracts, suggests that noncompete agreements restricting an actor’s ability to accept roles after a contract has ended could actually be valid where the actor is the party that breaches the contract. In Viacom, an executive employed by Viacom (an entertainment company) with a fixed-term employment contract left her job nineteen months prior to the end of her contract to work for Netflix.161Id. at *3. Viacom sued Netflix, seeking a permanent injunction enjoining Netflix from taking its employees in this manner, as well as damages.162Id. The disputed noncompete provisions from the executive’s employment agreement read

Your employment with the Company is on an exclusive and full-time basis, and while you are employed by the Company, you shall not engage in any other business activity which is in conflict with your duties and obligations (including your commitment of time) to the Company . . . .
The “Non-Competition Period” begins on the Effective Date and ends on the last day of the Contract Period, provided that:
1. If the Company terminates your employment without Cause before the end of the Contract Period, then the Non-Competition Period shall end on the earlier of (i) the end of the period in which you are receiving payments pursuant to paragraph 11(b)(i) or (ii) the effective date of your waiver in writing of any right to receive or continue to receive compensation and benefits under paragraph 11. You shall be deemed to have irrevocably provided such waiver if you accept competing employment.
2. If the Company terminates your employment for Cause or you resign, the Non-Competition Period shall end on the earlier of (i) the last day of the Contract Period or (ii) eighteen (18) months after such termination or resignation.163Id. at *13–14 (emphasis added).

While Netflix argued that the covenant was an unlawful prohibition preventing the employee from working in similar positions for eighteen months post-employment, the court disagreed.164Id. The court stated that there is no case law supporting the argument that fixed-term contracts not to compete are invalid given that the employee voluntarily left Viacom under assurance from Netflix that she would be indemnified and would not have to pay legal fees.165Id. at *18. While Section 16600 would invalidate the noncompete if the employee had been terminated by Viacom, this case suggests that a noncompete provision for a set amount of time will be upheld where the employee voluntarily leaves their position.166Id. Applied to talent contracts, actors who sign fixed-term exclusivity and noncompete contracts for the filming of their television shows or films may have agreed to valid noncompete provisions in the case that an actor quits in order to pursue a different role, regardless of whether the noncompete becomes post-term.

In Viacom, the noncompete was valid because the employee was not terminated but chose to leave to work for a competitor, thus forsaking her position and its salary voluntarily.167Id. Had the employee instead been terminated, Section 16600 would undoubtedly be implicated.168Id. Additionally, the court notes that at will employment contracts (as opposed to fixed term) with identical language would prove to be unlawful.169Id. This raises an interesting question regarding option contracts.170Are option contracts at will since the producer often has the sole option to extend the contract for an additional movie or season of a show? Or are they fixed term, since the option must be triggered within a set amount of time? See What Are Options and Exclusivity Clauses?, supra note 78. Further, it seems as though the court wanted to hold for Netflix from a policy perspective.171Viacom, Inc., 2020 Cal. Super. LEXIS 4442*, at *18. The court expressly stated that it believes Viacom’s fixed-term employment contracts may violate Section 16600, but that it is unable to find binding case law in support of this position.172Id. This is an interesting narrowing of Section 16600 in finding post-term noncompete terms legal in the situation in which an employee leaves, with particular application to the acting industry where the actor will typically be the one breaching an exclusivity provision in order to render outside work. While helpful in noting that California case law does not suggest that exclusivity in a fixed-term contract is unlawful, this is a Superior Court case and is thus not binding.173Id. The court itself seems to struggle with the outcome and is perhaps expressing its struggle with existing precedent in an effort to open the door for the California Supreme Court to weigh in on the matter.

Steinberg Moorad & Dunn, Inc. v. Dunn, an unpublished 2005 Ninth Circuit case referenced in Viacom, takes the view that a post-term noncompete is invalid regardless of whether the employee left or was fired: “[w]hen an employee leaves, be it before the term of employment has ended or not, [S]ection 16600 prohibits the employer from preventing that employee from pursuing his trade.”174Steinberg Moorad & Dunn, Inc. v. Dunn, 136 F. App’x 6, 10 (9th Cir. 2005). The Viacom court states that, while it would like to rely on Steinberg as persuasive, it is unable to do so because, as an unpublished case, it lacks the specific facts needed to analyze Viacom’s noncompete clause.175Viacom, Inc., 2020 Cal. Super. LEXIS 4442, at *17. This further suggests that some direction is needed from the higher state courts in California or the legislative branch on the application of Section 16600 when an employee is the one to cut ties with the employer in a fixed-term exclusivity contract.

While Viacom represents a narrow application of Section 16600 to allow post-term noncompetes, the application is important in certain contexts such as Silicon Valley where technology companies are constantly poaching employees with key information regarding data breakthroughs such as self-driving car technology.176Timothy B. Lee, A Little-Known California Law Is Silicon Valley’s Secret Weapon, Vox (Feb. 13, 2017, 2:00 PM), https://www.vox.com/new-money/2017/2/13/14580874/google-self-driving-noncompetes [https://perma.cc/B5Z4-Y8AJ]. However, the nuanced application of the Code to the general prohibition on post-term noncompetes (allowing them where the employee leaves a fixed-term contract) may have unintended consequences by restricting the movement of talent in the acting industry.

D.  Factors Unique to the Entertainment Industry

In determining whether Section 16600 should apply to noncompetes in talent contracts, perhaps talent contracts should be evaluated under a unique standard that considers the nuanced aspects of acting, such as fame. Are actors distinct from other employees whose in-term noncompetes in California are valid? As touched on in the discussion of ITN above, a gray area exists within noncompetes in which a contract may be ongoing, but an actor is no longer actively working on a project. A theoretical argument can be made that fame should play a role in the analysis. While most employees merely provide labor, actors are involved in a finished product, the value of which may turn on an actor’s reputation. This is particularly relevant when an actor is a widely recognized celebrity, known for their portrayal of certain characters or for a certain genre. For example, horror films or children’s films. Perhaps an actor is different from a typical employee in that the subsequent work of a “famous” actor could impact their image, and in turn, the value of the character created in a series or film owned by the studio. If this is the case, fame could be an important factor in the analysis of exclusivity provisions. While the actions of a little-known actor after a film or series airs will likely be inconsequential, the press surrounding a major celebrity may have a large impact on the success of a program.

This can be exemplified by the controversy surrounding Daniel Radcliffe’s involvement in Equus, a play in which Radcliffe appeared “full-frontally nude in a prolonged scene.”177Sarah Lyall, Onstage, Stripped of That Wizardry, N.Y. Times (Sept. 11, 2008), https://www.nytimes.com/2008/09/14/theater/14lyal.html [https://perma.cc/33N3-DYHJ]. Following Radcliffe’s nude appearance in the play, press speculated whether the star of the Harry Potter film franchise would be denied the role in the last two films, as the franchise was widely popular with children. One comment on a Harry Potter fan site following news of Radcliffe’s role in Equus with mature scenes read, “We as parents feel Daniel should not appear nude. Our nine-year-old son looks up to him as a role model. We are very disappointed and will avoid the future movies he makes.”178Harry Potter Bares All: Upsets Parents, Live J. (Jan. 30, 2007, 9:09 PM), https://ohnotheydidnt.livejournal.com/10593488.html [https://perma.cc/F3QJ-VPRY].

On the other hand, Daniel Radcliffe’s role in Equus did not seem to hinder the success of the final two Harry Potter movies, as “[t]he eighth and final Harry Potter movie was . . . the third-biggest movie of all time behind only Titanic . . . and Avatar,” bringing in $1.342 billion in the global box office.179Scott Mendelson, Every ‘Harry Potter’ Movie Ranked by Worldwide Box Office, Forbes (Aug. 13, 2020, 1:00 PM), https://www.forbes.com/sites/scottmendelson/2020/08/13/harry-potter-movies-ranked-box-office-jk-rowling-emma-watson-daniel-radcliffe [https://perma.cc./57NM-LEFC]. If viewers do not place substantial weight on an actor and instead focus on the character portrayed, the argument that fame should be factored into the legality of post-term exclusivity terms is substantially weaker.

IV.  SUMMARY OF CASE ANALYSIS

Analyzed through the lens of Section 16600, in-term noncompetes and exclusivity provisions in the acting industry seem to fall outside the scope of the Code’s prohibition of post-term noncompetes and are thus, at least in the general sense, legal. This does not mean, however, that the line is clear-cut or that all in-term noncompete and exclusivity clauses are watertight in their legality. Some in-term noncompete provisions may be prohibited if they are too broad in their restrictions or if they are not well-drafted. The Ninth Circuit’s application of California law in ITN exemplifies a court’s refusal to enforce an actor’s exclusive MLA and AA agreements even for in-term contracts, as the studio’s ban on the actor playing other “vigilante characters” for seven years was an illegal prohibition on the actor’s right to work. While this case is an outlier in an otherwise mostly unified interpretation of Section 16600’s application to post-term noncompete provisions, it indicates that in some instances, reasonableness and length of a contract may still be used to judge the legality of an in-term noncompete agreement.

Conversely, post-term exclusivity provisions are exactly what the California Code was designed to prevent and are generally illegal, except (as Viacom suggests) perhaps in the narrow situation where the employee voluntarily leaves the employer. In California, an employer cannot prohibit a former employee from working after they have left. Viacom interprets the Code, however, as allowing noncompetes in fixed-term employment contracts where the employee voluntarily leaves but prohibiting them when it comes to at will contracts with no end date. It is notable that the court in Viacom, however, believes that these contracts are perhaps illegal but is unable to hold that they are due to the lack of precedential case law on the matter. This may be a signal that it is time for the California Supreme Court to weigh in on the distinction between post-term and in-term exclusivity provisions under Section 16600 and explain that—as currently written and interpreted—it does not extend to invalidate in-term exclusivity and noncompete agreements. The California Supreme Court may also need to articulate whether post-term noncompetes are allowed in the narrow situation where an employee voluntarily leaves.

Another interesting distinction can be made between actors and nonactor employees; while actors are classified as employees, they are distinct due to their fame and their reputational value that has the potential to impact a final work product. This may support the theoretical argument that fame should be considered in analyzing exclusivity and noncompetes. However, if viewers can separate an actor from the roles they play, this may not be an issue.

The table below summarizes the standard from the majority of California cases interpreting Section 16600.

Figure 1.

Section 16600 In-Term ExclusivityPost-Term Exclusivity
Exclusivity terms are heavily negotiated in television talent contracts, yet not heavily negotiated film talent contracts.

Typically legal if narrow in scope and well-drafted.

 

Often illegal if at will.

Potentially illegal if Section 16600 is extended to invalidate restrictive in-term exclusivity.

(ITN)

Potentially legal if fixed-term and the employee voluntarily leaves.

(Viacom)

CONCLUSION

California is firm in its stance against post-term noncompetes, yet an acting industry specific analysis suggests that the unique attributes of talent contracts may require a more nuanced approach. The rise of online streaming has changed the demands placed on actors, with shorter series seasons contributing to an increase in idle time. The landscape is changing rapidly, resulting in the frequent renegotiation of terms and resulting standstills, exemplified by the 2023 SAG-AFTRA strike.

The in-term and post-term treatment of exclusivity provisions and noncompetes has received conflicting treatment by California and Ninth Circuit Courts, suggesting that perhaps the California Supreme Court should weigh in on the matter as they did in 2008 with Edwards and articulate whether Section 16600 can apply to in-term noncompete and exclusivity provisions.180Edwards v. Arthur Andersen LLP, 189 P.3d 285 (Cal. 2008). While it is widely held that Section 16600 does not apply to in-term noncompetes, the holding in ITN suggests that certain situations in the acting industry may trigger its application and deem an in-term noncompete invalid if unduly harsh.181ITN Flix, LLC v. Hinojosa, 686 F. App’x 441, 444 (9th Cir. 2017).

Viacom suggests that in certain instances where an employee breaches a fixed-term exclusivity provision, post-term noncompetes may be upheld. Regardless, the ability of actors and unions to negotiate with studios for mutually beneficial terms has allowed common practices in entertainment contracts to shift over time without much recent legislation. This suggests that, while the applicable law will provide one side with bargaining power, negotiations and collective-bargaining agreements will largely continue to set the standards for common entertainment contract practices.

97 S. Cal. L. Rev. 1087

Download

* J.D. Candidate 2024, University of Southern California Gould School of Law. To my parents, thank you for being my best friends and biggest supporters.

Infringement Episodes

For decades, copyright scholars have waged a spirited campaign against statutory damages. Our remedial system, critics say, is an incoherent mess. The core problem is that copyright holders can recover a separate award of statutory damages for every infringed work. As a result, damages can rapidly add up in any case involving multiple works. Because the number of statutory awards is tethered to the number of works, even trivial claims can lead to crippling damages. Commentators, policymakers, and judges have criticized this system as arbitrary and overbroad. And yet it endures.

This Article argues that copyright’s per-work scheme has obscured, and at times eclipsed, a more compelling paradigm of copyright damages—one that attends more closely to the defendant’s course of conduct. This new approach would allow courts to examine whether the defendant’s actions arose out of, and were rooted in, a single infringement episode. By infringement episode, I mean a chain of infringing acts that together constitute a larger factual event. When the defendant’s conduct is traceable to a single episode, courts should issue only a single statutory award—no matter how many works are at stake. This framework, in short, would substitute rigidity for flexibility. It would displace copyright’s one-award-per-work scheme and instead introduce a contextual inquiry into the defendant’s course of conduct. Doing so can mitigate the risk of outlandish awards, encourage courts to properly calibrate damages, and infuse a degree of much-needed pragmatism into our system.

INTRODUCTION

Copyright scholars have long taken a decidedly dim view of statutory damages. The federal copyright statute entitles plaintiffs to recover statutory damages without proof of harm.117 U.S.C. § 504(c)(1) (“[T]he copyright owner may elect, at any time before final judgment is rendered, to recover, instead of actual damages and profits, an award of statutory damages for all infringements involved in the action, with respect to any one work.”); see infra Section I.B. But our remedial system, critics say, is an incoherent jumble. Courts wield “virtually unfettered discretion” in assessing damages.2Cullum v. Diamond A Hunting, Inc., 484 F. App’x. 1000, 1002 (5th Cir. 2012); see also Fitzgerald Publ’g Co. v. Baylor Publ’g Co., 807 F.2d 1110, 1116 (2d Cir. 1986) (noting that the Copyright Act affords courts “wide discretion . . . in setting the amount of statutory damages”); Playboy Enters., Inc. v. Webbworld, Inc., 991 F. Supp. 543, 560 (N.D. Tex. 1997) (emphasizing the courts’ broad discretion in awarding damages “[w]ithin the range defined by the statutory maximum and minimum”). Statutory awards often seem inexplicable or arbitrary.3See Pamela Samuelson & Tara Wheatland, Statutory Damages in Copyright Law: A Remedy in Need of Reform, 51 Wm. & Mary L. Rev. 439, 485–91 (2009). Courts are also deeply divided over the justificatory basis for statutory damages. Some believe that statutory damages derive from a compensatory rationale, while others lean more readily on punitive instincts.4See, e.g., Peer Int’l Corp. v. Pausa Recs., Inc., 909 F.2d 1332, 1337 (9th Cir. 1990) (“ ‘[E]ven for uninjurious and unprofitable invasions of copyright, the court may, if it deems it just, impose a liability within [the] statutory limits to sanction and vindicate the statutory policy’ of discouraging infringement.” (alteration in original) (quoting F.W. Woolworth Co. v. Contemp. Arts, Inc., 344 U.S. 228, 233 (1952))). Other courts, by contrast, insist that statutory damages “should not be converted into a windfall where, as a practical matter, the plaintiff has suffered only nominal damages.” Doehrer v. Caldwell, 207 U.S.P.Q. (BL) 391, 393 (N.D. Ill. 1980). For an overview of the justificatory framework underpinning the law of statutory damages, see infra Section II.A. And statutory damages sometimes lead to inflated awards that far outstrip any reasonable assessment of harm.5See infra Section II.C. The law of statutory damages is a disaster.

Underlying these pathologies is a peculiar feature of our copyright system: its per-work structure. Copyright holders can recover a separate statutory award for every infringed work.6See infra Section I.B. As a result, statutory damages can add up rapidly in any case involving more than a single work.7See infra Section II.C. In one recent case, copyright holders leveraged this per-work scheme to sue for a breathtaking award of $75 trillion in statutory damages—more than “the combined gross domestic product and national debt of the United States.”8Defendants’ Brief Regarding Plaintiffs’ “Per Infringement” Damages Theory at 2–3, Arista Recs. LLC v. Lime Grp. LLC, 784 F. Supp. 2d 398 (S.D.N.Y. 2011) (No. 06 Civ. 5936) (“Plaintiffs claim a potential award of some 75 trillion dollars—more than double the combined gross domestic product and national debt of the United States, and infinitely more money than the entire music recording industry has made since Edison’s invention of the phonograph in 1877.”); see also Devin Coldewey, Record Industry: Limewire Could Owe $75 Trillion—Judge: “Absurd”, TechCrunch (Mar. 24, 2011, 2:56 PM), https://techcrunch.com/2011/03/24/record-industry-limewire-could-owe-75-trillion-judge-absurd [https://perma.cc/6342-GL5T]. More recently, a jury returned an award of one billion dollars in a case implicating thousands of copyrighted works. Sony Music Ent. v. Cox Commc’ns, Inc., 464 F. Supp. 3d 795, 808 (E.D. Va. 2020), appeal docketed, No. 21-01168 (4th Cir. 2021). Describing the award as “unwarranted, unjust and beyond excessive,” the defendant vowed to challenge it. Court Upholds $1bn Copyright Ruling Against ISP Cox, World IP Rev. (Jan. 18, 2021), https://www.worldipreview.com/news/court-upholds-1bn-copyright-ruling-against-isp-cox-20590 [https://perma.cc/5XGZ-8LUG]. In another notable case, a record label brought action against Jammie Thomas-Rasset,  a single mother from Minnesota,  for illegally downloading and distributing twenty-four songs.9Capitol Recs., Inc. v. Thomas, 579 F. Supp. 2d 1210, 1212–13 (D. Minn. 2008), vacated sub nom., Capitol Recs., Inc. v. Thomas-Rasset, 692 F.3d 899 (8th Cir. 2012); see also Amy Forliti, Single Mom Can’t Pay $1.5M Song-Sharing Fine, NBC News (Nov. 5, 2010, 11:42 AM), https://www.nbcnews.com/id/wbna40030700 [https://perma.cc/Z9EM-EPMY]. A jury found the defendant liable and returned an award of $80,000 per infringed song, resulting in a total award of $1.92 million.10Capitol Recs., Inc. v. Thomas-Rasset, 680 F. Supp. 2d 1045, 1050 (D. Minn. 2010), vacated, 692 F.3d 899 (8th Cir. 2012). Initially, the jury awarded the plaintiffs $9,250 per infringed song for a total award of $220,000. Thomas, 579 F. Supp. 2d at 1213. On retrial, however, the jury increased the award to $1.92 million based on an assessment of $80,000 per song. Thomas-Rasset, 680 F. Supp. 2d at 1050. Aghast, the district court dismissed the final award as “simply shocking” and reduced it to $54,000. Id. at 1054. Following a third trial, the Eighth Circuit reinstated the first judgment of $220,000. Capitol Recs., Inc. v. Thomas-Rasset, 692 F.3d 899, 906 (8th Cir. 2012). Stunningly, by some measures, this award was 35,000 times larger than the plaintiff’s actual loss.11Thomas, 579 F. Supp. 2d at 1227. The plaintiff’s actual harm, at least based on a rough assessment by the court, was about fifty dollars. Id. (“Thomas allegedly infringed on the copyrights of 24 songs—the equivalent of approximately three CDs, costing less than $54.”).

As these examples illustrate, the prospect of outlandish damages is rather startling. Because the number of statutory awards is tethered directly to the number of infringed works, statutory damages can quickly balloon in relatively minor cases. The risk of financial ruin looms large.

Copyright scholars, in turn, have proposed a number of potential reforms.12See infra Section III.A. Matthew Sag, for instance, suggests that we reduce or cap the range of available damages in certain file-sharing cases.13Matthew Sag, Copyright Trolling, an Empirical Study, 100 Iowa L. Rev. 1105, 1139–40 (2015) (suggesting that first-time defendants in file-sharing cases face a reduced statutory award). Oren Bracha and Talha Syed contend that, at least under a compensatory framework, statutory damages should approximate actual harm.14Oren Bracha & Talha Syed, The Wrongs of Copyright’s Statutory Damages, 98 Tex. L. Rev. 1219, 1250 (2020). Ben Depoorter contemplates a variety of reforms that would make excessive damages both less likely and less costly.15Ben Depoorter, Copyright Enforcement in the Digital Age: When the Remedy Is the Wrong, 66 UCLA L. Rev. 400, 441–46 (2019). James DeBriyn calls for eliminating statutory damages.16James DeBriyn, Shedding Light on Copyright Trolls: An Analysis of Mass Copyright Litigation in the Age of Statutory Damages, 19 UCLA Ent. L. Rev. 79, 111 (2012). Michael Carrier would proscribe recovery of statutory damages in cases involving secondary liability.17Michael A. Carrier, Increasing Innovation Through Copyright: Common Sense and Better Government Policy, 62 Emory L.J. 983, 985 (2013) (“The second copyright proposal that would foster innovation would be to eliminate statutory damages in cases of secondary liability.”). And Alan Garfield endorses a host of different reforms—interpretive, legislative, and constitutional—to restrict judicial discretion in assessing statutory damages.18See Alan E. Garfield, Calibrating Copyright Statutory Damages to Promote Free Speech, 38 Fla. St. U. L. Rev. 1, 37–53 (2010).

None of these proposals, however, confront the core issue: the per-work structure of copyright remedies. To meet the urgency of the moment, this Article suggests a legislative reform that would do away with per-work remedies altogether. Instead, I propose that courts police the defendant’s conduct based on a more holistic approach.19See infra Part III. Our current system encourages courts and juries to engage in a rote exercise of counting infringed works.20See infra Section II.C. A better framework, I argue, would afford courts greater discretion to assess whether the defendant’s conduct gave rise to, and was squarely rooted in, a single infringement episode. By infringement episode, I mean a chain of related infringing acts that together constitute a larger factual event. When the defendant’s conduct is traceable to a single larger episode, courts should be able to issue only a single statutory award—no matter how many works are at stake. By focusing on infringement episodes rather than the number of infringed works, this approach breaks with copyright’s one-award-per-work system. It substitutes rigidity for flexibility, relying instead on a context-sensitive inquiry into the defendant’s course of conduct. And it offers an alternative analytical paradigm for thinking about copyright remedies.

To operationalize this approach, the Article sketches a richly nuanced account of when and why courts might treat a series of infringing acts as sufficiently intertwined to constitute a single episode.21See infra Part III. In particular, I argue that courts should attend to an array of interrelated factors, including the nature of the infringing acts; the time and place of each act; whether the evidence supporting one infringement is necessary or sufficient to sustain liability for another; whether the defendant’s actions were executed in pursuit of a common plan or a larger creative enterprise; and the potential consequences of a statutory award. If adopted, this multifactor framework would allow courts to avoid per-work damages when the defendant’s actions derive from a single infringement episode.

This approach may seem radical. Immersed as we are in a legal culture that prizes per-work awards, it may be difficult to conceive of a better system for assessing damages—one that is both administrable and normatively attractive. But if we want to put an end to excessive awards, we need to get judges and juries out of the business of counting infringed works. The way to do that is through structural reform that would enable courts to scrutinize the defendant’s overarching course of conduct.

The Article also brings this approach into conversation with other legal disciplines.22See infra Section III.B. Across a variety of seemingly siloed areas of law—criminal law, civil procedure, and immigration law—courts have already fashioned doctrinal tools to evaluate whether a cluster of wrongful acts might be collectively reducible to a single transaction, episode, scheme, or series of occurrences. Typically, courts do so by undertaking a multifactor inquiry into the defendant’s entire course of conduct. Rather than examining every single action in isolation, courts approach the defendant’s conduct from a more holistic perspective. And this framework, subject to a few notable modifications, could be brought to bear on modern copyright law.

Ultimately, this Article seeks to chart a new horizon for copyright remedies. What distinguishes the proposed approach from previous proposals is that it takes seriously the core issue: per-work remedies that allow courts and juries to aggregate damages. After decades of faint-hearted debates, policymakers ought to try something different. If left unabated, copyright’s remedial system will continue to provoke all manner of mischief. It is time to rethink per-work damages in toto.

The argument proceeds in four parts. Part I discusses two of the most contestable aspects of our current system. The first problem is that infringement is principally understood to be a strict liability tort.23See infra Section I.A. In other words, liability for copyright infringement does not depend on the infringer’s intent, knowledge, or negligence. Accidental infringement, then, is a very real possibility. The second problem is that copyright owners can elect to recover a separate award of statutory damages for every infringed work.24See infra Section I.B. Because these awards can range anywhere from $200 to $150,000 per work, there is much at stake.

And when one considers the caselaw, the picture grows darker still. Part II delivers a diagnosis of the central issues surrounding the law of statutory damages. First, courts often disagree over the appropriate justification for statutory damages.25See infra Section II.A. They vacillate between compensatory, punitive, and deterrence-based rationales, and rarely offer anything more than a threadbare explanation for why one justification is preferable to another. Second, copyright’s remedial scheme tends to produce unpredictable or otherwise arbitrary outcomes.26See infra Section II.B. Third, statutory damages can lead to grossly excessive awards.27See infra Section II.C. Due to per-work aggregation, statutory awards can swell up in any case involving more than a single work. Fourth, this per-work structure can discourage courts from properly developing copyright law.28See infra Section II.D. Because statutory damages attach to infringed works rather than infringed rights, courts have little incentive to systematically address or unpack certain rights. The result is an impoverished body of law.

Part III outlines an alternative framework for assessing damages. It begins by surveying previous reform proposals.29See infra Section III.A. It next catalogs a range of doctrinal analogues borrowed from other legal disciplines, including criminal law, civil procedure, and immigration law.30See infra Section III.B. Based on this cross-disciplinary analysis, I then chalk out a more granular account of how courts might go about identifying infringement episodes.31See infra Section III.C. In so doing, I show that the proposed approach can offer a practical blueprint for evaluating the defendant’s conduct in different kinds of cases: file-sharing cases, artistic appropriation cases, and commercial infringement cases.

Part IV considers and rejects a number of potential objections. I defend my use of doctrinal analogues; clarify the scope of this framework by explaining what it can—and cannot—do; and examine how my proposal might ensure that plaintiffs are compensated for the harms they suffered.32See infra Part IV. A brief conclusion follows.

I.  A TALE OF TWO WOES

Copyright law suffuses our world. An incredible constellation of seemingly mundane activities—emailing, posting photos and videos on social media, texting, and even sharing memes—can give rise to claims of copyright infringement.33Shani Shisha, The Folklore of Copyright Procedure, 36 Harv. J.L. & Tech. 61, 62 (2023). The result is that copyright “pervades our cultural universe.”34Id. It has come to “dominate vast swaths of everyday life.”35Id. As one commentator put it, copyright law “touches everyone and everything.”36Jessica Litman, The Exclusive Right To Read, 13 Cardozo Arts & Ent. L.J. 29, 34 (1994).

This Section discusses two of the main features that render modern copyright law so permissive. The first is the fundamental nature of our system as a strict liability regime. The second is a unique remedial framework that entitles copyright owners to recover statutory damages without proof of actual harm. These two features strike at the heart of a formidable regime that has grown precariously overbroad. The risk of accidental infringement, coupled with the attendant threat of exorbitant damages, is all but inescapable. Copyright law is a slumbering giant.

A.  Strict Liability

In the traditional telling, ours is a strict liability regime. Courts routinely treat copyright infringement as a strict liability tort.37See, e.g., Educ. Testing Serv. v. Simon, 95 F. Supp. 2d 1081, 1087 (C.D. Cal. 1999) (“There is no need to prove anything about a defendant’s mental state to establish copyright infringement; it is a strict liability tort.”); Atl. Recording Corp. v. Spinrilla, LLC, 506 F. Supp. 3d 1294, 1315 (N.D. Ga. 2020) (noting that “the Copyright Act is a strict liability statute”); King Recs., Inc. v. Bennett, 438 F. Supp. 2d 812, 852 (M.D. Tenn. 2006) (“[A] general claim for copyright infringement is fundamentally one founded on strict liability.” (quoting Bridgeport Music, Inc. v. 11C Music, 154 F. Supp. 2d 1330, 1335 (M.D. Tenn. 2001))); EMI Christian Music Grp., Inc. v. MP3tunes, LLC, 844 F.3d 79, 89 (2d Cir. 2016) (“Copyright infringement is a strict liability offense in the sense that a plaintiff is not required to prove unlawful intent or culpability.”); Shapiro, Bernstein & Co. v. H.L. Green Co., 316 F.2d 304, 308 (2d Cir. 1963) (explaining that “[w]hile there have been some complaints concerning the harshness of the principle of strict liability in copyright law . . . courts have consistently refused to honor the defense of absence of knowledge or intention”); Toksvig v. Bruce Publ’g Co., 181 F.2d 664, 666 (7th Cir. 1950) (“Intention is immaterial if infringement appears.”); Fitzgerald Publ’g Co. v. Baylor Publ’g Co., 807 F.2d 1110, 1113 (2d Cir. 1986) (“[I]ntent or knowledge is not an element of infringement.”); Gener-Villar v. Adcom Grp., Inc., 509 F. Supp. 2d 117, 124 (D.P.R. 2007) (“One must first take notice that the Copyright Act is a strict liability regime under which any infringer, whether innocent or intentional, is liable.”); Millennium Funding, Inc. v. Priv. Internet Access, Inc., No. 21-cv-01261, 2022 U.S. Dist. LEXIS 187487, at *40 (D. Colo. Oct. 13, 2022) (“Indeed, copyright infringement is a strict liability tort.”); Faulkner v. Nat’l Geographic Soc’y, 576 F. Supp. 2d 609, 613 (S.D.N.Y. 2008) (“Copyright infringement is a strict liability wrong in the sense that a plaintiff need not prove wrongful intent or culpability in order to prevail.”). Scholars take a similar view.38See, e.g., Jacqueline D. Lipton, Cyberspace, Exceptionalism, and Innocent Copyright Infringement, 13 Vand. J. Ent. & Tech. L. 767, 768 (2011) (“Historically, copyright infringement claims have been litigated on a strict liability basis.”); Kent Sinclair, Jr., Liability for Copyright Infringement–Handling Innocence in a Strict-Liability Context, 58 Calif. L. Rev. 940, 944 (1970) (“The rule is well established in copyright law that lack of intention to infringe is not a defense to an action for infringement.”); Oren Bracha & Patrick R. Goold, Copyright Accidents, 96 B.U. L. Rev. 1025, 1028 (2016) [hereinafter Bracha & Goold, Copyright Accidents] (“Copyright law has so far responded to accidents through a rule of strict liability. It does not matter how much care you take to prevent the accidental infringement; if you end up transgressing upon copyright entitlements, you will be held liable.” (footnote omitted)); Patrick R. Goold, Moral Reflections on Strict Liability in Copyright, 44 Colum. J.L. & Arts 123, 125 (2021) [hereinafter Goold, Moral Reflections] (“Copyright infringement is a strict liability tort: Liability attaches when someone infringes the right, regardless of how carefully the defendant tried to prevent any legal wrongdoing.”); Apostolos G. Chronopoulos, Strict Liability and Negligence in Copyright Law: Fair Use as Regulation of Activity Levels, 97 Neb. L. Rev. 384, 386 (2018) (“Copyright infringement is considered to be a strict liability tort.”). It is easy enough to understand why. Copyright liability, after all, does not depend on the infringer’s knowledge, negligence, or intent.39As Dane Ciolino and Erin Donelon put it, liability for copyright infringement can arise even absent “scienter, intent, knowledge, negligence, or similar culpable mental state. On the contrary, liability for civil copyright infringement is strict.” Dane S. Ciolino & Erin A. Donelon, Questioning Strict Liability in Copyright, 54 Rutgers L. Rev. 351, 356 (2002). It is simply irrelevant whether the infringer knew that their actions constituted copyright infringement. Liability is not conditioned upon any kind of fault. To establish a prima facie case, the copyright owner need not make a showing of knowledge or negligence on the part of the defendant. Copyright infringement, in short, is a strict liability tort.

But why is that so? One standard answer is that copyright is a form of property entitlement, and violations of property rights ought to be punishable regardless of the transgressor’s fault.40Bracha & Goold, Copyright Accidents, supra note 38, at 1028; Sinclair, supra note 38, at 945 (emphasizing that the concept of “absolute liability” in copyright law tracks the idea that “literary works . . . must be afforded legal protection to the same extent as his real or personal property” (footnote omitted)); Lipton, supra note 38, at 769–70. Another explanation is that the defendant should face absolute liability because they are better positioned to avoid loss, at least as compared to the copyright owner.41Lipton, supra note 38, at 770; 4 Melville B. Nimmer & David Nimmer, Nimmer on Copyright § 13.08 (2023). The idea is that “as between owners and infringers, it is more efficient for infringers to bear the costs of infringement.”42Ciolino & Donelon, supra note 39, at 376 (citing Eaton S. Drone, A Treatise on the Law of Property in Intellectual Productions in Great Britain and the United States 403 (Rothman Reprints 1972)). Yet another familiar theme is that a strict liability standard serves an evidentiary function: it relieves plaintiffs of the burden of proving the infringer’s state of mind.43Lipton, supra note 38, at 770–71.

To understand how our strict liability system came to be, it is critical to consider its historical genesis. As others have noted, Judge Learned Hand first laid the foundations for a strict liability regime in a spate of early-twentieth-century decisions dealing with copyright infringement.44E.g., Goold     , Moral Reflections, supra note 38, at 125. In these cases, Judge Hand dismissed the defendants’ claims of ignorance on the theory that the copyrights at issue were registered with the Copyright Office: the defendants, Judge Hand suggested, were on notice as to the existence of the copyright.45See, e.g., Stern v. Jerome H. Remick & Co., 175 F. 282, 282–83 (C.C.S.D.N.Y. 1910) (noting that the defendant “had means of knowledge from the copyright office that the [work] had been in fact copyrighted; and he, like anyone else, took his chances when he published the song without any inquiry”). At the time these decisions were issued, copyright registration was mandatory.46See Shisha, supra note 33, at 70–71; Act of May 31, 1790, ch. 15, § 3, 1 Stat. 124, 124 (amended 1831) (mandating registration with the clerk’s office at the author’s local district court); Act of June 30, 1834, ch. 157, § 1, 4 Stat. 728, 728 (requiring that copyright owners record “deeds or instruments in writing” for the transfer or assignment of copyrights). Failure to register the rights would lead to copyright forfeiture.47Shisha, supra note 33, at 68 (“When an author failed to register the work, deposit copies, append a notice to published copies, or give public notice in a newspaper, she would forfeit her copyright.” (footnote omitted)). Because the infringed works were registered, Judge Hand could reasonably surmise that the defendants had proper notice of the status of the works they had copied.

Thus, a strict liability regime was defensible on more functional grounds. It arose at a time when copyright registration was mandatory. Registration provided notice of a work’s legal status. And copyright protection was also conditioned on an additional notice affixed to copies of the work—every published copy had to include a notice specifying who the copyright owner was and when the copyright was registered.48See id. at 70–71; Act of Apr. 29, 1802, ch. 36, § 1, 2 Stat. 171, 171 (requiring that copyright owners “give information by causing the copy of the record . . . to be inserted at full length in the title-page or in the page immediately following the title of every such book or books”); Act of Mar. 4, 1909, ch. 320, § 12, 35 Stat. 1075, 1078 (amended 1976) (conditioning copyright protection upon “publication of the work with the notice of copyright”).

But these formalities were eliminated or rendered optional in the late twentieth century.49For a discussion of copyright formalities, see Shisha, supra note 33, at 70–76 (noting that registration is no longer a precondition to copyright protection, while the notice requirement has been jettisoned altogether). Consequently, under our current system, it has become increasingly difficult to track down and identify the owners of certain works.50Olive Huang, U.S. Copyright Office Orphan Works Inquiry: Finding Homes for the Orphans, 21 Berkeley Tech. L.J. 265, 265 (2006) (“[O]wnership information for a copyrighted work is sometimes hard to find, and tracking down the owner to ask permission presents daunting challenges for potential users.”). Copyright protection today vests automatically—authors need not take any affirmative steps to register their rights or apply for copyright protection.51Indeed, “[c]opyright attaches to an original work of authorship the moment it is fixed in some tangible form. Authors need not take any affirmative steps to claim copyright protection; original works are protected by default. Modern copyright law is thus a system of unconditional protection—one in which copyright vests automatically.” See Shisha, supra note 33, at 62–63. This complicates matters. Because copyrights vest automatically but are not registered, the risk of accidental infringement has grown measurably. It is indeed harder to justify a strict liability regime in a world where the risk of accidental infringement is so high.

Meanwhile, copyright’s strict liability regime has been the subject of sustained scholarly attention. A strict liability regime, scholars contend, is no longer workable. By increasing the likelihood that inadvertent infringers face liability, a strict liability scheme threatens to disproportionately expand the scope of copyright entitlements. Dane Ciolino and Erin Donelon, for instance, argue that strict liability frustrates the very objectives underlying our copyright system—it overprotects preexisting works, thus decreasing accessibility and preventing future authors from engaging with and building on existing works.52Ciolino & Donelon, supra note 39, at 410–15. Patrick Goold rails against copyright’s strict liability regime on moral grounds. Quite apart from any of the practical consequences that might attend a strict liability system, Goold explains that liability for accidental infringement is morally unjust—it is simply unfair to impose liability on a defendant who did everything they reasonably could to avoid infringement.53Goold, Moral Reflections, supra note 38, at 126 (“My question is not whether strict liability fails to properly deter accidents or inhibits creativity (which it does), but more simply, whether strict liability is fair. I make an argument that strict liability is not fair because it results in copyright users being held liable for accidents for which they are not morally responsible.”). Avihay Dorfman and Asaf Jacob claim that strict liability makes little sense in a system focused on intangible goods—among other problems, intangible goods are difficult to define and their legal status is often uncertain.54Avihay Dorfman & Assaf Jacob, Copyright as Tort, 12 Theoretical Inquiries L. 59, 87–96 (2011). Finally, Jacqueline Lipton asserts that strict liability aligns poorly with the realities of the digital era.55Lipton, supra note 38, at 808–09. In a world where technology allows for the mechanical, accidental, and often involuntary copying of works, strict liability is inappropriate.56Id. at 784–801.

In 2012, Shyamkrishna Balganesh offered a more systematic account of copyright infringement.57Shyamkrishna Balganesh, The Obligatory Structure of Copyright Law: Unbundling the Wrong of Copying, 125 Harv. L. Rev. 1664, 1682 (2012). As Balganesh points out, the basic structure of the infringement action does not require proof of harm. To make a prima facie case of copyright infringement, the plaintiff must prove (1) ownership of a valid copyright in the work and (2) infringement of one of the plaintiff’s rights, requiring both copying-in-fact and improper appropriation.58See Feist Publ’ns, Inc. v. Rural Tel. Serv. Co., 499 U.S. 340, 361 (1991); see also Nimmer & Nimmer, supra note 41, § 13D.02.

The question of harm is curiously absent from this basic scheme. As Balganesh notes, it is only later in the process, when confronting the question of fair use, that the issue of harm comes into play. Fair use is copyright’s most important defense to claims of infringement.59Abraham Bell & Gideon Parchomovsky, Propertizing Fair Use, 107 Va. L. Rev. 1255, 1257 (2021) (describing fair use as “the most significant and most capacious defense against copyright infringement”). In determining whether the defendant’s copying qualifies as fair use, courts consider a range of statutory factors, including the question of “market harm”—the “effect of the use upon the potential market” for the copyrighted work.6017 U.S.C. § 107. So although harm is not an element of the infringement tort, it surfaces as part of the fair use inquiry. The issue of harm, then, is baked into the defense stage. Nevertheless, although the question of harm is central to the fair use analysis, the larger point remains: our copyright system is essentially a no-fault regime, one in which it “makes little difference for liability whether the copying was intentional, negligent, or a genuine mistake.”61Balganesh, supra note 57, at 1682.

Others, though, question whether our system is truly a strict liability regime. In a series of articles published over the past few years, Goold articulated an intricate account calling into question the conventional view that copyright is best conceptualized as a strict liability regime.62See generally Patrick R. Goold, Is Copyright Infringement a Strict Liability Tort?, 30 Berkeley Tech. L.J. 305 (2015) [hereinafter Goold, Copyright Infringement] (suggesting that copyright is best conceptualized as a fault-based regime); Bracha & Goold, Copyright Accidents, supra note 38 (questioning whether strict liability is the appropriate liability standard for copyright accidents and considering a range of potential alternatives); Goold, Moral Reflections, supra note 38 (asserting that copyright’s strict liability standard is morally unfair because it leads to copyright liability for accidents for which defendants are not morally responsible). Goold posits that the infringement tort does, in fact, accommodate a fault element. This element, however, has gone largely unnoticed thanks to copyright’s messy structure.63Goold, Copyright Infringement, supra note 62, at 338. The fair use principle is again central to the story: Goold suggests that the fairness standard reflected in the fair use inquiry is a kind of fault standard, somewhat analogous to the reasonableness standard undergirding negligence law.64Id. at 340–50. As a result, the fair use inquiry has a dual role—establishing whether the proscribed conduct (copying) was harmful and, in addition, whether the copying was wrongful. Id. at 338. In Goold’s formulation, copyright liability is not conditioned on the infringer’s state of mind, but rather depends upon the infringer’s failure to satisfy a standard of conduct. Id. at 340. For liability to arise, it is not enough that the defendant created a substantially similar work—“the copying must also be unfair.” Id.

But whatever one makes of Goold’s analysis, it is not at all clear that the fair use defense can meaningfully relax the no-harm regime that lies at the heart of copyright’s prima facie case. For many would-be infringers, the fair use defense is no silver bullet. The fundamental problem with fair use is that it relies on an ex-post analysis of four factors.65Shani Shisha, The Copyright Wasteland, 47 BYU L. Rev. 1721, 1779 (2022) (“[T]he fair use doctrine rests on a statutory test that is flexible by design: Courts examine whether the defendant’s use is fair on a case-by-case basis against four statutory factors.”); Clark D. Asay, Arielle Sloan & Dean Sobczak, Is Transformative Use Eating the World?, 61 B.C. L. Rev. 905, 917 (2020) (“Fair use is meant to be a flexible standard . . . that courts can adapt to achieve the most just results in any given situation.”). By design, the fair use inquiry is case specific and can only be tested by a court after the fact.66Larry Lessig famously described fair use as “the right to hire a lawyer.” Lawrence Lessig, Free Culture 187 (2004). As Peter Jaszi explains:

The statutory formulation [of fair use] . . . is too vague and open-ended to be relied upon effectively; its real utility is severely limited because fair use claims can be tested only after the fact of use and then only when a creator relying on the doctrine is able to retain legal counsel and willing to expose himself or herself to considerable economic risk in the event that the defense fails.

Peter Jaszi, Copyright, Fair Use and Motion Pictures, 2007 Utah L. Rev. 715, 729 (2007); see also James Gibson, Risk Aversion and Rights Accretion in Intellectual Property Law, 116 Yale L.J. 882, 889 (2007) (“From the ex post perspective of the defendant already embroiled in expensive litigation, an adaptable, equitable defense is useful. But for the prospective defendant wondering whether a given act will prove to be infringing, fair use is too ambiguous to provide much ex ante guidance.”).
To prevail on fair use, the defendant must withstand lengthy and uncertain litigation.67Am. Intell. Prop. L. Ass’n, Report of the Economic Survey 44 (2017) (estimating that the median cost of copyright litigation ranges from $200,000 to $1 million). For that reason, risk-averse defendants would often choose to settle out of court.68Shisha, supra note 33, at 120. With the specter of statutory damages looming in the background, the risks of going to court are too grave. In such circumstances, the safe choice—and perhaps the only sensible one—is to settle.

The idea here is clear: whether the fair use doctrine introduces a fault standard, as Goold claims, is beside the point. For many would-be infringers, the fair use defense never really comes into play. And while it may be true that, as an analytical matter, copyright infringement is best characterized as a fault liability tort, this is largely tangential to the broader argument I advance here—that our current regime is overly burdensome.

B.  Statutory Damages

A successful plaintiff in an infringement action may pursue one of two remedial avenues.6917 U.S.C. §§ 504(a), (c)(1) (“[T]he copyright owner may elect, at any time before final judgment is rendered, to recover, instead of actual damages and profits, an award of statutory damages for all infringements involved in the action, with respect to any one work.”). The first entitles a victorious plaintiff to collect actual damages plus infringement-related profits.70Id. § 504(b). The second relieves a prevailing plaintiff of the burden of establishing actual harm, allowing instead for the recovery of statutory damages.71Id. § 504(c). Ordinarily, a plaintiff may recover statutory damages ranging from $750 to $30,000 for any infringed work.72Id. § 504(c)(1). The number of statutory awards depends on the number of infringed works.73Id. (stating that the plaintiff is entitled to recover a separate award for “all infringements involved in the action, with respect to any one work” (emphasis added)); see also Nimmer & Nimmer, supra note 41, § 14.04(E)) (“Where the suit involves infringement of more than one separate and independent work, minimum statutory damages for each work must be awarded. For example, if one defendant has infringed three copyrighted works, the copyright owner is entitled to statutory damages of at least $750 and may be awarded up to $30,000.” (quoting H.R. Rep. No. 94-1476, at 162 (1976))). Each statutory award may be reduced to a sum of no less than $200 per work in cases of innocent infringement,7417 U.S.C. § 504(c)(2). or increased—up to a sum of $150,000 per work—in cases of willful infringement.75Id.

As a practical matter, the Copyright Act offers little guidance on how courts might assess statutory damages.76The Copyright Act is codified in Title 17 of the United States Code. 17 U.S.C. §§ 101–1511. As § 504 (“section 504”) of the copyright statute makes clear, the court is at liberty to adjust the award as it “considers just.”77Id. § 504(c)(1). In practice, in the absence of a jury trial, courts often assess damages by reference to four principal factors: (1) the harm suffered by the plaintiff; (2) the profits collected by the defendant; (3) the infringer’s state of mind, namely, whether the infringement was willful, knowing, or innocent; and, finally, (4) whether either of the parties had violated its contractual obligations.78Nimmer & Nimmer, supra note 41, § 14.04(B)(1)(a). More broadly, some courts also attend to the objectives animating our statutory regime. The Second Circuit, for example, often undertakes a more elaborate inquiry into “the deterrent effect on the infringer and third parties,” as well as “the infringer’s cooperation in providing evidence concerning the value of the infringing material” and “the conduct and attitude of the parties.”79Bryant v. Media Right Prods., Inc., 603 F.3d 135, 144 (2d Cir. 2010) (assessing statutory damages under § 504(c)(2)); Pearson Educ., Inc. v. Arora, 717 F. Supp. 2d 374, 380 (S.D.N.Y. 2010), aff’d, 448 F. App’x 163 (2d Cir. 2012) (considering the same factors in assessing statutory damages under § 504(c)(1)).

Moreover, as noted above, the statutory award may be decreased to a sum of no less than $200 per work in cases of innocent infringement.8017 U.S.C § 504(c)(2). In such cases, the defendant bears the burden of proving that they were not aware, and had no reason to believe, that their actions were infringing.81Id. (“In a case where the infringer sustains the burden of proving, and the court finds, that such infringer was not aware and had no reason to believe that his or her acts constituted an infringement of copyright, the court in its discretion may reduce the award of statutory damages to a sum of not less than $200.”). The burden here is twofold: the defendant must establish both that they did not believe their acts constituted infringement and that their belief was reasonable. Whether an award is ultimately reduced, though, remains a discretionary matter, and a court may refuse to do so even in cases involving innocent infringers.82Nimmer & Nimmer, supra note 41, § 14.04(B)(2)(a). Finally, it is also important to note that the Copyright Act precludes reliance on the “innocent infringement” defense when a defendant had access to published copies bearing a copyright notice.8317 U.S.C. §§ 401(d), 402(d).

Similarly, in cases of willful infringement, the court may enhance the amount of statutory damages to a sum of “not more than $150,000” per work.84Id. § 504(c)(2). Whether the infringement was “willful” depends on the infringer’s state of mind.85Erickson Prods., Inc. v. Kast, 921 F.3d 822, 833 (9th Cir. 2019) (“A determination of willfulness requires an assessment of a defendant’s state of mind.” (citing Friedman v. Live Nation Merch., Inc., 833 F.3d 1180, 1186 (9th Cir. 2016))); Russell v. Walmart Inc., No. CV 19-5495, 2020 U.S. Dist. LEXIS 252882, at *27–29 (C.D. Cal. Oct. 16, 2020). And although a determination of willfulness can prove somewhat elusive,86Shisha, supra note 65, at 1757 (noting that courts “have done little to narrow down or clearly define ‘willful infringement’ ” (citation omitted)). the prototypical case of willful infringement involves a defendant who knowingly infringed the plaintiff’s rights87See Nimmer & Nimmer, supra note 41, § 14.04(B)(3)(a). or otherwise displayed “reckless disregard for, or willful blindness to, the copyright holder’s rights.”8817 U.S.C. § 504(c)(2); see also Unicolors, Inc. v. Urb. Outfitters, Inc., 853 F.3d 980, 991 (9th Cir. 2017); Sony BMG Music Ent. v. Tenenbaum, 660 F.3d 487, 507–08 (1st Cir. 2011); Louis Vuitton Malletier, S.A. v. Akanoc Sols., Inc., 658 F.3d 936, 944 (9th Cir. 2011); Graper v. Mid-Continent Cas. Co., 756 F.3d 388, 394 (5th Cir. 2014); Island Software & Comput. Serv., Inc. v. Microsoft Corp., 413 F.3d 257, 263 (2d. Cir. 2005); Wildlife Express Corp. v. Carol Wright Sales, Inc., 18 F.3d 502, 511–12 (7th Cir. 1994); Yurman Design, Inc. v. PAJ, Inc., 262 F.3d 101, 112 (2d Cir. 2001); Cent. Point Software, Inc. v. Glob. Software & Accessories, Inc., 880 F. Supp. 957, 967 (E.D.N.Y. 1995). The point is that, even absent actual knowledge of infringement, the defendant may be held liable for willful infringement so long as they displayed reckless disregard of the plaintiff’s rights.

One strong indication of reckless disregard is a history of past infringements.89Nimmer & Nimmer, supra note 41, § 14.04(B)(3)(a); Lauratex Textile Corp. v. Allton Knitting Mills Inc., 517 F. Supp. 900, 903–04 (S.D.N.Y. 1981). As the Lauratex court emphasized, the defendant had been on the receiving end of

copyright infringement suits brought by converters ten times (including this action) in the last five years. Five of those actions were settled, two are still pending and two resulted in judgments in favor of the plaintiffs. The inference is inescapable that [the defendant] has made a practice of copying the designs of other converters, and that an award of statutory damages is appropriate as a deterrent to further activity of this kind.

 Id. (footnote omitted).
Another piece of evidence probative of willfulness is the infringer’s past record of seeking authorization prior to using other copyrighted works.90Consider, for example, Beastie Boys v. Monster Energy Co., 66 F. Supp. 3d 424 (S.D.N.Y. 2014). The Beastie Boys brought action for copyright infringement against Monster Energy. Id. at 427–28. The Beastie Boys alleged that Monster had used their music in the company’s promotional video. Id. at 428. At trial, Monster chose not to contest the issue of liability, and the trial focused instead on the question of damages. Id. at 432–33. The jury returned a verdict in favor of the plaintiffs, awarding the Beastie Boys $1.2 million in statutory damages based on a finding of willful infringement. Id. at 435. The court upheld the verdict as reasonable, explaining that Monster’s marketing director had previously produced a large number of promotional videos—for which he sought permission in writing. Id. at 442–43. The director had also taken an aggressive stance in guarding against violations of Monster’s own intellectual property in the past. Id. Likewise, Monster failed to craft a policy to regulate its music licensing. Id. Based on these reasons, the court concluded that the jury’s finding of willfulness was well-grounded in the factual record. Id. For a more thorough description of the Beastie Boys case, see Nimmer & Nimmer, supra note 41, § 14.04(B)(3)(a). Indeed, a defendant who was previously vigilant in obtaining licenses but failed to do so at a later point may be deemed to have engaged in willful infringement. In addition, the Ninth Circuit has suggested that a jury could find willfulness where a company failed to promulgate any procedures for “establishing or reporting on who holds the rights to the [works] whose use is proposed.”91Friedman v. Live Nation Merch., Inc., 833 F.3d 1180, 1186 (9th Cir. 2016). And a court may likewise draw an inference of willful infringement when the defendant ignores a letter informing them, in concrete terms, of infringement.92See Chi-Boy Music v. Charlie Club, Inc., 930 F.2d 1224, 1227–28 (7th Cir. 1991); N.A.S. Import, Corp. v. Chenson Enters., Inc., 968 F.2d 250, 253 (2d Cir. 1992).

At the same time, a host of countervailing factors might cut against a finding of willfulness. For one thing, when fair use is a close call, some courts insist that it would be inappropriate to infer willfulness even if the defendant is ultimately found liable.93Nimmer & Nimmer, supra note 41, § 14.04(B)(3)(a). Take the case of Princeton University Press v. Michigan Document Services, Inc., in which a university press sued a copy shop for selling unauthorized “coursepacks.”94Princeton Univ. Press v. Mich. Document Servs., Inc., 99 F.3d 1381, 1383 (6th Cir. 1996); see also Nimmer & Nimmer, supra note 41, § 14.04(B)(3)(a). The court concluded that the defendant’s copying did not constitute fair use95Princeton Univ. Press, 99 F.3d at 1392. but took pains to note that the question was a close one—indeed, there were “forcefully argued dissents” on the issue.96Id. As a consequence, the court declined to determine whether “the defendants’ belief that their copying constituted fair use was so unreasonable as to bespeak willfulness.”97Id.

Courts have also refused to countenance claims of willful infringement when the defendant had a reasonable belief that their actions were legally permissible, even if they had been notified that their use of the work was potentially infringing.98See, e.g., RCA/Ariola Int’l, Inc. v. Thomas & Grayston Co., 845 F.2d 773, 779 (8th Cir. 1988); MJ Int’l, Inc. v. Hwangpo, No. 8:01CV201, 2002 U.S. Dist. LEXIS 11079, at *8 (D. Neb. Mar. 13, 2002). Furthermore, some plaintiffs may struggle to establish claims of willful infringement when the defendant did not consult a lawyer99See, e.g., Henley v. DeVore, 733 F. Supp. 2d 1144, 1165–66 (C.D. Cal. 2010). or when the infringer performed a search of Copyright Office records but could not locate any records for the plaintiff’s work.100See, e.g., U.S. Media Corp. v. Edde Ent., Inc., No. 94 Civ. 4849, 1996 U.S. Dist. LEXIS 13389, at *21–22 (S.D.N.Y. Sept. 12, 1996) (noting that the court lacks evidentiary basis to determine “whether some or all of the defendants—in reliance on a copyright search—acted in the good-faith belief that a lack of copyright registration betokens a lack of statutory protection and whether such a belief, at least with respect to the five films at issue, would have been reasonable”).

In short, a successful plaintiff may choose to recover statutory damages at any point before a final judgment is rendered. Typically, a statutory award can range from $750 to $30,000 for every infringed work.10117 U.S.C. § 504(c)(1). The award may be reduced to a sum of no less than $200 per work in cases of innocent infringement or increased to a sum of no more than $150,000 per work in cases of willful infringement.102Id. § 504(c)(2). The number of statutory works is tied to the number of infringed works and, in some cases, the number of infringers.103Friedman v. Live Nation Merch., Inc., 833 F.3d 1180, 1189–92 (9th Cir. 2016).

This Section draws out two of the central problems ailing our copyright system. The first is that copyright infringement is principally understood to be a strict liability regime. And while this regime is tempered by the flexible fair use standard, it is also true that fair use is mostly irrelevant for a great many defendants—namely, those who might choose to settle out of court. The second major issue confronting our system is that copyright owners can choose to recover a separate award of statutory damages for every implicated work. Since these awards can range anywhere from $200 to $150,000 per work,10417 U.S.C. § 504(c)(1)–(2). vast fortunes hang in the balance.

One might inquire, however, as to the interaction between these two aspects of our copyright system. While the defendant’s state of mind is wholly absent from the liability analysis, it does inform the damages calculus. As this Section makes plain, whether the infringer had real or constructive knowledge that their acts were infringing is key to determining whether they face reduced or increased statutory damages. Therefore, in a sense, these two features of our system—strict liability and statutory damages—work in unison. Theoretically, when liability is imposed on an innocent infringer, courts may choose to reduce the award under the category of innocent infringement (in which case, the award could be as low as $200 per work).105Id. § 504(c)(2). When one mechanism (strict liability) fails, judges can tap into the other (statutory damages) to prevent an unjust outcome—that is, to prevent an inadvertent infringer from facing an excessive award.

There is just one problem: in reality, the law of statutory damages is arbitrary, haphazard, and unpredictable. Part II will explore the law in action. It will show that different courts harbor very different ideas about what might qualify as an appropriate statutory award. Indeed, courts issue wildly divergent awards in factually similar cases. And, worse, courts cannot even agree on what statutory damages are meant to achieve: some believe statutory damages are rooted in a compensatory rationale, while others insist that statutory damages are meant to punish or deter future infringers. The result is a body of caselaw that often seems excessive or unprincipled.

II.  STATUTORY DAMAGES IN ACTION

This Part discusses the principal issues plaguing the law of statutory damages. First, courts often disagree over the appropriate justification for statutory damages, oscillating between compensatory, punitive, and deterrence-based concerns. Second, the caselaw turns out to be unpredictable or otherwise arbitrary—in cases sharing a similar fact pattern, courts mete out divergent awards. In part, this is because various courts employ different multipliers to calculate damages. Third, statutory damages can at times lead to shockingly excessive awards. Because statutory damages are computed on a per-work basis, awards can dramatically rack up in cases that implicate multiple works. Fourth, and perhaps most bafflingly, the per-work structure of statutory damages can produce some unintended consequences: because damages attach to infringed works rather than infringed rights, courts tend to systematically gloss over certain exclusive rights. The net result is an underdeveloped body of law. I take up these issues in turn.

A.  Justifications

In many cases, courts award statutory damages “with little to no attention . . . to their underlying purpose.”106Bracha & Syed, supra note 14, at 1249. Debates about statutory damages tend to home in on three standard justifications. First, some courts believe that statutory damages are grounded in a compensatory-evidentiary justification. The general idea is that copyright owners may struggle to prove actual harm.107Shisha, supra note 65, at 1758–61. Indeed, proponents of statutory damages believe that there is something distinctive about the evidentiary challenges facing copyright plaintiffs.108F.W. Woolworth Co. v. Contemp. Arts, Inc., 344 U.S. 228, 232 (1952) (stressing that “[f]ew bodies of law would be more difficult to reduce to a short and simple formula than that which determines the measure of [damages in copyright cases]”). Imagine, for example, a copyright owner whose work was illegally distributed online and then downloaded by third parties. In such circumstances, a court might struggle to “assess how many of these third-party downloads dislodged actual sales that would have otherwise taken place if the work had been distributed by the plaintiff.”109Shisha, supra note 65, at 1759. In other instances, the information needed to prove actual damages might lie “uniquely within the infringers’ control.”110Clever Covers, Inc. v. Sw. Fla. Storm Def. LLC, 554 F. Supp. 2d 1303, 1311 (M.D. Fla. 2008) (citation omitted).

Given these challenges, the argument goes, statutory damages serve a crucial role in ensuring that rightsholders get compensated for their losses. Without statutory damages, copyright holders would face an all-or-nothing regime; they would not be able to obtain compensation at all if they cannot prove actual loss. On this view, statutory damages are directed at a particular problem—the difficulty of proving actual damages—and should thus seek to account for actual harm. They should approximate, if only imperfectly, “the amount that would be recovered as actual compensation.”111Bracha & Syed, supra note 14, at 1231.

This compensatory justification, simple and intuitive as it may seem, has found favor with many courts.112Courts often cite the compensatory rationale alongside other relevant factors that appear to justify statutory damages. See, e.g., Lauratex Textile Corp. v. Allton Knitting Mills Inc., 517 F. Supp. 900, 903 (S.D.N.Y. 1981); Cable/Home Commc’n Corp. v. Network Prods., Inc., 902 F.2d 829, 850–51 (11th Cir. 1990); Fitzgerald Publ’g Co. v. Baylor Publ’g Co., 670 F. Supp. 1133, 1140 (E.D.N.Y. 1987), aff’d sub nom. Fitzgerald v. Baylor Publ’g, 862 F.2d 304 (2d Cir. 1988); Pret-A-Printee, Ltd. v. Allton Knitting Mills, Inc., No. 81 Civ. 3770, 1982 U.S. Dist. LEXIS 15108, at *9–10 (S.D.N.Y. Sept. 16, 1982); Peter Pan Fabrics, Inc. v. Jobela Fabrics, Inc., 329 F.2d 194, 195–96 (2d Cir. 1964) (discussing the statutory framework predating the 1976 statute); Downs v. Yeshiva World News, LLC, No. 18-CV-0250, 2019 U.S. Dist. LEXIS 17751, at *5 (E.D.N.Y. Feb. 1, 2019); Van Der Zee v. Greenidge, No. 03 CIV. 8659, 2006 U.S. Dist. LEXIS 400, at *3–4 (S.D.N.Y. Jan. 6, 2006); Star’s Edge, Inc. v. Braun (In re Braun), 327 B.R. 447, 450 (Bankr. N.D. Cal. 2005); Warner Bros. Inc. v. Dae Rim Trading, Inc., 877 F.2d 1120, 1126 (2d Cir. 1989). The legislative history, too, seems to indicate that compensatory instincts partly underlie our statutory scheme.113See Staff of H. Comm. on the Judiciary, 87th Cong., Report of the Register of Copyrights on the General Revision of the U.S. Copyright Law 102 (Comm. Print 1961) (noting that “[t]he value of a copyright is, by its nature, difficult to establish, and the loss caused by an infringement is equally hard to determine”); see also Bracha & Syed, supra note 14, at 1231 n.68.

Yet, as Bracha and Syed point out, the evidentiary justification suggests two related principles. First, if our goal is to address the difficulties of proving harm, statutory damages should only be available when evidence of actual harm is unavailable or difficult to obtain. Second, statutory damages should approximate actual harm as best as possible.114Bracha & Syed, supra note 14, at 1232.

The problem, of course, is that neither principle finds support in the caselaw or in the language of the copyright statute. Nowhere does the Copyright Act limit the availability of statutory damages only to circumstances where evidence of actual harm is difficult to recover.115See 17 U.S.C. § 504. To the contrary: the Copyright Act expressly entertains a heightened category of culpability—willful infringement—where damages would be increased beyond a purely compensatory level. 17 U.S.C. § 504(c)(2). If statutory damages were strictly about compensating copyright owners, the Copyright Act would not establish an elaborate scheme based on the infringer’s state of mind. Courts, in turn, treat statutory damages “as a matter of unqualified right.”116Bracha & Syed, supra note 14, at 1232. And statutory awards often seem wholly unmoored from compensatory instincts—courts have developed no rules to cap statutory damages based on an assessment of actual damages. In fact, courts sometimes calculate the proper compensatory sum and then multiply it to arrive at a statutory award that is, by definition, supracompensatory.117Id. at 1232–33; see also 2 Paul Goldstein, Goldstein on Copyright § 14.2.1.1(b) (3d ed. Supp. 2012). As Goldstein explains, when there is no evidence of actual damages, courts often just switch to a different justification, such as a deterrence-based rationale. Id. Indeed, “[i]n cases where the evidence provides few if any clues for approximating actual damages and profits, courts often turn to the underlying rationale for statutory damages—sustaining copyright incentives while deterring infringement.” Id. The compensatory rationale, then, does not quite map onto the caselaw.

Another justification courts occasionally invoke is that statutory damages serve a punitive or retributive role—punishing the infringer for their wrongful conduct.118Bracha & Syed, supra note 14, at 1233–34. Many courts endorse this idea of punishment as a justification for statutory damages.119Energy Intel. Grp., Inc. v. CHS McPherson Refinery, Inc., 300 F. Supp. 3d 1356, 1380 (D. Kan. 2018) (“[A] statutory damages award may properly be ‘wholly punitive’ in nature.”); L.A. News Serv. v. Reuters Television Int’l, Ltd., 149 F.3d 987, 996 (9th Cir. 1998) (citing both compensatory and punitive principles as a justification for statutory damages); Energy Intel. Grp., Inc. v. Kayne Anderson Cap. Advisors, L.P., 948 F.3d 261, 272 (5th Cir. 2020) (noting that “the modern Copyright Act’s statutory damages regime has a significant deterrent and potentially punitive purpose”); Dream Games of Ariz., Inc. v. PC Onsite, 561 F.3d 983, 992 (9th Cir. 2009) (“Statutory damages [have] ‘compensatory and punitive purposes.’ ” (citation omitted)). Many courts recognize that punitive damages are available only for a specific subset of cases—those involving “willful infringement” under section 504(c)(2). See, e.g., On Davis v. Gap, Inc., 246 F.3d 152, 172 (2d Cir. 2001) (“The purpose of punitive damages—to punish and prevent malicious conduct—is generally achieved under the Copyright Act through the provisions of 17 U.S.C. § 504(c)(2).”); Kamakazi Music Corp. v. Robbins Music Corp., 534 F. Supp. 69, 78 (S.D.N.Y. 1982) (“The public policy rationale for punitive damages of punishing and preventing malicious conduct can be properly accounted for in the provisions for increasing a maximum statutory damage award . . . per infringement found to be willful.”); Nintendo of Am., Inc. v. Dragon Pac. Int’l, 40 F.3d 1007, 1011 (9th Cir. 1994) (“[W]hen infringement is willful, the statutory damages award may be designed to penalize the infringer.” (quoting Chi-Boy Music v. Charlie Club, Inc., 930 F.2d 1224, 1228–29 (7th Cir. 1991)). And while judges seldom clarify exactly what they mean in describing statutory damages as “punitive,” the basic idea is simple: it is fair for the wrongdoer to suffer in direct proportion to the wrong they inflicted. The wrongdoer deserves to be punished.

This punitive rationale meshes rather neatly with the language of the Copyright Act. The statute, after all, establishes a tripartite framework to distinguish among different categories of infringement—regular, innocent, and willful—based on the infringer’s mental state.120See supra Section I.B. That is, the infringer’s level of culpability is measured by reference to their state of mind. Correspondingly, the appropriate remedy is adjusted in proportion to the infringer’s degree of culpability—namely, whether they had actual or constructive knowledge that their actions were infringing.

But the larger picture, again, is quite messy. As Bracha and Syed observe, it would be peculiar for our system to enforce a punitive rationale through copyright’s statutory damages scheme.121Bracha & Syed, supra note 14, at 1235–36. That is because our copyright system already classifies certain acts as criminal offenses.12217 U.S.C. § 506. Retribution is widely understood to be the classic justification for criminal sanctions.123Thomas E. Robins, Retribution, the Evolving Standard of Decency, and Methods of Execution: The Inevitable Collision in Eighth Amendment Jurisprudence, 119 Penn St. L. Rev. 885, 889 (2015) (“Since Immanuel Kant’s Philosophy of Law, retribution has been a mainstay of criminal law theory, argued over in classrooms and academic journals for centuries. Retribution remains a fundamental, if controversial, precept of criminal law theory.”). Criminal enforcement is also subject to a host of procedural and substantive safeguards, as well as a heightened burden of persuasion.124Kenneth Mann, Punitive Civil Sanctions: The Middleground Between Criminal and Civil Law, 101 Yale L.J. 1795, 1803–13 (1992) (discussing some of the classic elements, both procedural and substantive, that differentiate criminal from civil law). Accordingly, one might question whether copyright’s scheme of civil remedies should serve any punitive purposes. Why should we impose punitive penalties for conduct that does not meet the heightened standard for criminal culpability under the Copyright Act?

Moreover, the caselaw remains inconsistent and unpredictable. Some courts appear to recognize both compensatory and punitive instincts as proper justifications for statutory damages.125See, e.g., L.A. News Serv. v. Reuters Television Int’l, Ltd., 149 F.3d 987, 996 (9th Cir. 1998); Energy Intel. Grp., Inc. v. Kayne Anderson Cap. Advisors, L.P., 948 F.3d 261, 272 (5th Cir. 2020); Dream Games of Ariz., Inc. v. PC Onsite, 561 F.3d 983, 992 (9th Cir. 2009). Others embrace a more constricted view, insisting that punitive damages should be available only in cases of willful infringement.126See, e.g., On Davis v. Gap, Inc., 246 F.3d 152, 172 (2d Cir. 2001); Kamakazi Music Corp. v. Robbins Music Corp., 534 F. Supp. 69, 78 (S.D.N.Y. 1982); Nintendo of Am., Inc. v. Dragon Pac. Int’l, 40 F.3d 1007, 1011 (9th Cir. 1994)). And sometimes courts dismiss the punitive rationale altogether, implying that statutory damages that extend beyond the purely compensatory level reflect an impermissible “windfall” for the plaintiff.127See, e.g., Desire, LLC v. Manna Textiles, Inc., 986 F.3d 1253, 1271 (9th Cir. 2021) (“ ‘Statutory damages are intended as a substitute for profits or actual damage’ . . . and should not provide copyright owners a windfall.” (citation omitted)); Atari Interactive, Inc. v. Redbubble, Inc., 546 F. Supp. 3d 883, 888 (N.D. Cal. 2021) (“A statutory damages award ‘must bear a plausible relationship to Plaintiff’s actual damages,’ . . . and ‘should not provide copyright owners a windfall.’ ” (citations omitted)); Peer Int’l Corp. v. Luna Recs., Inc., 887 F. Supp. 560, 569 (S.D.N.Y. 1995) (“Statutory damages are not intended to provide a plaintiff with a windfall recovery.”); Malibu Media, LLC v. Danford, No. 2:14-cv-511-FtM-38, 2015 U.S. Dist. LEXIS 62022, at *5–7 (M.D. Fla. May 12, 2015); Clever Covers, Inc. v. Sw. Fla. Storm Def., LLC, 554 F. Supp. 2d 1303, 1313 (M.D. Fla. 2008); Countryman Nev., LLC v. Adams, No. 14-cv-491-Orl-18, 2015 U.S. Dist. LEXIS 16612, at *18–20 (M.D. Fla. Jan. 16, 2015). The bottom line is that the courts’ treatment of the punitive rationale is unpredictable or otherwise inconsistent.

A third justification that courts occasionally marshal in support of statutory damages is a more intuitive one: deterrence. Typically paired with the punitive rationale, the need to deter future infringements figures prominently in many cases discussing statutory damages,128See, e.g., Warner Bros. Ent., Inc. v. Carsagno, No. 06-CV-2676, 2007 U.S. Dist. LEXIS 104335, at *4 (E.D.N.Y. May 9, 2007) (report and recommendation of magistrate judge) (“Within these parameters, courts have broad discretion in setting an amount of statutory damages that effectuates the ‘dual purposes of the Copyright Act—compensation of copyright owners and deterrence of potential infringers.’ ” (citation omitted)), adopted by No. 06-CV-2676, 2007 U.S. Dist. LEXIS 40293 (E.D.N.Y. June 4, 2007); Manno v. Tenn. Prod. Ctr., Inc., 657 F. Supp. 2d 425, 433 (S.D.N.Y. 2009) (explaining that, in setting a statutory award, courts consider “the deterrent effect on others besides the defendant” (citation omitted)); Lowry’s Reps., Inc. v. Legg Mason, Inc., 302 F. Supp. 2d 455, 461 (D. Md. 2004) (“Statutory damages have a deterrent component.”); Broad. Music, Inc. v. George Moore Enters., Inc., 184 F. Supp. 3d 166, 171 (W.D. Pa. 2016) (emphasizing that statutory damages are meant to “deter future infringements by punishing the defendant for its actions” (citation omitted)); EMI Mills Music, Inc. v. Empress Hotel, Inc., 470 F. Supp. 2d 67, 75 (D.P.R. 2006) (“[C]ourts should formulate a damage award that will achieve the deterrent purposes served by the statutory damages provision.”). as well as jury instructions.129See, e.g., Comm. on Pattern Civil Jury Instructions of the Seventh Cir., Federal Civil Jury Instructions of the Seventh Circuit § 12.8.4 (2017) (listing “deterrence of future infringement” as a factor relevant to determining the appropriate amount of statutory damages); see also Ninth Cir. Jury Instruction Comm., Manual of Model Civil Jury Instructions for the District Courts of the Ninth Circuit § 17.35 (2021). The Manual of Model Jury Instructions for the Ninth Circuit notes that “statutory damages serve both compensatory and punitive purposes,” and then cites favorably to the Supreme Court’s decision in F.W. Woolworth Co. v. Contemporary Arts, which expressly endorsed the idea of discouraging infringement as a justification for statutory damages. Id. at § 17.35 cmt. (citing F.W. Woolworth Co. v. Contemporary Arts, Inc., 334 U.S. 228, 233 (1952)). The corollary is that statutory damages, justified by the need for deterrence, should reflect a supracompensatory penalty.130Bracha & Syed, supra note 14, at 1236.

Yet doubts abound here as well. First, the idea of deterrence, if taken to reflect a broad justification for statutory damages, tends to fit poorly with our statutory scheme. It is not clear why we would think deterrence appropriate in cases involving innocent infringers. When a case implicates accidental infringement—for example, a defendant who had no reason to believe that their acts were infringing—the concept of deterrence seems irrelevant. As Bracha and Syed ask, “Why would we want to deter people from engaging in reasonable behavior?”131Id. at 1237.

And even in cases of willful infringement, it is doubtful whether the deterrence rationale is necessarily appropriate. If statutory damages are truly about deterring individuals from engaging in wrongful conduct, it should not matter what their mental state is, be it willfulness, reckless disregard, or even malice. If we aim to discourage infringements, it should not matter whether the infringer held a particular mental state.

Second, courts and commentators rarely offer anything more than a bare-bones explanation for why one might think deterrence is even warranted.132See, e.g., Broad. Music, Inc., 184 F. Supp. 3d at 171. Is there any evidence to support the proposition that purely compensatory penalties cannot deter would-be infringers? What is the appropriate level of deterrence? And why should we want to achieve a level of deterrence that goes beyond what is already produced by compensatory damages? As Depoorter has explained, the most colorable justification for supracompensatory damages—and one that courts regularly skirt—is that there is a strong need for deterrence given the substantial costs of detecting and enforcing infringement.133See Depoorter, supra note 15, at 435–36 (observing that courts ignore the economic rationale of supracompensatory damages). Drawing on Depoorter’s articulation of the deterrence theory, Bracha and Syed have offered a more fully synthetized account of the optimal deterrence justification. Bracha & Syed, supra note 14, at 1238–40. The optimal level of deterrence, then, would depend on enforcement costs.

Third, as with the other justifications sketched above, the jurisprudence governing statutory damages has been unpredictable. Courts tend to vacillate between the standard justifications for statutory damages. While some adopt both punitive and deterrence-based rationales,134See cases cited supra note 125. others are perfectly content to focus exclusively on compensatory impulses.135See cases cited supra note 127.

In sum, all three justifications for statutory damages are under siege from different directions. The caselaw is muddled and the statute is underspecified. Courts have struggled to articulate a proper justification for statutory damages and have largely failed to clarify why one justification is preferable to another. The jurisprudence thus boils down to a hodgepodge of conflicting and alternating views on what might qualify as a valid rationale for copyright’s robust scheme of statutory damages.

B.  Predictability and Consistency

This much is clear: the caselaw is not a model of clarity. Critics have long criticized the law of statutory damages as indeterminate or otherwise arbitrary.136See Samuelson & Wheatland, supra note 3, at 480–91 (describing a host of pathologies that render the caselaw inconsistent and unpredictable); Shisha, supra note 65, at 1763 (explaining that “critics have long argued that the caselaw surrounding statutory damages is inconsistent, murky, and sometimes arbitrary”); Bracha & Syed, supra note 14, at 1249 (lamenting the “discretionary, inconsistent, and largely arbitrary awards” that pervade the caselaw). The problem is that courts appear to have only a rudimentary understanding of copyright’s remedial scheme, and they rarely attempt to apply the law in a manner that is consistent and predictable.

Begin with one of the central challenges in assessing damages: the use of multipliers. In calibrating statutory awards, courts and juries often use a simple, two-step formula. First, they identify the plaintiff’s actual loss—say, the licensing fee the plaintiff would have charged if their rights had not been infringed. Second, courts apply a multiplier to the amount identified in the first step. For example, if the lost fee was $5,000 and the court elected to use a multiplier of three, the final award would be $15,000 ($5,000 x 3).

Yet courts have done little to generate any sense of consistency in the application of multipliers. Cases among and within circuits reveal large discrepancies. For example, in R.A. Guthrie Co., Inc. v. Boparai, the court rejected the plaintiff’s argument that a multiplier of five would be appropriate.137R.A. Guthrie Co., Inc. v. Boparai, No. 4:18-cv-080, 2021 U.S. Dist. LEXIS 61507, at *33–36 (E.D. Tex. Mar. 1, 2021) (report and recommendation of magistrate judge), adopted by No. 4:18-cv-080, 2021 U.S. Dist. LEXIS 61506 (E.D. Tex. Mar. 25, 2021). Instead, the court identified a standard multiplier of two to three,138Id. at *36–37. and then decided to apply a larger multiplier of four to account for the defendant’s willful infringement.139Id. at *43. The court’s determination was based on its “sense of justice.”140Id. The final award totaled $240,000.141Id.

In another case, Strober v. Harris, a district court in Florida applied a multiplier of six.142Strober v. Harris, No. 8:20-cv-2663, 2021 U.S. Dist. LEXIS 256001, at *7–8 (M.D. Fla. Nov. 23, 2021). Notably, the multiplier here consisted of two elements: (1) a multiplier of three, designed to account for the defendant’s willful infringement; and (2) a “scarcity multiplier” of two, meant to account for the “unique” and “rare” attributes of the plaintiff’s work.143Id. (citations omitted). After assessing the lost licensing fee at $5,000, the court employed a multiplier of six to arrive at a total award of $30,000 in statutory damages.144Id. at *8.

Similarly, in Sadowski v. Primera Plana NY, the court invoked a more modest multiplier of three, explaining that the plaintiff did not assert that he ever notified the defendant of infringement.145Sadowski v. Primera Plana NY, Inc., No. 18-CV-10072, 2019 U.S. Dist. LEXIS 179897, at *5–6 (S.D.N.Y. Oct. 16, 2019) (report and recommendation of magistrate judge), adopted by No. 18 Civ. 10072, 2021 U.S. Dist. LEXIS 239122 (S.D.N.Y. Dec. 14, 2021). Other courts have applied multipliers of two, five, and seven.146Korzeniewski v. Sapa Pho Vietnamese Rest. Inc., No. 17-CV-5721, 2019 U.S. Dist. LEXIS 1901, at *20 (E.D.N.Y. Jan. 3, 2019) (report and recommendation of magistrate judge) (recommending a multiplier of five), adopted by No. 17-CV-05721, 2019 U.S. Dist. LEXIS 10949 (E.D.N.Y. Jan. 23, 2019); Lauratex Textile Corp. v. Allton Knitting Mills, Inc., 519 F. Supp. 730, 733 (S.D.N.Y. 1981) (applying a multiplier of seven times the actual damages for willful infringement). Of course, the practical difference between a multiplier of two and a multiplier of eight could be dramatic, depending on the plaintiff’s actual loss and the number of works at play.

But there is far more nuance to the caselaw. Some courts have suggested that double-digit multipliers—that is, multipliers of ten and above—are grossly excessive and potentially impermissible. Consider Affordable Aerial Photography, Inc. v. Palm Beach Real Estate, in which the court rejected the plaintiff’s request that a multiplier of thirty-six be used for each of its three infringed works.147Affordable Aerial Photography, Inc. v. Palm Beach Real Est., Inc., No. 20-81307-CIV, 2021 U.S. Dist. LEXIS 125999, at *2 (S.D. Fla. July 7, 2021). The plaintiff argued for an award of $108,000 in statutory damages, based on an assessment of $1,000 in lost licensing fees for each of the plaintiff’s three works, multiplied by thirty-six ($1,000 x 3 x 36 = $108,000).148Id. After rejecting the proposed multiplier, the court suggested that even a more modest multiplier of 14.4 would “create a windfall for the [p]laintiff.”149Id. at *9.

Or take the case of Markos v. Yacht Charters of Miami.com.150Markos v. Yacht Charters of Miami.com, LLC, No. 19-22284-CV, 2019 U.S. Dist. LEXIS 172148, at * 6–7 (S.D. Fla. Oct. 2, 2019) (report and recommendation of magistrate judge), adopted by No. 19-22284-CIV, 2019 U.S. Dist. LEXIS 231480 (S.D. Fla. Oct. 23, 2019). The plaintiff asked for three separate multipliers to be applied: a scarcity multiplier of three, a “quality . . . factor” multiplier of four, and a “willful [infringer]” multiplier of five, thereby settling on an aggregate multiplier of sixty.151Id. The court was quick to reject the requested multiplier as excessive, opting instead for a multiplier of three.152Id. at * 7–8. Likewise, the court in Stockfood America, Inc. v. Fernando Arcay Special Events Corp. reasoned that a multiplier of twenty was excessive and thus inappropriate.153Stockfood Am., Inc. v. Fernando Arcay Special Events Corp., No. 19-22286-CIV, 2019 U.S. Dist. LEXIS 233056, at *6 (S.D. Fla. Dec. 31, 2019) (report and recommendation of magistrate judge), adopted by No. 19-22286-CIV, 2020 U.S. Dist. LEXIS 153440 (S.D. Fla. Jan. 21, 2020). And a court in another case, Schwabel v. HPT Service, LLC, dismissed a multiplier of fifteen and instead applied a multiplier of three.154Schwabel v. HPT Serv., LLC, No. 3:17-cv-791-J-34, 2018 U.S. Dist. LEXIS 171820, at *8–10 (M.D. Fla. Sept. 6, 2018) (report and recommendation of magistrate judge), adopted by No. 3:17-cv-791-J-34, 2018 U.S. Dist. LEXIS 170804 (M.D. Fla. Oct. 3, 2018).

However, in a long line of cases, courts and juries have applied far larger multipliers—sometimes thousands of times the amount of actual damages. Think, for instance, of a case referenced earlier, Capitol Records, Inc. v. Thomas, in which a record label sued a defendant for illegally downloading twenty-four songs.155Capitol Records, Inc. v. Thomas, 579 F. Supp. 2d 1210, 1212–13 (D. Minn. 2008), vacated sub nom. Capitol Records, Inc. v. Thomas-Rasset 692 F.3d 899 (8th Cir. 2012). The jury ultimately awarded the plaintiff $9,250 per infringed song for a total award of $220,000.156Id. at 1213. This award was particularly jarring given the fact that, as the court emphasized, the plaintiff’s actual loss amounted to $50.157Id. at 1227. The resulting award, therefore, was roughly 4,000 times larger than the sum of the plaintiff’s actual damages.

In another notable case, Zomba Enterprises, Inc. v. Panorama Records, Inc., the court awarded the plaintiff $31,000 per infringed work for an aggregate award of $806,000.158Zomba Enters., Inc. v. Panorama Recs., Inc., 491 F.3d 574, 580 (6th Cir. 2007). The defendant argued that the award was grossly excessive—it was thirty-seven times the plaintiff’s actual damages.159Id. at 586 (arguing that “such a high award of statutory damages . . . renders the district court’s award an ‘excessive fine’ under the Eighth Amendment”). On appeal, the Sixth Circuit upheld the award.160Id. at 586–88. Other courts have variously used multipliers of 10, 20, and 300.161See, e.g., Palmer v. Slaughter, No. 99-899, 2000 U.S. Dist. LEXIS 22118, at *14 (D. Del. July 13, 2000) (applying a multiplier of ten); Wild v. Peterson, No. 2:15-cv-2602, 2016 U.S. Dist. LEXIS 92423, at *11 (E.D. Cal. July 15, 2016) (applying a multiplier of twenty); Samuelson & Wheatland, supra note 3, at 487–88 (discussing Lowry’s Reports, Inc. v. Legg Mason, Inc., 302 F. Supp. 2d 455, 455 (D. Md. 2004), in which the “ratio of punishment to actual harm exceeded 300:1”).

Moreover, as Samuelson and Wheatland explain, inconsistent awards in factually similar cases are “easy to find.”162Samuelson & Wheatland, supra note 3, at 485. Often, a series of cases brought by the same plaintiff (such as a record label) in connection with a set of nearly identical infringing acts can lead to wildly divergent awards.163Id. 485–86. And in many cases—especially ones involving excessive awards rooted in double-digit multipliers—the size of the final award seems inexplicable.164Id. at 480 (describing a number of high-profile cases where “reprehensibility was low because evidence of willfulness was weak, none of the defendants were the kind of egregious or repeat infringer for which the enhanced statutory damage awards were intended, and the ratio of punitive to actual damages was exceptionally high.” (footnote omitted)). Equally troubling is the fact that cases dealing with similar categories of infringing acts sometimes yield markedly inconsistent awards.165Id. at 486–87.

And broader issues persist. Looking at the caselaw, one gets the distinct sense that courts do not have a firm grasp of copyright’s remedial scheme. A few different problems intersect. First, and perhaps most perplexingly, judges sometimes jump straight to the statutory maximum, even when the defendant was not especially culpable and the plaintiff suffered little to no harm.166Id. at 481 (discussing Macklin v. Mueck, No. 00-14092-CIV, 2005 U.S. Dist. LEXIS 18027, at *1–2 (S.D. Fla. Mar. 10, 2005), aff’d, 194 F. App’x 712 (11th Cir. 2006)). Second, and relatedly, Samuelson and Wheatland find that courts often treat the statutory maximum as a starting point and then work backwards from there.167Id. at 483–84. And in one of the cases discussed before, Zomba Enterprises, the court appears to have erroneously assumed that, if the defendant is found to be a willful infringer, the court must award above $30,000 in damages—which would explain why the court issued an oddly specific and otherwise atypical award of precisely $31,000 per work.168Id. at 484.

The picture, in other words, is a profoundly bleak one. Courts dole out inconsistent awards in factually similar cases. They cannot quite agree on the appropriate multiplier range. They do not understand the law’s remedial structure. And, as the previous Section showed, they disagree over the proper justification for statutory damages.

C.  Excessive Awards

The law is not just inconsistent but can also produce outlandish statutory awards that seem entirely divorced from any conception of actual harm or culpability. Partly at fault for this state of affairs is the per-work structure of our remedial system. Even a reasonable award can become excessive thanks to the aggregation requirement: courts and juries are simply compelled to multiply the award by the number of infringed works.

A few examples might prove instructive. Consider again a case discussed throughout this Article, Capitol Records, Inc. v. Thomas.169Capitol Recs., Inc. v. Thomas, 579 F. Supp. 2d 1210 (D. Minn. 2008), vacated sub nom. Capitol Recs., Inc. v. Thomas-Rasset 692 F.3d 899 (8th Cir. 2012). The final award in that case was 4,000 times the sum of the plaintiff’s actual damages.170See supra notes 155–57 and accompanying text. The jury assessed statutory damages of $9,250 per work, although the trial judge observed that the plaintiff’s actual damages—for the 24 works combined—amounted to about $50.171Thomas, 579 F. Supp. 2d at 1213, 1227.

It bears emphasizing, though, that even at the statutory minimum of $750 per work, the total award in Capitol Records, Inc. v. Thomas would have been excessive. Because the case involved 24 infringed works, and because statutory damages are awarded on a per-work basis, the jury would have to issue a minimum aggregate award of no less than $18,000 ($750 x 24).172If the defendant was found to be an innocent infringer, the amount could be reduced further to $200. 17 U.S.C. § 504(c)(2). But it is worth noting that, again, even an award of $200 per work would be excessive as compared to the plaintiff’s actual harm. Because 24 works were at issue, a per-work award of $200 would have resulted in a total award of $4,800, a sum that is 96 times the plaintiff’s actual damages. But recall again that the actual harm suffered by the plaintiff in that case was roughly $50. So even under the statutory minimum, the aggregate award would have been 360 times the plaintiff’s actual damages—an appallingly excessive award by any measure.

Now consider the procedural history of Capitol Records, Inc. v. Thomas. After the jury returned an award of $220,000 against the defendant, the trial judge vacated the judgment and ordered a retrial on the basis of a jury instruction error.173Thomas, 579 F. Supp. 2d at 1226–27. In doing so, the judge also cautioned that Congress did not intend for large statutory awards to be “applied to a party who did not infringe in search of commercial gain.”174Id. at 1227. Remarkably, however, the second trial turned out to be even more controversial: the jury increased the statutory award to $80,000 per work, for a final award of $1.92 million.175Capitol Recs., Inc. v. Thomas-Rasset, 680 F. Supp. 2d 1045, 1050 (D. Minn. 2010), vacated, 692 F.3d 899 (8th Cir. 2012). Following the second trial, the defendant filed a post-trial motion requesting, among other things, that the court reduce the jury award either by way of remittitur or under the Due Process Clause.176Id. at 1049. The court agreed. It lambasted the second award as “simply shocking” and remitted it to a sum of $2,250 per work, for a total award of $54,000.177Id. at 1054–55. As the court stressed,

despite the . . . justifications [for increased damages] and the Court’s deference to the jury’s verdict, $2 million for stealing 24 songs for personal use is simply shocking. No matter how unremorseful Thomas-Rasset may be, assessing a $2 million award against an individual consumer for use of Kazaa is unjust. Even Plaintiffs admit that Thomas-Rasset is unlikely to ever be able to pay such an award.178Id. at 1054.

The record companies then exercised their right to seek a new trial on the question of damages.179Capitol Recs., Inc. v. Thomas-Rasset, 799 F. Supp. 2d 999, 1003 (D. Minn. 2011), vacated, 692 F.3d 899 (8th Cir. 2012). The district court held a third trial, and the jury returned an award of $62,500 per work, yielding a total award of $1.5 million.180Id. Once again, the defendant moved to amend the judgment.181Id. The district court again granted the defendant’s motion and reduced the award to $2,250 per work, leading to a final award of $54,000.182Id. at 1012 (“The Court concludes that a statutory damages award of $2,250—3 times the statutory minimum—per sound recording infringed is the maximum permitted under the due process analysis. As the Court explained . . . there is a broad legal practice of establishing a treble award as the upper limit permitted to address willful or particularly damaging behavior.”). The record companies appealed, and the Eighth Circuit reinstated the first award of $222,000.183Capitol Recs., Inc. v. Thomas-Rasset, 692 F.3d 899, 906 (8th Cir. 2012).

What makes Capitol Records, Inc. v. Thomas particularly egregious is that the case appears to pit a poorly resourced defendant against a group of deep-pocketed plaintiffs that suffered little material harm. But the issues here were further compounded by the aggregation requirement. Since the defendant technically infringed the plaintiffs’ rights in twenty-four individual works, even a minimum statutory award would have been grossly excessive. Given copyright’s per-work structure, the prospect of crippling damages was virtually inescapable.

Another example is Columbia Pictures Television v. Krypton Broadcasting of Birmingham, Inc.184Columbia Pictures Television, Inc. v. Krypton Broad. of Birmingham, Inc., 106 F.3d 284 (9th Cir. 1997), rev’d sub nom. Feltner v. Columbia Pictures Television, Inc., 523 U.S. 340 (1998), remanded sub nom. Columbia Pictures Television, Inc. v. Krypton Broad. of Birmingham, Inc., 259 F.3d 1186 (9th Cir. 2001) (affirming the district court’s summary judgment in favor of the plaintiff in a case involving the unauthorized broadcasting of various television shows). Columbia Pictures sued the owner of several television stations for broadcasting four copyrighted television programs without a license.185Id. at 288–89. The court found for Columbia and entered an award of $20,000 per episode.186Id. at 292. Because the infringed shows consisted of 440 episodes, the court issued a total award of $8.8 million.187Id. at 288. After the Ninth Circuit affirmed the judgment, the Supreme Court reversed and remanded for failure to grant the defendant a jury trial on the issue of damages.188Feltner v. Columbia Pictures Television, Inc., 523 U.S. 340, 340, 355 (1998). After a jury trial was convened, the jury returned a staggering award of $72,000 per infringed episode, for a total award of $31.68 million.189Columbia Pictures Television, Inc. v. Krypton Broad. of Birmingham, Inc., 259 F.3d 1186, 1195 (9th Cir. 2001). Here, too, aggregation proved fatal. Because each episode constituted a separate work, the per-work award had to be multiplied by 440, leading to a final award in excess of $30 million.190Id.

The number of infringed works is often a matter of bitter contestation for that reason precisely. Take, for example, Yellow Pages Photos, Inc. v. Ziplocal, LP.191Yellow Pages Photos, Inc. v. Ziplocal, LP, 795 F.3d 1255 (11th Cir. 2015). Yellow Pages Photos licensed stock photos for the yellow pages industry.192Id. at 1260. It owned a library of thousands of photos that were sorted into themed collections.193Id. Yellow Pages Photos brought action against two companies for infringing its rights in 178 collections that contained 10,411 photos.194Id. at 1262–63. The jury returned an award of $123,000 in statutory damages against one of the defendants.195Id. at 1263. On appeal, the plaintiff argued that the court erred in treating each of its 178 collections, rather than each of its 10,411 photos, as a separate work.196Id. at 1276. The relevant benchmark, the plaintiff asserted, was the number of individual photos, not the number of collections.197Id. As the Eighth Circuit explained, “[b]y taking the position that its 10,411 individual photos are each separate works, [the plaintiff] presumably [sought] to raise the statutory damages award in this case from $123,000 to a minimum of $1.5 million and a potential maximum of $300 million.”198Id. Ultimately, the court rejected the plaintiff’s claims, holding that each collection was an individual work under the Copyright Act.199Id. at 1277–79; see also 17 U.S.C. § 504(c)(1) (mandating that “all the parts of a compilation or derivative work constitute one work”).

Still, this case offers a neat illustration of the broader issues. Although the plaintiff’s claims were eventually rejected, the critical point is that the aggregation requirement presented a serious threat. The scope of the defendants’ potential exposure was startling: depending on how the court might have chosen to count the number of implicated works, the defendants could have faced an award ranging anywhere from $123,000 up to $300 million. Indeed, as this brief discussion demonstrates, our per-work system is broken. Statutory damages can add up quickly and dramatically in cases involving multiple infringed works. And that should give us pause.

D.  The Copyright Wasteland

I have argued elsewhere that per-work statutory damages have another profound yet overlooked consequence: they discourage courts from properly developing copyright law.200See generally Shisha, supra note 65 (arguing that certain copyright entitlements remain doctrinally underdeveloped in part because of copyright’s per-work scheme of statutory damages). Modern copyright law, to be sure, is underdeveloped. Typically, courts focus on one exclusive right—the right to reproduce the copyrighted work—while glossing over all other copyright entitlements.201Id. at 1724. As a result, we do not have a clear sense of what some exclusive rights mean or how they might be applied. Courts have not really grappled with many questions central to the scope of these rights.202Id. at 1732–55.

One example is the exclusive right to distribute the copyrighted work.20317 U.S.C § 106(3) (granting copyright owners the exclusive right “to distribute copies or phonorecords of the copyrighted work to the public by sale or other transfer of ownership, or by rental, lease, or lending”). It is an open question whether the distribution right extends only to “actual dissemination,” namely, to circumstances where copies of the work were offered to and received by members of the public.204Shisha, supra note 65, at 1746–55; Peter S. Menell, In Search of Copyright’s Lost Ark: Interpreting the Right to Distribute in the Internet Age, J. Copyright Soc’y U.S.A., Fall 2011, at 1, 1–2 (2011). Thus, in cases centering on claims of online distribution, it is unclear whether the plaintiff must show that copies of the work were not only uploaded online but also downloaded by third parties. District courts have been somewhat divided on the issue, although most courts now seem increasingly more hospitable to the view that distribution does, in fact, require proof of actual dissemination.205Some courts have relied on the so-called “making available” theory to hold that a third party may infringe the distribution right by merely making the work available to others—say, by uploading the work to a publicly accessible website. See, e.g., Universal City Studios Prods. LLLP v. Bigwood, 441 F. Supp. 2d 185, 190 (D. Me. 2006) (using file-sharing software to upload contents online constitutes unauthorized distribution); Hotaling v. Church of Jesus Christ of Latter-Day Saints, 118 F.3d 199, 203 (4th Cir. 1997) (placing a copy of a work in a publicly accessible index could constitute distribution); UMG Recordings, Inc. v. Hummer Winblad Venture Partners (In re Napster, Inc. Copyright Litig.), 377 F. Supp. 2d 796, 805 (N.D. Cal. 2005) (holding that the distribution right is infringed whenever the defendant merely “offer[s]” to distribute copies of the work); Atl. Recording Corp. v. Anderson, No. H-06-3578, 2008 U.S. Dist. LEXIS 53654, at *20 (S.D. Tex. Mar. 12, 2008) (placing copyrighted works into a shared folder constitutes distribution); Arista Recs. LLC v. Greubel, 453 F. Supp. 2d 961, 969–71 (N.D. Tex. 2006) (holding that it is a violation of the distribution right to make copyrighted works available to others); Malibu Media, LLC v. Dhandapani, No. 3:19-cv-01300-M, 2020 U.S. Dist. LEXIS 194794, at *7 (N.D. Tex. Feb. 12, 2020) (“[D]istribution may also be accomplished through the publication of a copyrighted work that makes it available for others to copy.”).

Other courts, by contrast, reject the making available theory as legally implausible and instead hold that distribution requires actual dissemination. See, e.g., Atlantic Recording Corp. v. Brennan, 534 F. Supp. 2d 278, 282 (D. Conn. 2008) (emphasizing that a violation of the distribution right requires “actual distribution of copies” (citation omitted)); Atlantic Recording Corp. v. Howell, 554 F. Supp. 2d 976, 981 (D. Ariz. 2008) (stating that distribution requires “actual dissemination of either copies of phonorecords.” (citation omitted)); BMG Rights Mgmt. (US) LLC v. Cox Commc’ns, Inc., 149 F. Supp. 3d 634, 670 (E.D. Va. 2015), aff’d in part, rev’d in part, 881 F.3d 293 (4th Cir. 2018); SA Music, LLC v. Amazon.com, Inc., No. 2:20-CV-0579, 2021 U.S. Dist. LEXIS 13489, at *6 (W.D. Wash. Jan. 25, 2021) (stating that distribution requires “the transfer (or download) of a file containing the copyrighted work from one computer to another”); EVOX Prods., LLC, v. Verizon Media Inc., No. CV 20-2852, 2021 U.S. Dist. LEXIS 151460, at *6 (C.D. Cal. May 5, 2021) (holding that the “making available” theory “fails as a matter of law”); Grecco v. Age Fotostock Am., Inc., No. 21-cv-423, 2021 U.S. Dist. LEXIS 192021, at *13 (S.D.N.Y. Oct. 5, 2021) (holding that “an unconsummated offer to distribute does not give rise to liability under Section 106(3)”); Annabooks, LLC v. Issuu, Inc., No. 20-cv-04271, 2020 U.S. Dist. LEXIS 221963, at *10 (N.D. Cal. Sept. 24, 2020) (noting that infringement of the distribution right requires actual dissemination).
But, stunningly, circuit courts have not weighed in on the question.206U.S. Copyright Office, The Making Available Right in the United States: A Report of the Register of Copyrights 22 (2016), https://www.copyright.gov/docs/making_available /making-available-right.pdf [https://perma.cc/ LT3K-AEK7] (noting that “[t]o date, neither the U.S. Supreme Court nor any of the circuit courts has had occasion to directly rule on [the question of actual dissemination]”). Nor has the Supreme Court.207Id.

Why is that the case? The answer is that courts never really have to address the issue. Virtually every copyright infringement case involves claims of unauthorized reproduction.208Shisha, supra note 65, at 1767–69. And once courts establish that the right of reproduction has been infringed, all other rights turn out to be inconsequential. It does not matter how many separate exclusive rights were infringed; a single infringement suffices. That is because statutory damages attach to infringed works rather than infringed rights. So, it is irrelevant whether there is one right at issue or a few separate ones—the result, in terms of the final statutory award, will be the same.209Take, for example, Capitol Recs., Inc. v. Thomas-Rasset, 692 F.3d 899 (8th Cir. 2012). In Thomas-Rasset, the Eighth Circuit declined to tackle the issue of distribution. Id. at 902. In explaining its decision, the court stressed that the question of distribution was simply “unnecessary for the remedies sought or to a freestanding decision on whether [the defendant] violated the law.” Id. Why? Because the defendant was already held liable for making infringing copies of the plaintiffs’ works. Id. There was a clear infringement of the reproduction right, so the court had little incentive to reach the question of distribution. The court therefore refused to address a legal question that was irrelevant to the outcome of the case. As the court explained, it was immaterial in terms of the “remedies sought” whether any other rights, beyond reproduction, had been infringed—the size of the statutory award would have remained the same. Id. The only thing that matters is the number of infringed works, not infringed rights.21017 U.S.C. § 504(c)(1) (providing that a plaintiff may choose to recover “an award of statutory damages for all infringements involved in the action, with respect to any one work”); Nimmer & Nimmer, supra note 41, § 14.04(E)(1)(a) (“[Where a suit involves multiple infringed works,] statutory damages for each work must be awarded.” (citing H.R. Rep. No. 94-1476, at 162 (1976))).

In that sense, there is little incentive for courts to properly address copyright’s full suite of exclusive rights. Instead, courts tend to fixate on one exclusive entitlement—the right of reproduction—to the exclusion of all others. And this pathology, again, is an artifact of copyright’s per-work scheme of statutory damages.

III.  TOWARD STRUCTURAL REFORM

The discussion thus far offers a grim perspective. It paints a picture of a system run amok. It demonstrates that copyright’s scheme of statutory damages is troublesome for any number of reasons: it rests on ever-shifting and poorly developed justifications; it is rooted in a body of caselaw that is both arbitrary and inconsistent; it can lead to outlandish awards; and it allows courts to systematically shun certain exclusive rights.

Where does this leave us? I offer here a high-level sketch of one possible solution. In simple terms, I suggest that we disentangle the number of statutory awards from the number of infringed works. Instead, I develop a framework for assessing damages by reference to the concept of infringement episodes. An infringement episode, in a nutshell, is a series of related infringing acts that together make up a single, larger event. When courts identify a single infringement episode, they may choose to issue only a single statutory award—no matter how many individual works are in play. The proposed framework would thus introduce a degree of much-needed flexibility into our system. Rather than engage in a mechanical exercise of counting works, courts and juries would enjoy a significant measure of discretion to assess whether the defendant’s actions are sufficiently interconnected to constitute a single, larger episode.

This Part begins by offering a brief overview of the proposed framework. I explain that previous proposals have not gone far enough in confronting the main problem, and then describe in general terms how an approach rooted in the concept of infringement episodes might work. I next turn to demonstrate that a similar approach, based on a holistic assessment of the defendant’s actions, already exists and is widely employed in the fields of criminal law, civil procedure, and immigration law. I close by cashing out my proposed framework and developing a more fine-grained account of how courts might identify infringement episodes.

A.  Overview

To reiterate, statutory damages are awarded on a per-work basis. And this is a problem for two related reasons. First, copyright’s per-work scheme forces courts to aggregate damages in ways that lead to unpredictable—and often excessive—awards. In cases like Yellow Pages Photos,211Yellow Pages Photos, Inc. v. Ziplocal, LP, 795 F.3d 1255, 1276–82 (11th Cir. 2015). for example, conflicting views on how to measure the number of works could potentially produce earth-shattering consequences. Under the defendants’ count, the statutory minimum was $120,000 to $135,000, and under the plaintiff’s, the

statutory award could have been as high as $1.5 billion.212Id. at 1276. The court cited slightly different figures, noting that the statutory maximum under the plaintiff’s preferred count was approximately $300 million. Id. The reason for that lower figure is that the jury rejected the plaintiff’s claim that the defendants engaged in willful infringement, so the statutory maximum in this case was merely $30,000 per work, instead of the increased maximum of $150,000 per work in cases where claims of willful infringement are still on the table. Id. at 1271–72. A divergence of this scope and intensity is indefensible. Second, as discussed above, per-work damages generate a legal equilibrium in which the number of infringed rights, as opposed to infringed works, is of little practical significance.213See supra Section II.D. And that is why courts tend to glaze over certain exclusive rights. The result is an impoverished body of caselaw.214See supra Section II.D.

Per-work damages, then, have worked considerable harm to the copyright system. And copyright scholars, for their part, have suggested a number of doctrinal or legislative reforms to address the issue. Let us first consider what is arguably the most serious attempt to confront the problem of excessive damages in recent years: Depoorter’s multifaceted proposal for reducing the incidence and availability of excessive awards.215See generally Depoorter, supra note 15. Based on a close analysis of docket records and court decisions, Depoorter finds that “a vast majority of plaintiffs accuse defendants of willful copyright infringement.”216Id. at 428. Indeed, “[t]the sheer number of willful infringement claims is remarkable,” especially when one considers that enhanced damages were originally designed to address only exceptional cases.217Id. Nevertheless, Depoorter also finds that, of all cases where the plaintiff won on the merits, courts awarded increased damages for willful infringement in only 2.8% of cases.218Id. at 429. This glaring mismatch between what plaintiffs allege and what courts find suggests that plaintiffs engage in remedy overclaiming.219Id. at 439–40.

Based on these findings, Depoorter puts forward a number of prescriptive proposals. First, he advances a series of proposals meant to make remedy-overclaiming more costly to opportunistic plaintiffs. One way to do so is to impose sanctions on plaintiffs who make inflated or exaggerated damage claims.220Id. at 441. Under the current statute, a plaintiff seeking statutory damages need not justify their damage claims on the basis of actual harm.221Id.

And attorney cost fee-shifting is only available to the prevailing party,22217 U.S.C. § 505 (directing that “the court may also award a reasonable attorney’s fee to the prevailing party as part of the costs” (emphasis added)). so a defendant who lost but faced an outlandish award claim (which was ultimately rejected by the court) has no effective recourse. Depoorter thus recommends that Congress amend the Copyright Act to allow for fee-shifting in cases in which the plaintiff is ultimately found to have made inflated damage claims, even if the plaintiff prevailed in litigation.223Depoorter, supra note 15, at 442.

Second, Depoorter also presses a few proposals designed to make remedy-overclaiming less risky to defendants. One way to achieve this is by developing a set of guidelines and standards to govern the application and assessment of statutory damages.224Id. at 443. A second useful step would be to simply reduce the statutory range—which currently extends from $750 to $150,000 in most cases.225Id. Yet another helpful measure would seek to increase transparency by forcing courts to explain, in detail, the “motivation and calculation involved with every statutory award,” as well as by establishing a central database to collect statutory award judgments.226Id. at 443–44. Depoorter also suggests that Congress eliminate enhanced damages for willful infringement or otherwise limit their availability only to cases of commercial infringement.227Id. at 444–45. Another possibility is to impose a cap on the size of the total award.228Id. at 445. In addition, Congress should amend section 504(c) “so that statutory damages are not available when the defendant has offered credible evidence of its profits and/or the plaintiff’s damages.”229Id. at 446.

Similarly insightful is recent work by Bracha and Syed.230See Bracha & Syed, supra note 14, at 1249–53. Focusing more keenly on the justification for statutory damages, they propose a few ways to think about the role that statutory damages ought to serve in the copyright system. One possible approach is to conceive of statutory damages as serving a compensatory role.231Id. at 1250. On this view, courts should strive to award statutory damages that approximate actual harm.232Id. Alternatively, courts can frame statutory damages as a means of achieving optimal deterrence.233Id. at 1250–52. But optimal deterrence, at least on its face, would seem to justify imposing very high awards on a small number of defendants in an effort to deter future infringement.234Id. at 1238–40. Thus, an “optimal deterrence” approach would only be defensible to the extent that it is subject to “judicial safeguards against the incurrence of serious inequitable harms.”235Id. at 1251.

Two other reform proposals merit attention. One is a study by Pamela Samuelson and Tara Wheatland.236Samuelson & Wheatland, supra note 3, at 502–04. Drawing on a comprehensive survey of the history and doctrine of statutory damages, Samuelson and Wheatland make two central claims: first, that statutory damages should ordinarily approximate actual damages; and, second, that enhanced damages should be available only in cases involving the sort of willful infringement Congress had originally contemplated—that is, patently outrageous cases implicating mass-infringement or repeat infringers.237Id.

Another proposal, introduced by Garfield, focuses on three different approaches to calibrating statutory damages: interpretive, statutory, and constitutional.238Garfield, supra note 18, at 37–53. On the interpretive front, Garfield embraces the set of proposals articulated by Samuelson and Wheatland—proposals that would compel courts to approximate actual damages in most cases and reserve enhanced damages for nakedly egregious cases.239Id. at 38–39. But Garfield also notes that such an interpretive fix would not necessarily “translate[] into clear signals that potential users . . . can identify and confidently trust.”240Id. at 39. Garfield thus reasons that additional measures are necessary. On the legislative front, Garfield proposes an amendment of section 504 to (1) clarify that statutory damages should be primarily compensatory, (2) limit enhanced damages to exceptional cases, (3) empower courts to adjust the total award below the statutory minimum even in cases involving multiple works, and (4) give courts discretion to reduce the statutory minimum when a user has a strong innocent infringement claim, even if that user had access to works that bore a valid copyright notice.241Id. at 43. Finally, on the constitutional front, Garfield recommends that courts reinvigorate and harness both Due Process and free-speech limits on statutory damages.242Id. at 46–53.

There is much to admire in these proposals. To be sure, copyright scholars have devoted a great deal of energy to crafting proposals that would limit the scope of statutory damages. But none of these proposals go nearly far enough: they all stop short of severing the link between statutory damages and the number of infringed works. To meet the challenge of the moment, I suggest a legislative reform that would do away with copyright’s per-work remedies altogether. A better framework, I argue, would afford courts a significant degree of discretion to look beyond the number of implicated works and instead evaluate whether the defendant’s conduct gave rise to, and was grounded in, a single infringement episode. By infringement episode, I mean a chain of related infringing acts that together constitute a larger factual event. When the defendant’s conduct is attributable to a single infringement episode, courts should issue only a single statutory award, even if multiple infringed works are at stake.

To illustrate, consider Sony BMG Music Entertainment v. Tenenbaum.243Sony BMG Music Ent. v. Tenenbaum, 660 F.3d 487 (1st Cir. 2011) (affirming the denial of the defendant’s motion for a new trial or remittitur after the jury returned an award of $675,000 in statutory damages). A group of record companies brought action against the defendant, Joel Tenenbaum, for downloading and distributing thirty songs.244Id. at 490. The jury found that Tenenbaum had willfully infringed the plaintiffs’ rights and returned an award of $22,500 per infringed song for a total award of $675,000.245Id. The defendant here was accused of having downloaded works en masse. But what if the court were able to treat the defendant’s conduct as reflecting a single infringement episode? And what if, because the defendant’s actions were sufficiently intertwined, the court were able to issue only a single statutory award?

There is reason to think a single award would have been appropriate in cases like Tenenbaum or Thomas.246Capitol Recs., Inc. v. Thomas, 579 F. Supp. 2d 1210 (D. Minn. 2008). The value of the copyrighted works in both cases was negligible, and the harm to the plaintiff was marginal.247See, e.g., id. at 1227 (explaining that the defendant “infringed on the copyrights of 24 songs,” which were roughly equivalent to three CDs costing about $54). In both cases, the defendants were private individuals who pursued a noncommercial objective by downloading songs for private consumption. And in both cases, the aggregation of per-work awards proved fatal. After all, a person of average means may never be able to withstand (or fully recover from) a damages award approaching $700,000.

Now, to be clear, courts today already consider some of these factors in calibrating the size of the per-work award.248See text accompanying supra notes 78–79 (discussing the various factors that courts consider in calculating statutory damages, including the profits generated by the defendant and the harm suffered by the plaintiff). At times, courts also appear eager to accommodate certain distributive concerns—say, when they fear that a crushing award might drive the defendant out of business. See sources cited infra note 338. Yet the larger problem remains: courts are forced to multiply the award by the number of infringed works.249See supra Section II.C. So even if a court determines that the per-work award should be lower because the infringer was not especially culpable, the total award may nonetheless prove disproportionately large after the per-work award has been multiplied by the number of infringed works. Recall that in Tenenbaum, the per-work award itself ($22,500) was arguably quite modest, amounting to less than a fifth of the statutory maximum.250Sony BMG Music Ent. v. Tenenbaum, 660 F.3d 487, 490 (1st Cir. 2011). But after it was multiplied by thirty, the award grew dramatically.251Id. The rub is that even a modest per-work award can grow significantly due to aggregation.

To address the issue, I suggest that Congress amend the Copyright Act to permit courts to break with this per-work scheme in cases where they find, based on a multifactor inquiry, that the defendant’s acts were attributable to a single infringement episode. Looking for sources of inspiration, the next Section will explore how courts have approached analogous questions in different fields. And the third Section will then offer a more detailed explication of the proposed framework, discussing the different factors that courts might consider in evaluating the defendant’s course of conduct.

B.  Doctrinal Analogues

An assessment of the defendant’s conduct, as explained previously, would have to depend upon an open-textured, case-by-case analysis. But would this system be administrable? Are courts well-positioned to engage in this type of multifactor inquiry? Would the proposed framework spawn lengthy and uncertain litigation?

The answer is no. Across a variety of different legal fields—including criminal law, civil procedure, and immigration law—courts have already developed a rash of somewhat equivalent doctrines aimed at evaluating the defendant’s conduct. These doctrines all share a core commitment: they leverage a flexible, multifactor inquiry to assess whether the defendant’s actions can be lumped together as part of a larger transaction, episode, scheme, or series of occurrences. The following discussion will offer a perfunctory and necessarily incomplete overview of such doctrines. These doctrines, though at times controversial, demonstrate that the proposed framework is both plausible and administrable.

1.  Criminal Law

The defendant’s course of conduct features most notably in the doctrine of double jeopardy. The Double Jeopardy Clause of the Fifth Amendment shields a defendant from being “twice put in jeopardy of life or limb” for the same offense.252U.S. Const. amend. V. The goal of the prohibition against double jeopardy is to protect an individual from “being subjected to the hazards of trial and possible conviction more than once for an alleged offense.”253Green v. United States, 355 U.S. 184, 187 (1957). The Supreme Court has interpreted the Double Jeopardy Clause as protecting against a second prosecution for the same offense after either acquittal or conviction.254North Carolina v. Pearce, 395 U.S. 711, 717 (1969). More broadly, the Double Jeopardy Clause has also been understood to preclude multiple punishments for the same offense.255Id. In other words, the central question under the Double Jeopardy Clause is whether the defendant is being punished or tried twice for the same offense.

How do courts define the term “same offense”? Federal courts have long relied on a decades-old test first developed in Blockburger v. United States256Blockburger v. United States, 284 U.S. 299, 304 (1932). to determine whether the defendant is at risk of being tried twice for the same offense. Under the Blockburger test, when the same act violates two separate statutory provisions, the court is charged with examining “whether each provision requires proof of a fact which the other does not.”257Id. In essence, courts seek to ascertain whether one or two offenses are at issue. This same-elements test examines the statutory elements of each offense to determine whether one of the implicated offenses incorporates an element that the other does not. Accordingly, if a single act or transaction leads to multiple offenses—namely, if one offense requires additional proof beyond what is required by the other—the Double Jeopardy Clause does not apply.

Yet, ultimately, whether one is placed twice in jeopardy for the same offense is not just a matter of federal law. Many states also encode a robust protection against double jeopardy, either through constitutional provisions or through state statutes requiring joinder of certain offenses.258Rebecca A. Delfino, Prohibition on Successive Prosecutions for the Same Offense—In Search of the “Goldilocks Zone”: The California Approach to a National Conundrum, 54 Am. Crim. L. Rev. 423, 424 (2017). The following discussion largely tracks, and is principally based on, a recent study by Rebecca Delfino. See generally Delfino, supra. And the picture here is more intricate. Some states, like Michigan, apply the Blockburger test.259The Michigan Supreme Court adopted the Blockburger test in People v. Nutt, 677 N.W.2d 1, 3, 12–13 (Mich. 2004). Other states, like Louisiana, employ a double-pronged test for defining “same offense”: a variation of the Blockburger same-elements test, coupled with an additional same-evidence test.260See, e.g., State v. Miller, 571 So. 2d 603, 606 (La. 1990) (quoting State v. Knowles, 392 So. 2d 651, 654 (La. 1980)). Under the same-evidence test, courts examine whether the evidence necessary to support the second indictment was also sufficient to support a conviction for the first.261Id.

A number of states, however, reject the Blockburger same-elements test in favor of a more flexible test: the same transaction or course of conduct test. Consider, for instance, New Jersey. As the New Jersey Supreme Court has explained, courts should consider several different factors in assessing whether the defendant’s misconduct is part of a single transaction, including whether the offenses are “based on the same conduct or arose out of the same episode.”262State v. Williams, 799 A.2d 470, 474 (N.J. 2002) (quoting State v. Yoskowitz, 563 A.2d 1, 12 (N.J. 1989)). And New Jersey courts, in turn, have applied this “same episode” test by considering a number of additional factors, including:

[T]he nature of the offenses, the time and place of each offense, whether the evidence supporting one charge is necessary and/or sufficient to sustain conviction under another charge, whether one offense is an integral part of the larger scheme, the intent of the accused, and the consequences of the criminal standards transgressed.263Id. at 475–76 (citing State v. Best, 356 A.2d 385 (N.J. 1976)).

Other states similarly consider whether the defendant’s actions are causally related and whether they were temporally close.264A prime example is Florida. See, e.g., Ellis v. State, 622 So. 2d 991, 1000 (Fla. 1993) (“[T]he crimes . . . must be linked in some significant way. This can include the fact that they occurred during a ‘spree’ interrupted by no significant period of respite . . . or the fact that one crime is causally related to the other, even though there may have been a significant lapse of time.” (citations omitted)). Additionally, a 2017 study found that “[t]hirteen states apply no individual test, but rather multiple factors, to prohibit successive prosecutions.”265See Delfino, supra note 258, at 436. These states use some combination of several existing tests—temporal and spatial tests, “same transaction” tests, evidentiary/same-elements tests, and others—to carry out a more open-ended, multifactor inquiry.266Id. at 436–39.

To conclude, different states diverge radically in their approach to protecting against double jeopardy. Nevertheless, a few relevant factors stand out as particularly relevant. Some factors hinge on the defendant’s conduct: the spatial and temporal proximity of the defendant’s acts; whether they fit into a larger episode or transaction; and the nature of the causal relationship between each of them. A second set of factors look at the legal directives that were violated—do these provisions share the same elements and require the same evidence to establish guilt? A third category of factors lean more broadly on the defendant’s intent and the consequences of the defendant’s conduct.

2.  Civil Procedure

The rule of permissive joinder, codified in Rule 20 of the Federal Rules of Civil Procedure, provides that multiple plaintiffs may join in one action if (1) they assert a right to relief arising out of the “same transaction, occurrence, or series of transactions or occurrences,” and (2) “any question of law or fact common to all plaintiffs will arise in the action.”267Fed. R. Civ. P. 20(a)(1). A similar rule of permissive joinder applies when multiple defendants face actions arising out of a single series of occurrences and involving questions of law and fact common to all defendants.268Fed. R. Civ. P. 20(a)(2). Rule 20 is permissive in that it does not compel joinder even in circumstances that satisfy its requirements.269See Mary Kay Kane, 7 Federal Practice and Procedure (Wright & Miller) § 1652 (3d ed. 2022) [hereinafter Wright & Miller] (“Rule 20(a) is permissive in character; joinder in situations falling within the rule’s standard is not required unless it is within the scope of compulsory joinder prescribed by Federal Rule of Civil Procedure 19.”).

As a general matter, the rule of permissive joinder is applied liberally and flexibly. The Ninth Circuit, for instance, has adopted a liberal approach in an effort “to promote trial convenience and to expedite the final determination of disputes, thereby preventing multiple lawsuits.”270League to Save Lake Tahoe v. Tahoe Reg’l Plan. Agency, 558 F.2d 914, 917 (9th Cir. 1977). And the Supreme Court has clarified that “[u]nder the Rules . . . joinder of claims, parties and remedies is strongly encouraged.”271United Mine Workers of Am. v. Gibbs, 383 U.S. 715, 724 (1966). It is also important to note that joinder is a procedural device and does not affect the substantive rights of the parties.272Wright & Miller, supra note 269, § 1653. Hence, a judgment for or against one party need not lead to a similar outcome for another party in a joined action.273Id.

More concretely, the first requirement under Rule 20—that the disputes arise out of the same series of occurrences—is most relevant for our current purposes. This requirement appeals to an open-ended assessment of the parties’ entire course of conduct. But, consistent with the flexible spirit animating Rule 20, courts have largely balked at developing “one generalized test” for identifying a single transaction or occurrence.274Id. Instead, courts leverage a case-by-case approach to examine whether the events at stake are all logically related and thus constitute a single series of occurrences.275See, e.g., Mosley v. Gen. Motors Corp., 497 F.2d 1330, 1333 (8th Cir. 1974) (“In ascertaining whether a particular factual situation constitutes a single transaction or occurrence for purposes of Rule 20, a case-by-case approach is generally pursued.” (citing Wright & Miller, supra note 269, § 1653)); Almont Ambulatory Surgery Ctr., LLC v. UnitedHealth Grp., Inc., 99 F. Supp. 3d 1110, 1187–88 (C.D. Cal. 2015) (noting that “[t]he transaction and common-question requirements . . . are flexible concepts used by the courts to implement the purpose of Rule 20 and therefore are to be read as broadly as possible whenever doing so is likely to promote judicial economy” (citing Wright & Miller, supra note 269, § 1653)). This logical-relationship test defies any rigid formulation and is flexible by design. Moreover, some courts complement the logical-relationship test with a variation of the evidentiary standard that criminal courts employ under the double jeopardy rule. They do so by asking whether different legal actions would lead to “overlapping proof and duplication in testimony.”276Wright & Miller, supra note 269, § 1653. When such overlap is likely, courts are more inclined to find that the claims arise out of the same transaction or occurrence.

Given the flexible nature of the permissive-joinder inquiry, it is hardly surprising that courts sometimes find that events stretching over many years could nonetheless be sufficiently connected to warrant joinder. One example is Burton v. American Cyanamid.277Burton v. Am. Cyanamid, 128 F. Supp. 3d 1095 (E.D. Wis. 2015) (denying the defendant’s motion to dismiss in part because the plaintiff’s claims were sufficiently connected to warrant joinder). In Burton, the plaintiffs alleged harm as a result of ingesting lead as children, arguing that their injuries over the years constituted a single series of occurrences that resulted from the defendants’ negligence in manufacturing, promoting, and selling lead paint.278Id. at 1098–99. The court agreed and ordered joinder.279Id. at 1103–04. In another case, a federal court ordered joinder of more than 400 defendants in an action involving some 400 separate insurance and retirement plans, finding that the claims, while addressing many different plans and defendants, all arose from the same series of transactions or occurrences.280UnitedHealth Grp., Inc., 99 F. Supp. 3d at 1119. The rules of permissive joinder, in short, rest on a flexible standard and are thus applied somewhat liberally.

3.  Immigration

Questions about the defendant’s course of conduct also crop up under the Immigration and Nationality Act.2818 U.S.C. § 1227(a)(2)(A)(ii) (directing that “[a]ny alien who at any time after admission is convicted of two or more crimes involving moral turpitude, not arising out of a single scheme of criminal misconduct, regardless of whether confined therefor and regardless of whether the convictions were in a single trial, is deportable”). The Act renders an alien deportable if, at any point after admission, they are convicted of two or more crimes of moral turpitude not arising out of a “single scheme of criminal misconduct.”282Id. One of the central questions in this context is whether the alien’s convictions could be characterized as arising out of a single scheme of misconduct. While the statute does not define the phrase “single scheme of . . . misconduct,”283Id. some early authorities have suggested that a variety of factors might play into an analysis of the alien’s conduct, including the time and purpose of the crimes, the methods and procedures used, the identity of the participants, and the identity of the victims.284Wood v. Hoy, 266 F.2d 825, 828–33 (9th Cir. 1959). In Wood, the court suggested that two armed robberies committed three days apart could constitute part of a single scheme. Id. Although the court ultimately remanded the case with instructions to fully address the “single scheme” element, it also explained that the two robberies seemed plausibly connected:

[B]oth crimes were committed by the same four persons, in both crimes money was obtained from the victims by means of force and fear, and the two crimes were committed within three days of each other. [In addition,] . . . two or three weeks before the crimes were committed, the four defendants met and at the suggestion of one of them, the four agreed to participate in the two particular armed robberies.

Id. at 831.

Nonetheless, clouds of uncertainty enshroud the single scheme exception. Some circuit court cases, as well as decisions issued by the Board of Immigration Appeals (“B.I.A.”), take a restrictive approach. They do so by suggesting that, to qualify for the “single scheme” exception, the alien’s crimes must be temporally close.285Balogun v. INS, 31 F.3d 8, 8–9 (1st Cir. 1994). Likewise, the First Circuit has stated that a single scheme “must take place at one time; there must be no substantial interruption that would allow the participant to disassociate himself from his enterprise and reflect on what he has done.”286Id. at 8 (citing Pacheco v. INS, 546 F.2d 448, 451 (1st Cir. 1976)). These authorities come close to equating a “single scheme” with a “single act”—a uniform, “temporally integrated episode of continuous activity.”287Id. at 9 (citing Pacheco, 546 F.2d at 451–52).

Other courts, by contrast, find that separate crimes committed under a preconceived plan—even if the two crimes were days apart—could qualify as a single scheme of misconduct.288Wood, 266 F.2d at 831; Gonzalez-Sandoval v. U.S. INS, 910 F.2d 614, 616 (9th Cir. 1990). These courts suggest that two crimes executed in pursuit of a common plan could constitute a single scheme.289Nason v. INS, 394 F.2d 223, 227 (2d Cir. 1968) (“The word ‘scheme’ implies a specific, more or less articulated and coherent plan or program of future action.”). And while this “common plan” approach has been rejected by the B.I.A. and the First, Fourth, Fifth, Seventh, and Tenth Circuits,290Matter of Adetiba, 20 I. & N. Dec. 506, 508–12 (B.I.A. 1992); Balogun, 31 F.3d at 8–9; Akindemowo v. U.S. INS, 61 F.3d 282, 286–87 (4th Cir. 1995); Iredia v. INS, 981 F.2d 847, 849 (5th Cir. 1993); Abdelqadar v. Gonzales, 413 F.3d 668, 674–75 (7th Cir. 2005); Nguyen v. INS, 991 F.2d 621, 623–25 (10th Cir. 1993). it has been adopted to varying degrees by the Second, Third, and Ninth Circuits.291See, e.g., Gonzalez-Sandoval, 910 F.2d at 616; Nason, 394 F.2d at 226–27; Sawkow v. INS, 314 F.2d 34, 37–38 (3d Cir. 1963); Wood, 266 F.2d at 828–33. As a consequence, there remains a great deal of disagreement and confusion as to the scope of the “single scheme” exception.

* * *

In summary, each of the doctrines canvassed above seems to lend some support to the suggested framework. Each seeks to identify the circumstances under which a series of wrongful actions might give rise to a single transaction, episode, or scheme. And each appears to mobilize a case-by-case approach. Courts applying these doctrines often consider both the specific attributes of the defendant’s actions and the consequences associated with the larger scheme.

At the same time, it would be a mistake to overstate the significance of this cross-disciplinary survey. The preceding discussion does not aim to provide a fully exhaustive synthesis of three discrete bodies of law. Rather, it seeks only to highlight a few rough, high-level sources of inspiration that could bear in some loose sense upon the proposed framework.

C.  Application

Copyright’s per-work scheme has been the source of much anguish. It tends to produce grossly excessive awards, and it discourages courts from properly developing modern copyright law. A better framework, I argue, would empower courts to look beyond the number of infringed works and focus instead on the defendant’s entire course of conduct. When a series of separate infringing acts can be traced to a single, larger infringement episode, courts should be able to issue only a single statutory award—regardless of how many works are at issue.

But how might courts evaluate the defendant’s course of conduct? In what follows, I map out a number of relevant factors that courts should consider, including the spatial and temporal nature of the defendant’s actions; whether the evidence supporting one infringement is necessary or sufficient to sustain liability for another; whether the defendant’s actions were executed in pursuit of a common plan or a larger creative project; and the potential consequences of a statutory award.

These factors operate along three interrelated dimensions: conduct/evidence, intent, and consequences. Some factors focus on the evidence required to distinguish between—or bring together—the defendant’s actions. A second subset of factors look to the defendant’s intent in pursuing these actions. A third subset of factors dwell on the larger consequences of the defendant’s actions.

Let us consider a few illustrative examples. Think, for instance, of Yellow Pages Photos.292Yellow Pages Photos, Inc. v. Ziplocal, LP, 795 F.3d 1255 (11th Cir. 2015) (affirming a judgment in favor of the plaintiffs in an action for copyright infringement). A local phone book publisher, Ziplocal, licensed thousands of photos owned by the plaintiff.293Id. at 1260–61. The licensing agreement authorized Ziplocal to distribute the photos to its users but prohibited the company from transferring the photos to outside companies or nonemployees.294Id. at 1261. A few years later, Ziplocal subcontracted with an outside company to assist with editing the photos and producing a phone book.295Id. at 1262. The plaintiff brought action for infringement, and the jury found Ziplocal liable for infringing the plaintiff’s rights in 123 individual works.296Id. at 1262–63.

The court, though, could have reasonably concluded that the 123 infringed works were all implicated in a single scheme—a commercial effort by the defendant to produce a phone book. Perhaps more crucially, it seems as though both parties understood the defendant’s actions to constitute a single episode. The plaintiff, after all, never identified with any degree of specificity the circumstances under which particular works were infringed.297The plaintiff’s complaint does not address the specific circumstances attending any particular collection of photos that was allegedly transferred from Ziplocal to the outside company. See Third Amended Complaint at 4–11, Yellow Pages Photos, Inc. v. Ziplocal, LP, No. 12-cv-755, 2014 U.S. Dist. LEXIS 87772 (M.D. Fla. June 27, 2014), aff’d, 795 F.3d 1255 (11th Cir. 2015). The plaintiff did not bother to break down the broader episode into smaller, concrete actions that the defendant carried out.

This becomes all the more apparent when the case is assessed against the same-elements/same-evidence question: Is the evidence supporting one infringement necessary or sufficient to sustain liability for another? Again, the answer here is yes. The plaintiff never adduced any evidence to separate one infringing act from another. Instead, the defendant’s liability was ultimately based on evidence relating to the broader episode rather than any particular infringing acts. Under the same-evidence test, then, the defendant’s actions could be characterized as arising out of a single infringement episode.

The same is true for some file-sharing cases, such as Tenenbaum.298Sony BMG Music Ent. v. Tenenbaum, 660 F.3d 487, 492–96 (1st Cir. 2011). In Tenenbaum, the defendant was accused of illegally downloading and distributing thirty sound recordings.299Id. at 490. But, again, the plaintiffs here failed to present any evidence to distinguish among discrete actions. Consider the plaintiffs’ pretrial motion, which discusses at some length the infringed works and the plaintiffs’ process for detecting them. As the pretrial motion explains, the plaintiffs engaged an external company to assist in detecting infringement.300Plaintiffs’ Pretrial Motion at 3, Sony BMG Music Ent. v. Tenenbaum, No. 07cv11446, 2009 U.S. Dist. LEXIS 115734 (D. Mass. Dec. 7, 2009), amended in part, 721 F. Supp. 2d 85 (D. Mass. 2010), aff’d in part, vacated in part, rev’d in part, 660 F.3d 487 (1st Cir. 2011). The company was able to identify the defendant by pinpointing the IP address of his computer, and the company was also able to ascertain that the defendant had used a file-sharing software called Kazaa to illegally download sound recordings.301Id. But the plaintiffs’ evidence did not identify when the infringed works were downloaded—it could only establish that, at some point, the defendant’s publicly viewable Kazaa folder contained and listed the infringed works.302See id. at 3–4. So the plaintiffs could show that the defendant engaged in a long-running scheme of downloading songs via file-sharing software, but they could not identify any particular actions the defendant took at any particular point in time. The upshot, once again, is that a court may reasonably conclude that the defendant’s actions here—as reflected in the evidence presented and in the plaintiffs’ failure to identify particular acts—were reducible to a single infringement episode.

Now, to take another example, consider the case of Cariou v. Prince.303Cariou v. Prince, 714 F.3d 694 (2d Cir. 2013) (holding that the defendant did not infringe the plaintiff’s rights by incorporating portions of the plaintiff’s photos into the defendant’s collages). Patrick Cariou published a book of portraits depicting Rastafarians in Jamaica.304Id. at 698. A few years later, a well-known appropriation artist by the name of Richard Prince tore out several of Cariou’s photos, altered them, and incorporated them into a series of paintings and collages that he exhibited at art galleries.305Id. Cariou sued Prince for copyright infringement, alleging that Prince had infringed his rights in dozens of separate photos.306Id. at 698–700. The court held that twenty-five of Prince’s artworks made fair use of the infringed photos.307Id. at 706–10. But let us assume, for the sake of argument, that Prince was held liable for his actions involving dozens of infringed works.

Could a court conclude that Prince’s actions arose out of, and were grounded in, a single infringement episode? The common plan test suggests a reason to think so. As the Second Circuit ultimately acknowledged, Prince’s actions were carried out in pursuit of a larger creative plan. Prince incorporated Cariou’s photos into a series of artworks displayed as part of an exhibition, called “Canal Zone,” and these works were later bound up and published in an accompanying exhibition catalog.308Id. at 703. In executing his creative vision, Prince altered the infringed photos and used them to create an exhibition that was far removed from Cariou’s original work. Prince’s exhibition “manifest[ed] an entirely different aesthetic,” turning Cariou’s serene portraits into “crude and jarring works.”309Id. at 706. So while Prince used a large number of Cariou’s works, these works were all incorporated into a larger project conveying a different brand of creative and social commentary. In other words, Prince’s actions arose out of a larger creative project—and were pursued in compliance with a preconceived plan. One could thus conclude that Prince’s infringing acts constituted a single infringement episode and may warrant only a single statutory award.

This suggests that the defendant’s intent, at least to the extent that it involves a larger creative enterprise, should inform the analysis. Why inquire into the defendant’s intent? The general idea is that the defendant’s creative motivation should matter because copyright law is fundamentally about encouraging creativity.310Sony Corp. of America v. Universal City Studios, Inc., 464 U.S. 417, 429 (1984) (pronouncing that copyright law “is intended to motivate the creative activity of authors”). Nowhere is this more apparent than in the context of fair use. The most important factor in the fair use analysis is whether the purportedly infringing use is transformative.311See generally Asay et al., supra note 65 (concluding, on the basis of an empirical study, that transformative use is the most consequential factor in the fair use analysis). The defendant’s use is transformative when it introduces a new purpose, meaning, or message.312Campbell v. Acuff-Rose Music, Inc., 510 U.S. 569, 579 (noting that the defendant’s use is transformative when it “adds something new, with a further purpose or different character, altering the first with new expression, meaning, or message”). This means that, generally speaking, injecting new creative expression into an original work is likely to be privileged as transformative.313Asay et al., supra note 65, at 950–51 (finding that a significant subset of transformative use cases involve a defendant who altered an original work with new creative expression). The central point, then, is that courts should follow a similar path when adjusting damages. They should consider whether the defendant engaged with an existing work for the purpose of pursuing a new creative project—this is, after all, precisely the kind of creativity that our law seeks to foster.314See William W. Fisher III, Reconstructing the Fair Use Doctrine, 101 Harv. L. Rev. 1659, 1768 (1988) (explaining that, as a matter of copyright policy, “uses of copyrighted material that either constitute or facilitate creative engagement with intellectual products should be preferred to uses that neither constitute nor foster such engagement”); Campbell, 510 U.S. at 579 (noting that “the goal of copyright, to promote science and the arts, is generally furthered by the creation of transformative works”).

In addition, there is another set of considerations that should bear on an assessment of the defendant’s conduct: the potential consequences of a statutory award. Here, courts need to consider two separate questions. The first is whether the defendant’s conduct prevented cost recoupment by the plaintiff. Did the plaintiff already recoup the costs of creating and distributing their works? If so, courts should be more inclined to find that the defendant’s actions derive from a single episode.

To see why, consider the dominant justification for copyright law in the United States: the incentive-access tradeoff. On this account, the copyright system provides authors with incentives to create intellectual works.315Shani Shisha, Fairness, Copyright, and Video Games: Hate the Game, Not the Player, 31 Fordham Intell. Prop. Media & Ent. L.J. 694, 773–75 (2021); see also Mazer v. Stein, 347 U.S. 201, 219 (1954) (“The economic philosophy behind the clause empowering Congress to grant patents and copyrights is the conviction that encouragement of individual effort by personal gain is the best way to advance public welfare through the talents of authors and inventors in ‘Science and useful Arts.’ ”). Because intellectual goods are non-rivalrous and non-excludable,316Shisha, supra note 315, at 773–74. authors need some form of legal protection to prevent others from free riding on their creative efforts.317Id. at 777 (“[The copyright owner] might have to rely on IP to prevent competitors from copying the work and infusing the market with cheap copies. Subsequent copies are often costless and easy to mass-produce.”). Without copyright protection, copyists will be able to distribute cheap copies and undercut the author’s prices. Consequently, authors will not be able to recoup the costs of producing their works, and so they may choose not to create at all.318Id. Cost recoupment is central to this story. By providing authors with a legal entitlement to exclude copyists, copyright aims to ensure that authors are able to recover their costs.319Shani Shisha, Commercializing Copyright, 65 B.C. L. Rev. 443, 482–83 (2024). Copyright law, then, provides authors with an economic incentive to invest in the production of intellectual works.

But copyright protection comes at a price. When a copyright owner enjoys legal exclusivity, they can increase the price they charge for access to their work.320Id. at 774 (“Copyright exacts a heavy toll: it allows creators to charge supracompetitive prices, thereby pricing some consumers out of the market.”). This means that more consumers will be priced out of the market.321Kal Raustiala & Christopher Jon Sprigman, The Second Digital Disruption: Streaming and the Dawn of Data-Driven Creativity, 94 N.Y.U. L. Rev. 1555, 1606 (2019). And the problem runs deeper: by limiting access to existing works, copyright law runs the risk of frustrating, rather than encouraging, creativity.322Id. Indeed, authors often create new works by engaging with existing ones.323Mark A. Lemley, The Economics of Improvement in Intellectual Property Law, 75 Tex. L. Rev. 989, 997 (1997) (explaining that “knowledge is cumulative—authors and inventors must necessarily build on what came before them”). Therefore, if copyright law seeks to make good on its promise of encouraging creativity, it must ensure that copyrighted works are ultimately made accessible to current and future users.324Shisha, supra note 65, at 1771. That is why copyright protection must be limited in scope and duration—it must “protect authors only to the extent necessary,”325Sinclair, supra note 38, at 943 (“[Copyright law] protect[s] authors only to the extent necessary to encourage continued production of works of merit. To extend protection beyond this point would be to lose sight of the very purpose of copyright law.”). while allowing for “the creation of new works that build upon earlier ones.”326Roger D. Blair & Thomas F. Cotter, An Economic Analysis of Damages Rules in Intellectual Property Law, 39 Wm. & Mary L. Rev. 1585, 1606 (1998); see also Mark A. Lemley, Beyond Preemption: The Law and Policy of Intellectual Property Licensing, 87 Calif. L. Rev. 111, 124–25 (1999). To do so, our law should strike a balance between two conflicting interests: the need to incentivize authors on the one hand, and the need to ensure that works are accessible to society at large on the other.327Shisha, supra note 33, at 104–05.

Consistent with this understanding of copyright law, it is crucial to figure out whether the copyright owner was able to recoup their costs. And that question should inform not only the duration and scope of copyright entitlements;328Glynn S. Lunney, Jr., Copyright and the 1%, 23 Stan. Tech. L. Rev. 1, 58 (2020) (proposing an alternative framework that would take into account cost recoupment as a condition of “establish[ing] infringement or obtain[ing] injunctive relief”). it should also interact with the question of remedies. My core claim, in other words, is that in assessing whether to multiply the number of awards, courts should be able to confront the question of cost recoupment. If the plaintiff suffered little harm and reaped enormous profits—and if there is no risk the plaintiff might fail to recoup their costs—courts should find that the defendant’s actions constitute a single episode.329On the centrality of the cost-recoupment question, see id. at 57 (“When a copyright owner has had the opportunity to recoup, and certainly, when the copyright owner has recouped, its persuasion costs, the purpose of copyright has been satisfied. At that point, copyright protection should end.”).

Admittedly, the question of cost recoupment extends beyond the defendant’s immediate conduct. But assessing cost recoupment makes sense as a matter of copyright policy: it allows courts to account for the underlying objective of our system.

Finally, courts should also give pride of place to various distributional concerns. To assess whether it would be reasonable to treat the defendant’s conduct as arising out of a single scheme, courts must consider the distributional effects of an aggregate statutory award. As Bracha and Syed explain, distributional concerns should typically provoke a distinction between corporate and individual defendants.330Bracha & Syed, supra note 14, at 1251–52. By and large, when the defendant is a corporation or a firm with dispersed liability, “neither distributive nor aggregation concerns are likely to prove worrying enough to require any [judicial] safeguard[s].”331Id. at 1251. However, when the defendant is a private individual, courts should ask whether a statutory award—multiplied by the number of infringed works—would “so eat into an average individual’s income or wealth”332Id. as to trigger significant distributive concerns. A hefty award directed at an individual of average means may well “have serious effects in areas such as housing, health, education, and more generally the ability to make and pursue basic life choices.”333Id. at 1248.

It follows, then, that courts should account for the identity (and relative wealth) of the defendant in considering whether their conduct is best characterized as reflecting a single, larger event. When the defendant is a private individual of moderate means—like the defendants in Thomas or Tenenbaum—it would be more appropriate to conclude that the defendant’s actions were the product of a single episode.

In conclusion, the proposed framework turns on a number of related factors. It attends not only to the defendant’s immediate conduct, but also to some of the broader distributional and utilitarian concerns that underpin our system. In particular, it directs courts to take into account the spatial and temporal nature of the defendant’s actions; whether the evidence supporting one infringement is necessary or sufficient to sustain liability for another; whether they were executed in accordance with a common plan or a larger creative project; and the potential consequences of a statutory award.

IV. OBJECTIONS

This Part briefly considers three potential objections: that the proposed system draws inspiration from the wrong sources; that it fails to alleviate the indeterminacy that ails our law; and that it is poorly suited to the task of compensating aggrieved plaintiffs. For various reasons, none of these objections is particularly compelling. Below I explain why.

A.  Doctrinal Transplants

One preliminary objection is that the doctrinal analogues I have identified above are irrelevant or otherwise inapplicable. The basic idea is that these doctrines emerged in very different settings and were designed with different goals in mind: streamlining civil proceedings, protecting criminal defendants, and demarcating grounds for deportation. These doctrines, put simply, serve different objectives, are subject to different limitations and judicial safeguards, and should not be applied to copyright lawsuits. Accordingly, it is not clear why these discrete bodies of law should bear in any meaningful sense on copyright remedies.

Yet this objection misfires for a number of reasons. One is that, as clarified above, I do not mean to suggest here that these doctrinal analogues are directly applicable to the law of copyright remedies. The point, rather, is that these doctrines could be treated as rough sources of inspiration. A doctrinal analogue is just that—an analogue. And while the proposed framework does incorporate some notable doctrines borrowed from other fields—the same-evidence test, the temporal question, and the common plan test—I nonetheless chose to pass over certain other doctrines.

I should also clarify that, in order to account for the underlying objectives of our system, I propose that we reorient some of these doctrinal analogues. Take the “common plan” test used by courts to determine whether an alien is deportable for committing two crimes of moral turpitude.3348 U.S.C. § 1227(a)(2)(A)(ii) (“Any alien who at any time after admission is convicted of two or more crimes involving moral turpitude, not arising out of a single scheme of criminal misconduct . . . is deportable.”). Under this test, some courts ask whether the two crimes were executed in pursuit of a common, preconceived plan. When that is the case, the two crimes are thought to have arisen out of a “single scheme of criminal misconduct.”335Id. Note, however, that as I have discussed above, there is a circuit split on the issue of the common plan test. While the B.I.A. and the First, Fourth, Fifth, Seventh, and Tenth Circuits reject the common plan test, the Second, Third, and Ninth Circuits have adopted it. See text accompanying supra notes 289–91. But under the framework developed here, the “common plan” test would be expanded to cover a larger creative enterprise.336See text accompanying supra notes 303–09. This is important because, as discussed above, modern copyright law seeks to stimulate precisely this type of creativity—namely, to allow users and authors to pursue creative projects.337See text accompanying supra notes 310–14. It would thus make sense for courts to evaluate the defendant’s creative motivation when calculating remedies. Even if the defendant is found to have infringed the plaintiff’s rights—such that the defendant’s conduct cannot be shielded by the fair use doctrine—we may still think it desirable to treat such infringements as less culpable so long as the defendant pursued a creative vision.

Moreover, in a broad sense, the framework envisioned here is a rather familiar one. Though the suggested system relies on a smattering of different factors, many of these factors are ones that courts already consider. It is just that courts typically do so when considering the size of the award, rather than the number of available awards. Applying these factors, then, should not require much adaptation—courts and juries have long attended to these factors anyway. Consider, for instance, the distributional issues I discussed above, which courts today sometimes assess by asking whether a large award would “put [the defendant] out of business.”338J & J Sports Prods., Inc. v. Arboleda, No. 6:09-cv-467-Orl-18, 2009 U.S. Dist. LEXIS 99768, at *19–20 (M.D. Fla. Oct. 5, 2009) (report and recommendation of magistrate judge) (“The Court must strike a balance between deterring other incidents of piracy by these Defendants and others, and not making the award such that it will put a small business out of business.”), adopted by No. 6:09-cv-467-Orl-18, 2009 U.S. Dist. LEXIS 99782 (M.D. Fla. Oct. 27, 2009); see, e.g., Garden City Boxing Club, Inc. v. Polanco, No. 05 Civ. 3411, 2006 U.S. Dist. LEXIS 5010, at *16–17 (S.D.N.Y. Feb. 7, 2006) (noting that “[a single] violation is not so serious as to warrant putting the restaurant out of business”), aff’d, 228 F. App’x 29 (2d Cir. 2007); R.A. Guthrie Co., Inc. v. Boparai, No. 4:18-cv-080, 2021 U.S. Dist. LEXIS 61507, at *32–33 (E.D. Tex. Mar. 1, 2021) (report and recommendation of magistrate judge) (“[T]he Court is cognizant that the purpose of willfulness damages is not to put the Defendants out of business.”), adopted by No. 4:18-cv-080, 2021 U.S. Dist. LEXIS 61506 (E.D. Tex. Mar. 25, 2021); Kingvision Pay-Per-View Ltd. v. Lake Alice Bar, 168 F.3d 347, 350 (9th Cir. 1999) (“Depending on the circumstances, a low five figure judgment may be a stiff fine that deters, while a high five figure judgment puts a bar out of business.”); Dae Han Video Prod., Inc. v. Chun, No. 89-1470-A, 1990 U.S. Dist. LEXIS 18496, at *19–20 (E.D. Va. June 18, 1990) (“The court is unwilling to . . . award [the plaintiff] what would amount to a windfall award that could potentially drive the defendants’ store out of business.”); G&G Closed Cir. Events, LLC v. GCF Enters. LLC, No. EP-15-CV-00111, 2015 U.S. Dist. LEXIS 156672, at *12 (W.D. Tex. Nov. 19, 2015) (“[T]he Court also recognizes that the purpose of these damages is not to drive Defendants out of business.”); Joe Hand Promotions, Inc. v. Ducummon, No. 11-CV-278, 2012 U.S. Dist. LEXIS 56672, at *6 (N.D. Okla. Apr. 23, 2012) (noting that an enhanced damages award should not be “so substantial that it will likely put defendants out of business” (citation omitted)). In so doing, courts impose a proportionality requirement—a requirement that the final award not be so large or disproportionate as to put the defendant out of business. My suggestion, then, is that courts simply consider this factor in one additional context: not just when calibrating the size of the award but also when evaluating whether to issue one award or multiple ones.

So, to sum it up, my choice of looking outward for inspiration is defensible. I suggest that we do so by devising a careful and nuanced framework, one that accounts for the ultimate objectives of the copyright system. And, essentially, this system is structured around an assortment of factors that courts already consider in different contexts, both within and outside of copyright law.

B.  Indeterminacy

Another objection is that the proposed system would not resolve the problem of indeterminacy. As everyone seems to agree, the caselaw is a mess. Courts issue divergent awards in similar cases and rarely offer any explanation for doing so. The caselaw, then, turns out to be inconsistent and arbitrary. And the proposed framework, in turn, does little to confront this issue. Instead, it affords courts and juries even more discretion, asking them to carry out a flexible, holistic inquiry on the basis of several unweighted factors. Won’t that make matters worse?

One possible response is that the proposed system will not make things any worse than they are now—the law is already an unmitigated disaster. As others have observed, courts tend to issue awards that are simply inexplicable, often dispensing radically different awards in similar circumstances.339See supra Section II.B. So even if my proposed system will not yield a measurable improvement in consistency, it is also unlikely my framework will make matters any worse. We’ve already hit rock bottom.

In any event, it is important to remember that a host of other issues bedevil our system. In a system driven by per-work aggregation, statutory awards can swell up dramatically.340See supra Section II.C. The result is that some defendants face a significant risk of excessive awards. Copyright’s per-work structure also discourages courts from developing substantive law.341See supra Section II.D. And it is here that one can clearly see the benefits of a more flexible system: if we decouple the number of awards from the number of infringed works, at least in cases where the defendant’s conduct resulted from a single infringement episode, the twin problems of aggregation and underdevelopment are likely to diminish. Given that these problems derive from per-work remedies, we could mitigate their effects by allowing courts to break from our per-work scheme. This means that, while my approach cannot tackle all of the relevant issues, it can surely contribute to eliminating or addressing some of them.

This suggests a related point: I am not claiming here that the proposed framework is a catchall solution. Rather, my proposal must be accompanied by a suite of additional reforms designed to overhaul our remedial scheme. Many of the proposals other scholars have advanced—such as reducing the statutory range,342See Depoorter, supra note 15, at 443. imposing a statutory cap,343Id. at 445. tying statutory damages to actual losses,344Samuelson & Wheatland, supra note 3, at 502–03; Bracha & Syed, supra note 14, at 1249–50. and imposing sanctions on plaintiffs who make inflated claims345See Depoorter, supra note 15, at 442.—could work in tandem with my proposed system. While these proposals seek to reorient or limit the courts’ discretion in adjusting the size of the award, my proposal would allow courts to do away with per-work damages altogether.

C.  Compensation

One final objection is that the proposed scheme would make it harder for plaintiffs to obtain compensation for the actual harm they suffered. As noted above, most copyright scholars agree that courts are insufficiently attentive to the rationales underlying statutory damages.346See supra Section II.A. And while it is clear that Congress intended for courts to award enhanced damages only in a narrow subset of “exceptional cases”—those involving large-scale infringement or repeat transgressors—it is also clear that, in most other cases, statutory damages were meant to serve a largely compensatory role.347Shisha, supra note 65, at 1756–57 (“The legislative history shows that increased damages for willful infringement were originally meant to apply to exceptional cases—nakedly egregious cases in which the infringer brazenly flouted the law. So while statutory damages are largely compensatory, there are exceptional circumstances in which courts might award punitive-like damages for willful infringement.” (citing Samuelson & Wheatland, supra note 3, at 441)). Thus, with one limited exception, statutory damages were meant to ensure that copyright owners would be compensated for their losses.

Yet there is a problem lurking beneath the surface: if statutory damages were engineered to serve a compensatory role, courts would have to account for the number of infringed works. After all, each work represents a distinct source of harm.348Id. at 1789. A copyright owner will often charge a licensing fee for access to each of their works. To account for the plaintiff’s lost fees, then, the court will have to take stock of each and every work at issue. My proposal, however, could frustrate this process. By severing the connection between the number of awards and the number of works, the suggested system threatens to deny plaintiffs the opportunity to obtain proper compensation for their actual damages.

But is that really the case? As I have emphasized before, there is more than one way to compensate a copyright holder for the harm they sustained.349Id. at 1790. One way is to adjust the number of awards based on the number of infringed works. But there is another possibility, and perhaps an equally effective one: “control[ing] for the number of infringed works by increasing or reducing the size of the aggregate award” within the statutory range.350Id. Courts and juries have at their disposal an incredibly robust statutory range stretching from $200 to $150,000, depending on the type of case at hand. And they can increase or reduce the total award within that range based on the number of works infringed and the defendant’s level of culpability.

For example, suppose the defendant copied three of the plaintiff’s photos. Now suppose that the plaintiff typically charges a licensing fee of $1,000 for each of their photos, meaning that the plaintiff’s total loss is $3,000. In adjusting the award, the court can pursue one of two possibilities: it can issue three separate awards of $1,000 each; or, alternatively, the court can dispense a single, aggregate award in the amount of $3,000. In other words, the court should be able to ensure full compensation by calibrating a single, aggregate award within the statutory range.

Nevertheless, there is an additional wrinkle here: if courts are to rely on a single, aggregate award, the statutory range has to be sufficiently broad to accommodate cases involving a large number of works or a small number of particularly valuable ones. To be sure, there may be cases where so many works are at stake—hundreds or even thousands—that a single award, even at the statutory maximum of $150,000, simply won’t do. For example, if the defendant copied 5,000 songs, each valued at $50, the plaintiff’s total loss would amount to $250,000 (5,000 x $50). This means that a single award of $150,000 simply wouldn’t suffice. Or imagine a case implicating 25 works, each valued at $10,000—again, a total loss of $250,000 would exceed the statutory maximum. What this suggests is that the statutory range must be robust enough to accommodate an array of possible scenarios.

These concerns, however, are overblown. First, even under the proposed approach, courts may nonetheless find that a mass-infringement event does not qualify as a single infringement episode. Perhaps such an event would be better described as a series of separate episodes, each warranting an additional statutory award. Exceptional cases call for exceptional remedies. Second, there is good reason to believe that such cases would be vanishingly rare. The vast majority of copyrighted works command little market value, if any, and the current statutory range would likely be sufficiently broad to ensure proper compensation in a large majority of cases.351Shisha, supra note 315, at 782 (concluding, on the basis of recent empirical work, that “in practice, few works carry market value, and most are only commercially viable for short periods of time”); Shisha, supra note 319, at 456–58 (noting that most creative works—across a range of different industries—command little commercial value). In a sense, any concrete cap on the amount of damages would be arbitrary—we simply have to draw the line somewhere. But given that most works have little value, the current statutory limit seems more than sufficiently robust to accommodate the typical infringement case.

CONCLUSION

The law of statutory damages is in a state of disrepair. It often produces “arbitrary, inconsistent, unprincipled, and grossly excessive awards.”352Samuelson & Wheatland, supra note 3, at 497. Courts command “a contested, somewhat obscured, and even outright confused” understanding of copyright’s remedial system.353Bracha & Syed, supra note 14, at 1230. And statutory damages are susceptible to pervasive “overclaiming” across “virtually all areas of copyright law.”354Depoorter, supra note 15, at 407. But most reform proposals to date have papered over the real issue. So entwined are statutory damages in our copyright system that any attempts at serious reform may seem hopeless. And although commentators uniformly agree that statutory damages present a problem of colossal proportions, they have not yet been able to tackle the core problem: per-work damages.

This Article attempts to do just that. It seeks to jumpstart a conversation about structural reform and offer a roadmap for legislative and judicial action to confront the risk of inflated damages. It proposes, in short, that we do away with per-work damages altogether. It develops an alternative system that would accord judges a significant degree of discretion to look beyond the number of infringed works. Courts could do so by examining whether the defendant’s actions are collectively attributable to a larger infringement episode. When the defendant’s conduct arises out of a single episode, courts should issue only a single statutory award—no matter how many individual works are at issue. If adopted, this system would reduce the risk of excessive awards, prompt courts to properly adjust damages, and introduce a degree of pragmatism into our system.

This framework may seem radical. That is so in part because it appears to mandate a sharp break from a centuries-old system of per-work damages. But, in a sense, the proposed system is, in fact, a familiar one—after all, courts have forged similar doctrines to address analogous problems across a variety of otherwise distinct areas of law, including criminal law, civil procedure, and immigration law.

In the end, this Article envisions a world without per-work damages. What distinguishes the proposed system from other proposals is that it takes seriously the core issue: per-work remedies make it too easy for courts and juries to aggregate damages in ways that lead to outlandish awards. After thirty years of faint-hearted debates, it is well past time that policymakers try something different.

97 S. Cal. L. Rev. 1029

Download

* Assistant Professor, SMU Dedman School of Law; Fellow, Berkman Klein Center for Internet & Society, Harvard Law School, 2021–22. For valuable feedback and generative conversations, I thank William Fisher, William McCoy, Guy Rubinstein, Brian Soucek, Oren Tamir, and Rebecca Tushnet. I received helpful comments from participants at the Harvard Law School Art of the Response Paper Workshop. I am also grateful to Vice Dean Gabriella Blum and Assistant Dean Catherine Peshkin for providing institutional support for this project. I am indebted to the editors of the Southern California Law Review, particularly Madeline Goossen, Ariana Croll, Mariem Masmoudi, Carus Newman, and Chloe Williams, for superb editorial work. Finally, I would like to thank the Harvard Law School Graduate Program for generous financial support.