Self-Defense Exceptionalism and the Immunization of Private Violence

After the high-profile trial of Kyle Rittenhouse, the parameters of lawful self-defense are a subject of intense public and scholarly attention. In recent years, most commentary about self-defense has focused on “Stand Your Ground” policies that remove the duty to retreat before using lethal force. But the reaction to Rittenhouse’s case reflects a different, more extreme way that the law governing defensive force is changing. In particular, advocates and legislators say that private citizens like Rittenhouse who exercise self-defense should be entitled to immunity—an exemption from prosecution—giving them an extraordinary procedural benefit not attaching to other defenses that are adjudicated at trial. As this Article reveals, this effort to transform self-defense into something exceptional within criminal law began more than a decade ago in the shadows of Stand Your Ground. One-quarter of U.S. states have already enacted laws providing for self-defense immunity.

This Article examines this fundamental yet understudied shift in self-defense law. It shows how the concept of immunizing defensive force is foreign to the Anglo-American legal tradition as well as settled principles of modern criminal law and procedure, including the exceedingly narrow role of immunities. It tells the story of how self-defense immunity arose not as part of the broader criminal justice reform movement, but rather at the behest of the movement to insulate defensive gun use from liability. And it demonstrates the costs of treating self-defense as an immunity, such as increasing violence, diminishing the institution of the jury, delegitimizing criminal law outcomes, and undermining judicial economy. After exposing the unreasoned rise and inevitable costs of self-defense immunity, this Article concludes that self-defense should remain an affirmative defense to criminal charges rather than immunize a defendant from being prosecuted at all. Self-defense reform should move in lockstep with other criminal law defenses so as to avoid the societal harms that result from immunizing defensive violence.

INTRODUCTION

On August 25, 2020, seventeen-year-old Kyle Rittenhouse traveled to Kenosha, Wisconsin, with an illegally obtained AR-15–style rifle in the wake of the shooting of Jacob Blake by a police officer.1Kim Bellware, What to Know About the Contentious Trial of Kyle Rittenhouse, Wash. Post (Nov. 10, 2021, 8:03 AM), https://www.washingtonpost.com/nation/2021/11/10/rittenhouse-trial-faq [https://perma.cc/ED9L-K3MG]. Rittenhouse said he went heavily armed to provide medical aid and protect property, albeit strangers’ property, during racial justice protests and unrest following yet another police shooting of a Black man.2Id. Instead, he shot three men during altercations, killing two of them.3Id. Rittenhouse was charged with crimes including murder,4See Crim. Complaint, State v. Rittenhouse, 2020 CF 000983 (Aug. 27, 2020). Wisconsin does not have a crime called “murder”; instead, it proscribes “first-degree intentional homicide” when a person “causes the death of another human being with intent to kill that person.” Wisc. Stat. § 940.01 (2022). and in his defense he asserted self-defense: he feared that the men would disarm him and use his own rifle against him unless he shot them first.5Shaila Dewan, Can Self-Defense Laws Stand Up to a Country Awash in Guns?, N.Y. Times (Nov. 13, 2021), https://www.nytimes.com/2021/11/13/us/rittenhouse-arbery-self-defense.html [https://
perma.cc/YC5U-XKFD].

Rittenhouse’s case was closely watched and controversial, splitting the nation into diametrically opposed camps regarding the appropriateness of his conduct. It also raised difficult factual and legal questions, including whether he provoked the confrontations and thereby negated the lawfulness of his defensive force.6Cynthia Lee, How a Vaguely Worded Wisconsin Law Could Let Rittenhouse Walk, Politico (Nov. 17, 2021), https://www.politico.com/news/magazine/2021/11/17/wisconsin-self-defense-law-rittenhouse-522814 [https://perma.cc/7C86-Y292] (describing Wisconsin’s initial aggressor doctrine in relation to the Rittenhouse case). At the end of a two-week trial at which dozens of witnesses testified, a jury deliberated for three days and returned a verdict of not guilty.7Julie Bosman, Kyle Rittenhouse Was Found Not Guilty of Intentional Homicide and Four Other Charges, N.Y. Times (Nov. 19, 2021), https://www.nytimes.com/live/2021/11/19/us/kyle-rittenhouse-trial [https://perma.cc/6A5R-6XEW]. The outcome should have pleased those who supported Rittenhouse’s conduct that summer night. Instead, a common reaction was, as former President Donald Trump put it, that Rittenhouse “shouldn’t have been prosecuted in the first place.”8Fox News, Trump on Rittenhouse Verdict, YouTube (Nov. 19, 2021), https://
http://www.youtube.com/watch?v=b0lReIesfZE&t=6s [https://perma.cc/39J9-D7PW]; see also Bosman, supra note 7 (quoting Republican candidate for Wisconsin governor, Rebecca Kleefisch, as asserting that the prosecution of Rittenhouse was a “complete disgrace”).

If that sentiment were simply a feature of modern political rhetoric, it might be undeserving of close scrutiny. Indeed, the politics of self-defense shone brightly after the Rittenhouse trial. U.S. Representative Marjorie Taylor Greene even introduced a bill to award Rittenhouse a civilian’s highest congressional tribute, a Congressional Gold Medal, for his “courageous actions.”9Kyle H. Rittenhouse Congressional Gold Medal Act, H.R. 6070, 117th Cong. (Nov. 23, 2021); Mariana Alfaro, Rep. Greene Introduces Bill to Award Congress’s Highest Honor to Kyle Rittenhouse, Who Fatally Shot Two Men, Wash. Post (Nov. 24, 2021, 7:35 PM), https://www.washingtonpost.com/
politics/greene-rittenhouse-congressional-gold-medal/2021/11/24/c09980d2-4d49-11ec-a1b9-9f12bd39
487a_story.html [https://perma.cc/6XEN-XCX7]. Greene voted not to grant the same award to the police officers who defended the Capitol during the riots of January 6, 2021. Annie Grayer & Kristin Wilson, 21 Republicans Vote No on Bill to Award Congressional Gold Medal for January 6 Police Officers, CNN: Politics (June 16, 2021, 12:19 PM), https://www.cnn.com/2021/06/15/politics/congressional-gold-medal-house-vote/index.html [https://perma.cc/82HH-EDCN].
Several Republican politicians invited Rittenhouse to intern in their offices.10Jon Skolnik, Lauren Boebert Challenges Madison Cawthorn to “Sprint” for Rittenhouse Internship, Salon (Nov. 24, 2021, 5:25 PM), https://www.salon.com/2021/11/24/lauren-boebert-challenges-madison-cawthorn-is-in-a-wheelchair-to-sprint [https://perma.cc/H96Z-X8JF]. Just days after the verdict, he was welcomed at Trump’s Mar-a-Lago Club in Florida.11Jennifer Hassan, Donald Trump Meets with Kyle Rittenhouse After Verdict, Calls Him “A Nice Young Man,” Wash. Post (Nov. 24, 2021, 6:28 AM), https://www.washingtonpost.com/nation/
2021/11/24/trump-meets-kyle-rittenhouse [https://perma.cc/MU3U-99SP].

But this Article shows how the notion that people “should not fear exposure to criminal prosecution when they use firearms to defend themselves and their homes” is more than rhetoric.12Amicus Brief of Attorney General Eric Schmitt Supporting Dismissal of the Case, State v. McCloskey, No. 2022-CR01300, at *1 (Cir. Ct. Mo. July 20, 2020). Rather, it is the foundation for an effort to grant an exemption from prosecution to those who, like Rittenhouse, claim self-defense in defending against criminal charges. After Rittenhouse’s acquittal, one advocate penned “Kyle’s Law” to cement the exalted status of self-defense.13Kyle’s Law: Stopping Politically Motivated Prosecutions of Self-Defense, Law of Self Defense [hereinafter Kyle’s Law], https://losd.ubpages.com/kyleslaw/ [https://perma.cc/DV72-N9UN]. The proposed statute would alter the law in various ways, including effectively immunizing lawful defensive force from prosecution altogether.14See id. (“Let’s make ALL probable cause hearings in self-defense cases into something akin to self-defense immunity hearings—if the prosecution can’t disprove self-defense by a preponderance of the evidence at this pre-trial hearing, the matter is dismissed with prejudice . . . .”). The measure also proposes exposing prosecutors to personal liability in self-defense cases. Id. As it turns out, more than one-fourth of U.S. states have already done just that,15See Ala. Code § 13A-3-23(d) (2016); Colo. Rev. Stat. § 18-1-704.5(3) (1985); Fla. Stat. § 776.032 (2005); Ga. Code Ann. § 16-3-24.2 (2014); Kan. Stat. Ann. § 21-5231 (2011); Ky. Rev. Stat. Ann. § 503.085 (West 2006); Okla. Stat. tit. 21 § 1289.25(F) (2018); S.C. Code Ann. § 16-11-450 (2006); Mich. Comp. Laws § 780.961(1) (2006); Idaho Code § 19-202A(1) (2018); Utah Code Ann. § 76-2-309 (2021); S.D. Codified Laws § 22-18-4.8 (2021); Iowa Code § 704.13 (2017); N.C. Gen. Stat. § 14-51.3 (2011). and the trend is likely to continue.16See, e.g., S. 1120, Reg. Sess. 2023–2024 (N.Y. 2023); S. 666, 101st Gen. Assemb., 2d. Reg. Sess. (Mo. 2022); see also S. 215, 134th Gen. Assemb., Reg. Sess. (Ohio 2021); S. 71, 64th Leg., Budget Sess. (Wyo. 2018).

In the past decade, legal scholarship has explored “Stand Your Ground,” or the removal of the common law duty to retreat before using lethal defensive force in public.17See, e.g., Megan Miller & John Pepper, Assessing the Effect of Firearms Regulations Using Partial Identification Methods: A Case Study of the Impact of Stand Your Ground Laws on Violent Crime, 83 Law & Contemp. Probs. 213 (2020); Tamara Rice Lave, Shoot to Kill: A Critical Look at Stand Your Ground Laws, 67 U. Mia. L. Rev. 827 (2013); Jeannie Suk, The True Woman: Scenes from the Law of Self-Defense, 31 Harv. J.L. & Gender 237 (2008). Civic groups, including the American Bar Association, have also evaluated and critiqued Stand Your Ground. See, e.g., Am. Bar Ass’n, National Task Force on Stand Your Ground Laws: Final Report and Recommendations (Sept. 2015) [hereinafter ABA Task Force], https://www.americanbar.org/content/dam/aba/administrative/
diversity/SYG_Report_Book.pdf [https://perma.cc/SM5C-4BPU]; Giffords Law Ctr., “Stand Your Ground Kills”: How These NRA-Backed Laws Promote Racist Violence (May 2021), https://giffords.org/lawcenter/report/stand-your-ground-kills-how-these-nra-backed-laws-promote-racist-
violence [https://perma.cc/9YYG-RANF]; Rand Corp., The Effects of Stand Your Ground Laws (Apr. 2020), https://www.rand.org/research/gun-policy/analysis/stand-your-ground.html [https://perma.
cc/8JVJ-N384].
That literature shows how Stand Your Ground interacts with an expansion of gun rights in a way that can lead to more violence and exacerbate existing patterns of discrimination in the criminal justice system.18See infra notes 234–38 and accompanying text (discussing literature). Articles have likewise explored additional features of the intersection of criminal law, self-defense, and gun rights.19In earlier work, I considered how increased gun carry can dilute the ways self-defense law traditionally has operated to steer conflicts away from unnecessary lethal violence. Eric Ruben, An Unstable Core: Self-Defense and the Second Amendment, 108 Cal. L. Rev. 63, 100–01 (2020) (“If the Second Amendment protects a broad right to carry handguns virtually everywhere and at all times, and most Americans choose to exercise that right, conflicts would regularly present a threat of lethal violence, and lethal force would regularly be perceived as a reasonably proportional and necessary response. In such a world, necessity and proportionality mean less, no longer moderating between lethal and nonlethal defensive force.” (citations omitted)). Others have observed how the criminal law provides “thin and blurry” answers to the question of when brandishing a gun is lawful self-defense or a crime, Joseph Blocher, Samuel W. Buell, Jacob D. Charles & Darrell A.H. Miller, Pointing Guns, 99 Tex. L. Rev. 1173, 1190 (2021), and how citizen arrest provisions, when combined with gun rights, can lead to deadly outcomes, Kimberly Kessler Ferzan, Taking Aim at Pointing Guns? Start with Citizen’s Arrest, Not Stand Your Ground, 100 Tex. L. Rev. Online 1, 7–12 (2021). And legal scholars are starting to explore whether self-defense law might be bolstered in light of changed circumstances—especially the proliferation of gun carry—to limit the unnecessary loss of life.20Cynthia Lee recently has proposed that policymakers adjust the initial aggressor doctrine to place more of a burden on those who carry guns and then claim self-defense after using them in confrontations. Cynthia Lee, Firearms and Initial Aggressors, 101 N.C. L. Rev. 1 (2022). Rafi Reznik has argued that self-defense should be conceived as an excuse, not a justification, for otherwise unlawful violence. Rafi Reznik, Taking a Break from Self-Defense, 32 S. Cal. Interdisc. L.J. 19 (2022); see also infra notes 207–09 and accompanying text (discussing the justification/excuse distinction and Reznik’s argument). Meanwhile, Guha Krishnamurthi and Peter N. Salib explain how the confluence of expansive self-defense laws and firearm possession creates dangers of violence for even well-intentioned, rational actors. See Guha Krishnamurthi & Peter N. Salib, Small Arms Races, U. Chi. L. Rev. Online (June 3, 2022), https://lawreviewblog.uchicago.edu/2022/06/03/krishnamurthi-salib-small-arms-races [https://
perma.cc/6TGF-CQXY]. After the Supreme Court established a broad Second Amendment right to carry handguns in New York State Rifle & Pistol Association v. Bruen, 142 S. Ct. 2111 (2022), the focus on how self-defense law—as well as the criminal law more generally—might be adjusted to achieve optimal outcomes will only increase. See generally Eric Ruben, Public Carry and Criminal Law After Bruen, 135 Harv. L. Rev. F. 505 (2022) (highlighting intersections between criminal law and public carry beyond licensing that could attract policymaking attention after Bruen).

Yet the notion that self-defense is exceptional and “deserves” to be immunized, as one legislative witness put it,21Self-Defense Amendments: Hearing on H.B. 227 Before the H. Judiciary Comm., 64th Leg., 2021 Gen. Sess. (Utah 2021), https://le.utah.gov/av/committeeArchive.jsp?timelineID=180423 [https://perma.cc/C63Q-C36R] (statement of Mitch Vilos). has evaded close scrutiny. Articles about Stand Your Ground have acknowledged what Cynthia Ward termed the “curious beast” of self-defense immunity as well as the “confusion” it invites.22Cynthia V. Ward, Three Questions About “Stand Your Ground” Laws, 95 Notre Dame L. Rev. Reflection 119 (2021); see also Benjamin M. Boylston, Immune Disorder: Uncertainty Regarding the Application of “Stand Your Ground” Laws, 20 Barry L. Rev. 25 (2014) (discussing vagueness in how states are to implement self-defense immunity); Jennifer Randolph, Comment, How to Get Away with Murder: Criminal and Civil Immunity Provisions in “Stand Your Ground” Legislation, 44 Seton Hall L. Rev. 599, 618 (2014) (observing how self-defense immunity provisions are unclear, which could lead to inconsistent application). In an earlier article, Jonathan Markovitz critiqued how self-defense immunity can “increase opportunities for racial stereotypes to cloud the reasonableness component of the self-defense determination.” Jonathan Markovitz, “A Spectacle of Slavery Unwilling to Die”: Curbing Reliance on Racial Stereotyping in Self-Defense Cases, 5 U.C. Irvine L. Rev. 873, 877 (2015). Mary Anne Franks, in an article about the asymmetrical distribution of violence between genders, observed how “immunity, by decreasing the likelihood of arrest or prosecution of a person using deadly force, lowers the transaction costs of using such force, which arguably makes the use of violence more appealing.” Mary Anne Franks, Men, Women, and Optimal Violence, 2016 U. Ill. L. Rev. 929, 936 (2016). I build on this observation in Section III.A. However, self-defense immunity warrants a sustained analysis in terms of how it began as an adjunct to the gun rights movement and how it fits within the criminal justice system today. That, in turn, calls for an examination of a more general topic that similarly has received little attention: the procedural treatment of criminal law defenses and why prosecutorial immunities are so few in number. To exempt a category of defendants from the ordinary criminal process is profound, bestowing “a far greater right than any encompassed by an affirmative defense, which may be asserted during trial but cannot stop a trial altogether.”23Bunn v. State, 667 S.E.2d 605, 608 (Ga. 2008). Examining why the criminal law is generally opposed to granting an exemption from prosecution is an important, understudied part of the inquiry.24See infra notes 94–105 and accompanying text (discussing immunity in the context of criminal law’s distinctive function of expressing a community’s moral condemnation).

This Article proceeds in three parts. Part I shows how justifications for otherwise criminal conduct, like self-defense, have traditionally been adjudicated: as affirmative defenses to criminal charges. Some have argued that immunizing self-defense is simply a return to past protections that have been lost in recent times.25See infra notes 174–78 and accompanying text (discussing reliance on historical arguments in advocacy for self-defense immunity). But those engaging in private violence have always been exposed to criminal prosecution and trial. The argument that self-defense exceptionalism is rooted in tradition is unsupported.

Part I also shows how modern pretrial criminal procedure is consistent with the historical antecedents. The formal process is overwhelmingly structured to bring cases forward to trial, even if few cases get that far.26See Carissa Byrne Hessick, Punishment Without Trial: Why Plea Bargaining Is a Bad Deal 32–33 (2021) (noting that guilty plea rates have been above 90% since the 1990s). Pretrial screening is largely geared toward questioning the basis for the charged offense, not adjudicating potential defenses.27See infra Section I.B (discussing pretrial screening mechanisms). The criminal law makes exceptions for a narrow set of pretrial matters—narrower than in the civil context. The scant prosecutorial immunities and their narrow justifications can be linked to the criminal law’s aims and distinctive character, which are especially protective of public prosecutions. The exceptions that receive prosecutorial immunity tend to be fundamentally different than self-defense in both their scope and purpose. In particular, other criminal law immunities benefit narrow classes of defendants and must be addressed ahead of trial to protect distinctive public interests like maintaining foreign relations or preserving the balance of powers.28See infra Section I.C (discussing immunities and other pretrial matters). Self-defense, in contrast, can be invoked by any defendant and, like a multitude of other defenses,29“Current law recognizes a surprising variety of . . . possible bars to conviction, from amnesia to withdrawal.” Paul H. Robinson, Criminal Law Defenses: A Systemic Analysis, 82 Colum. L. Rev. 199, 203 (1982). Paul Robinson identifies fifty-four such bars to conviction. Id. at 203 n.7. can be adjudicated at trial without undermining its role as justifying otherwise unlawful conduct. Moreover, interests served by self-defense law—like maintaining the legitimacy of the legal order—are actually undermined by immunity.

Part II then turns to the next logical question: Why are states now diverging from American legal tradition and standard practices to treat self-defense as something exceptional? The Article traces self-defense immunity from a barely debated and misunderstood change to Colorado law in the 1980s to a primary ambition of gun rights advocates in the 2000s. The resulting legal changes are often characterized as “Stand Your Ground laws,” but that understates the transformation that is afoot. Stand Your Ground relates to just one of many ways that legislators are remaking the law governing defensive force. Indeed, one possible reason why self-defense immunity has escaped close scrutiny is that the typical focus is on the substantive elements establishing what lawful self-defense is, and especially the duty to retreat, while glossing over changes to how self-defense is adjudicated.30Cf. Ward, supra note 22, at 138 (“Clarifying the issues is a necessary step toward a rational conversation not only about Stand Your Ground, but also about other controversial elements of self-defense.”).

Yet while Stand Your Ground has garnered the most attention, advocates—and especially gun rights advocates—have pursued a deeper goal: insulating defensive gun use from legal oversight to the greatest extent possible. It is hard to overstate the degree to which the quick rise of self-defense immunity is due to lobbying by advocates for one deadly weapon (the gun) that is used in a minuscule percentage of self-defense confrontations.31See Eric Ruben, Law of the Gun: Unrepresentative Cases and Distorted Doctrine, 107 Iowa L. Rev. 173, 202 (2021) (“According to the [National Crime Victimization Survey], fewer than 1 percent of crime victims report using a gun in self-defense . . . .” (citing David Hemenway & Sara J. Solnick, The Epidemiology of Self-Defense Gun Use: Evidence from the National Crime Victimization Surveys 2007–2011, 79 Preventative Med. 22, 22 (2015))). The loudest voices advocating for immunizing self-defense tend not to be those seeking criminal justice reform generally but rather those seeking to expand gun rights. A National Rifle Association (“NRA”) lobbyist, for example, drafted and led the campaign to institute self-defense immunity in Florida, which then became a model for states across the nation.32See infra notes 162–64 and accompanying text (discussing the involvement of the National Rifle Association in the spread of self-defense immunity laws). The playbook for transforming self-defense into an immunity mirrors the one used to expand gun rights.33See infra notes 149–54 and accompanying text (describing similarities in arguments raised for gun rights and self-defense immunity). The overlap between gun rights and self-defense rights advocacy begs the question of whether any principle other than bestowing a benefit on gun users is guiding self-defense’s transformation from an affirmative defense into an immunity. Part II raises several possibilities, but it finds each too thin to justify such an immense procedural departure.

Part III then explores functional and institutional costs of immunizing private violence. Self-defense immunity sends a signal that people can judge for themselves when to deploy violence in the name of self-protection without exposure to prosecution, thereby encouraging unnecessary violence.34See infra Section III.A. Meanwhile, by preventing the community, through the jury, from evaluating the lawfulness of defensive force, immunity jettisons the institution best suited for adjudicating self-defense.35See infra Section III.B. In addition, immunizing self-defense creates an inefficient process by which courts consider the same witnesses and arguments that will be presented at trial during a separate pretrial hearing, setting up the sort of mini-trial that criminal procedure generally disfavors.36See infra Section III.C.

Trials like Rittenhouse’s spark intense disagreement and debate. But such trials are a feature—not a bug—of the American justice system. The Article concludes that policymakers should keep self-defense in its traditional place as an ordinary affirmative defense to criminal charges. Criminal justice reform is desperately needed, but treating private violence as privileged at the behest of gun rights advocates is a perilous path.

I.  SELF-DEFENSE AND PRETRIAL CRIMINAL PROCEDURE

       As Carl Sagan famously put it: “You have to know the past to understand the present.”37Carl Sagan, Cosmos 41 (1980). That maxim applies equally well for modern criminal law. This Part thus explores how self-defense was historically implemented in criminal procedure. It shows how the criminal justice system that the United States adopted from England was “trial-centered, in the sense that the legal system sought to resolve most criminal business at trial,”38John H. Langbein, The Origins of Adversary Criminal Trial 7 (2003). including claims of self-defense. This Part then shows how that treatment continued in modern times until the recent effort to grant pretrial prosecutorial immunity for self-defense. The effort to recharacterize self-defense as an immunity invites a question about how immunities fit within the criminal justice system. This Part closes by addressing that question, showing how and why prosecutorial immunities are few in number and narrowly construed, and how and why their typical rationale does not apply to self-defense.

A.  Historical Procedure

In 1841, in People v. McLeod, a New York court considered a habeas corpus petition for a defendant charged with murder.39People v. McLeod, 1 Hill 377 (N.Y. Sup. Ct. 1841). The defendant sought his “unqualified discharge” on the basis of pretrial evidence that, among other things, he acted in lawful self-defense.40Id. at 392–93. The court emphatically rejected the “extraordinary” request,41Id. at 406. noting the “absurdity of such a proposition in practice, and its consequent repudiation by the English criminal courts” whose law and procedure the United States inherited.42Id. at 404. Among other things, granting the defendant’s request “would be to trench on the office of the jury.”43Id. at 397. As the court explained, “[a]n innocent man may be, and sometimes unfortunately is[,] imprisoned. Yet his imprisonment is no less lawful than if he were guilty. He must await his trial before a jury.”44Id. at 404. That early American understanding of the appropriate time—and the appropriate entity—to adjudicate self-defense was firmly rooted in the English common law tradition.

During the seventeenth and eighteenth centuries in England, after a felony was charged, judges lacked authority to discharge defendants “without further trial.”45Michael Dalton, The Country Justice 407 (1618) (“[I]t is not fit that a [m]an once arrested and charged with Felony (or suspicion thereof) should be delivered upon any [m]an’s discretion, without [further] [t]rial.”). Justices of the peace played the central role in administering the criminal law. See generally Larry M. Boyer, The Justice of the Peace in England and America from 1506 to 1776: A Bibliographic History, 34 Q.J. Libr. Cong. 315 (1977) (discussing the power and reach of justices of the peace in criminal matters); see also Saul Cornell, The Right to Keep and Carry Arms in Anglo-American Law: Preserving Liberty and Keeping the Peace, 80 Law & Contemp. Probs. 11 (2017) (discussing justice of the peace manuals used by English and American officials between 1688 and 1835). This was true regardless of whether the defendant was believed to be justified in engaging in the alleged offense conduct.46See Richard Burn, The Justice of the Peace and Parish Officer 207 (1756) (“If a felony is committed, and one is brought before a justice upon suspicion thereof, and the justice finds upon examination that the prisoner is not guilty, yet the justice shall not discharge him, but he must either be bailed or committed; for it is not fit that a man once arrested and charged with felony, or suspicion thereof, should be delivered upon any man’s discretion, without further trial.”); see also Langbein, supra note 38, at 46–47 (“[T]he JPs had no power to dismiss felony charges for insufficiency of the evidence.”); id. at 47 (“What passed for truth in English criminal procedure would have to emerge at trial, from the altercation of citizen accusers and citizen accused.”). Some justices of the peace pressured prosecutors to discharge cases, while recognizing their own limited ability to discharge cases before trial. Id. at 47 n.184. In the 1700s, judges began conducting a “pretrial inquiry” that “increasingly took on the trappings of a public hearing, which would ultimately come to be known as the preliminary hearing.”47Langbein, supra note 38, at 274. At such hearings, however, the defense attorney was limited to challenging the prosecution’s case and was not entitled to present the defense’s case.48Id. at 274–75; see also id. (“As late as 1787 an experienced Old Bailey barrister serving as defense counsel remarked in response to a question from the bench that ‘[t]he Magistrates at Bow Street never receive evidence for prisoners, only for prosecutors.’ ” (citing Darcy Wentworth & Mary Wilkerson, Old Bailey Sessions Papers (“OBSP”) 15, 19 (Dec. 1787, #8) (quoting Newman Knowlys))).

Classic common law treatises demonstrate how self-defense was just like other defenses in that it was a trial issue, not a pretrial issue. For example, Michael Foster, a judge on the King’s Bench and the author of a widely read treatise published in 1762, observed that the defendant raising self-defense “standeth upon just the same foot that every other Defendant doth: the Matters tending to Justify, Excuse, or Alleviate, must appear in Evidence before He can avail himself of them.”49Michael Foster, A Report of Some Proceedings on the Commission of Oyer and Terminer and Goal Delivery for the Trial of the Rebels in the Year 1746 in the County of Surry, and of Other Crown Cases 255 (1762). And the opportunity to introduce that evidence was not until trial: “[W]hether the Facts alledged by way of Justification, Excuse, or Alleviation are True, is the proper and only Province of the Jury.”50Id.; see also id. (“In every Charge of Murder, the Fact of Killing being first proved, all the Circumstances of Accident, Necessity, or Infirmity are to be satisfactorily proved by the Prisoner.”).

Several years after Foster’s publication, William Blackstone completed “the preeminent authority on English law for the founding generation,”51Alden v. Maine, 527 U.S. 706, 715 (1999); see also District of Columbia v. Heller, 554 U.S. 570, 593–94 (2008). in which he explained that “it is incumbent upon the prisoner to make out, to the satisfaction of the criminal court and jury,” any “circumstances of justification, excuse, or alleviation.”52See 4 William Blackstone, Commentary on the Laws of England *201 (1769). The jury, Blackstone wrote, is “to decide whether the circumstances alleged [regarding self-defense or other affirmative defenses] be proved to have actually existed”; the judge then decides “how far [the proved circumstances] extend to take away or mitigate the guilt.”53Id.

Edward Hyde East, in his influential 1803 treatise, built on Blackstone’s and Foster’s accounts and elaborated on the lack of a pretrial process for asserting self-defense.541 Edward Hyde East, Treatise of the Pleas of the Crown 340 (1803). He wrote that “the jury alone [is] to decide” on “the truth” of the defendant’s allegations of “justification, excuse, or alleviation,” though the judge could consider such defenses when deciding on bail.55Id.; see also id. (“And where a party is committed upon such a charge [of homicide], he may be brought up by habeas corpus before the court of [the King’s Bench], and if a clear case be laid before the court, whereby the homicide appears to be either justifiable or excusable, they will upon view of the depositions and commitment admit the party accused to bail, as in Mrs. Barney’s case . . . where the charge clearly appeared to be groundless.”). The McLeod case demonstrates that this current continued in the United States into the nineteenth century.56See supra notes 39–44 and accompanying text. In his 1872 Commentaries on the Law of Criminal Procedure, Joel Prentiss Bishop described how a defendant entering a plea of not guilty at arraignment formally “puts himself upon the country,” or submits to a trial by jury.57Joel Prentiss Bishop, Commentaries on the Law of Criminal Procedure; or, Pleading, Evidence, and Practice in Criminal Cases 487 (2d ed. 1872); see also Going to the Country, Black’s Law Dictionary (11th ed. 2019) (“The act of requesting a jury trial. A defendant was said to be ‘going to the country’ by concluding a pleading with the phrase ‘and of this he puts himself upon the country.’ ”); see also Francis Wharton, A Treatise on the Criminal Law of the United States § 530 (1874) (“In all cases of felony the prisoner shall be arraigned, and where any person on being so arraigned shall plead not guilty, every such person shall be deemed and taken to put himself upon the inquest or country for trial . . . .” (quoting criminal procedure rules in Pennsylvania)). The jury therefore remained the primary entity to decide disputed fact issues in criminal cases, including regarding self-defense.58Joel Prentiss Bishop, Commentaries on the Criminal Law § 735 (1868) (discussing how “inquiries concerning facts . . . must be passed upon by the jury”); Wharton, supra note 57, § 488 (describing how in a self-defense case, “[t]he jury must judge whether the danger was apparent”).

Pretrial processes, like the preliminary hearing and the grand jury, generally did not provide a defendant an opportunity to introduce evidence of any particular defense.59See James Manford Kerr & Francis Wharton, A Treatise on Criminal Procedure § 112 (10th ed. 1918) (“[N]or has the practice of taking the prisoner’s examination [at the preliminary magistrate’s review] been generally adopted.”); id. § 1288 (“The question before the grand jury being whether a bill is to be found, the general rule is that they should hear no other evidence but that adduced by the prosecution.”). Kerr and Wharton recognize limited exceptions “to avoid circuity and oppression,” such as if “the defendant, in a liquor prosecution, tenders a license.” Id. § 113. As the 1918 edition of Francis Wharton’s treatise on criminal procedure observed, “the better opinion is that on a preliminary hearing the magistrate is to hold the defendant for trial” when “there is made out a probable case of guilt.”60Id. § 114. Similarly, in a proceeding before the grand jury, “it is not the usage to introduce, in matters of confession and avoidance, witnesses for the defense, unless their testimony becomes incidentally necessary to the prosecution.”61Id. § 1288; see also id. § 1290 (“[A] grand jury has no authority by law to ignore a bill for murder on the ground of insanity, though it appear plainly from the testimony of witnesses, as examined by them on the part of the prosecution, that the accused was in fact insane . . . .”); see also Confession and Avoidance, Black’s Law Dictionary, supra note 57 (defining “confession and avoidance” to be “[a] plea in which a defendant admits allegations but pleads additional facts that deprive the admitted facts of an adverse legal effect”); Brooks v. Haslam, 4 P. 399, 399 (1884) (noting that self-defense “amounts simply to a plea in confession and avoidance”); Jordan v. State, 593 S.W.3d 340, 343 (Tex. Crim. App. 2020) (“Self-defense is a confession-and-avoidance defense requiring the defendant to admit to his otherwise illegal conduct.”).

The notion that self-defense could be adjudicated by a judge before trial thus has no basis in the common law tradition imported from England and implemented in America. The next Section shows how that basic understanding carried forward to modern times.

B.  Modern Procedure

In 1971, Indiana passed a statute providing that “[n]o person . . . shall be placed in legal jeopardy of any kind whatsoever” after exercising lawful self-defense.62Loza v. State, 325 N.E.2d 173, 176 (Ind. 1975) (quoting and discussing Ind. Code § 35-13-10-1 (repealed 1976)). Armed with that broad statutory language, one defendant sought a pretrial determination of the lawfulness of his claimed self-defense.63Id. In Loza v. State, Indiana’s highest court recognized the novelty of the proposition before reacting much like the New York court did more than a century earlier in McLeod.64See id. (“This statute has not been previously interpreted by our courts, and our research discloses no interpretation of any similar statute by any sister state.”); supra notes 39–44 and accompanying text (discussing People v. McLeod, 1 Hill 377 (N.Y. Sup. Ct. 1841)). In particular, in order “to prevent absurdity,” the court held that the new law “neither creates a new remedy nor does it alter our procedure in any respect.”65Loza, 325 N.E.2d at 176; see also Myers v. State, 137 N.E. 547, 548 (Ind. 1922) (noting that alleged facts surrounding claims of self-defense are “proper matters for the jury alone to consider and weigh”); Landreth v. State, 171 N.E. 192, 194 (Ind. 1930), overruled in part on other grounds by Burris v. State, 34 N.E.2d 928 (Ind. 1941) (“[T]he defense of self-defense is an ultimate fact solely for the determination of the jury from the evidence.”). In other words, self-defense remained a trial issue. The Loza court’s understanding was consistent with modern pretrial procedure.

Modern criminal procedure is heavily constitutional,66See William J. Stuntz, Substance, Process, and the Civil-Criminal Line, 7 J. Contemp. Legal Issues 1, 7 (1996) (“Only in criminal procedure does constitutional law dominate the field.”). and an overview of the minimalist pretrial constitutional requirements for defenses (like self-defense) is therefore instructive. Under the Fourth Amendment, police officers must have probable cause before making an arrest,67See generally Terry v. Ohio, 392 U.S. 1 (1968) (discussing when the probable cause requirement applies in police-citizen interactions). The probable cause standard is expressly referenced in the Fourth Amendment. U.S. Const. amend. IV (“The right of the people to be secure in their persons, houses, papers, and effects, against unreasonable searches and seizures, shall not be violated, and no Warrants shall issue, but upon probable cause, supported by Oath or affirmation, and particularly describing the place to be searched, and the persons or things to be seized.” (emphasis added)). and an impartial magistrate must review whether probable cause exists if the arrestee is to remain in custody.68See Gerstein v. Pugh, 420 U.S. 103, 114 (1975) (“[W]e hold that the Fourth Amendment requires a judicial determination of probable cause as a prerequisite to extended restraint of liberty following arrest.”). The Supreme Court has described probable cause as “a fluid concept” that “requires only a probability or substantial chance of criminal activity, not an actual showing of such activity.”69Illinois v. Gates, 462 U.S. 213, 232, 243–44 n.13 (1983). Admittedly, probable cause is “not a high bar.”70Kaley v. United States, 571 U.S. 320, 338 (2014).

Importantly, moreover, probable cause does not require robust consideration of self-defense, if it requires any at all. The Third Circuit has held that “affirmative legal defenses”—like self-defense—“are not a relevant consideration in [a police] officer’s determination of probable cause.”71Holman v. City of York, 564 F.3d 225, 229 (3d Cir. 2009). In contrast, the Second Circuit has held that “a police officer’s awareness of the facts supporting a defense can eliminate probable cause.”72Jocks v. Tavernier, 316 F.3d 128, 135 (2d Cir. 2003). That said, such evidence must be “conclusive” or first-hand,73See Ryan P. Sullivan, Revitalizing Fourth Amendment Protections: A True Totality of the Circumstances Test in § 1983 Probable Cause Determinations, 105 Iowa L. Rev. 687, 708–09 (2020) (discussing Jocks, 316 F.3d 128, and other relevant case law). and once an officer has probable cause to make an arrest, the officer does not constitutionally have “to investigate exculpatory defenses offered by the person being arrested or to assess the credibility of unverified claims of justification.”74Jocks, 316 F.3d at 135–36; see also Baker v. McCollan, 443 U.S. 137, 145–46 (1979) (observing that police officers do not have “to investigate independently every claim of innocence”); District of Columbia v. Wesby, 138 S. Ct. 577, 588 (2018) (“[P]robable cause does not require officers to rule out a suspect’s innocent explanation for suspicious facts.”). Self-defense is not singled out for special treatment, but rather is treated like any other defense.75Jocks, 316 F.3d at 135.

Subsequently, once a prosecutor makes a charging decision, there is “no federal constitutional right to any review” of that decision before trial “apart from the grand jury clause of the Fifth Amendment.”76Ronald Jay Allen, Joseph L. Hoffmann, Andrew D. Leipold, Debra Livingston & William J. Stuntz, Criminal Procedure: Adjudication and Right to Counsel 1037 (2011) (citing Gerstein v. Pugh, 420 U.S. 103, 119 (1975)). The grand jury, meanwhile, is also guided by the standard of whether there is “probable cause necessary to initiate a prosecution for a serious crime.”77Kaley v. United States, 571 U.S. 320, 328 (2014). In United States v. Williams,78United States v. Williams, 504 U.S. 36, 51 (1992). the Supreme Court held that, notwithstanding the constitutional obligation to disclose material exculpatory evidence to a defendant before trial,79Brady v. Maryland, 373 U.S. 83, 87 (1963) (“[T]he suppression by the prosecution of evidence favorable to an accused upon request violates due process where the evidence is material either to guilt or to punishment.”). the Constitution does not require prosecutors to disclose substantial exculpatory evidence to the grand jury, including regarding a potential claim of self-defense.80Williams, 504 U.S. at 51. Looking back to the common law history, the Court explained that the grand jury is “an accusatory [body],” not “an adjudicatory body,” and its task is “to assess whether there is adequate basis for bringing a criminal charge.”81Id. Historically, “it has always been thought sufficient for the grand jury to hear only the prosecutor’s side.”82Id. at 37.

In some jurisdictions, by either law or internal policy, prosecutors are held to a higher standard than the federal constitutional baseline with respect to grand juries.83Sara Sun Beale, William C. Bryson, James E. Felman & Katherine Earle Yanes, Prosecutor’s Duty to Present Exculpatory Evidence, in Grand Jury Law and Practice § 4:17 (2d ed. 2021) (“In approximately a quarter of the states, there are statutes or judicial decisions that require prosecutors to inform the grand jury of exculpatory evidence in some circumstances.”). However, most such departures only require presenting “evidence that is clearly exculpatory” or “that would exonerate the accused or lead the grand jury to refuse to indict.”84Id. The United States Justice Manual, for example, provides that “when a prosecutor conducting a grand jury inquiry is personally aware of substantial evidence that directly negates the guilt of a subject of the investigation, the prosecutor must present or otherwise disclose such evidence to the grand jury before seeking an indictment against such a person.” U.S. Dep’t of Just., Presentation of Exculpatory Evidence, in Department of Justice Manual § 9-11.233 (2021). That is a hard standard for a defendant to satisfy. Beale et al., supra note 83, § 4:17 (characterizing this “test” as “very difficult . . . to satisfy”). The Manual provides that “failure to follow the Department’s policy should not result in dismissal of an indictment,” but that “appellate courts may refer violations of the policy to the Office of Professional Responsibility for review.” U.S. Dep’t of Just., supra. Given the low bar for indictment—again, probable cause85Kaley v. United States, 571 U.S. 320, 328 (2014).—even these jurisdictions stop far short of adjudicating self-defense before trial.

The Federal Rules of Criminal Procedure, which “almost always reflect the basic position adopted in a substantial number of states,”86Wayne R. LaFave, Jerold H. Israel & Nancy J. King, Principles of Criminal Procedure: Post-Investigation 4 (2004). provide other pretrial procedural steps apart from the grand jury, most notably a preliminary hearing.87Fed. R. Crim. P. 5.1(e). Yet the preliminary hearing—consistent with historical practices88See supra notes 47, 58–60 and accompanying text (discussing the historical focus on prosecution evidence at preliminary hearings).—focuses on the prosecution’s evidence for the charged offense, and not evidence of self-defense or any other affirmative defense. Again, the standard is probable cause: the prosecutor need only show “probable cause to believe an offense has been committed and the defendant committed it.”89Fed. R. Crim. P. 5.1(e). Moreover, the prosecutor gets to decide whether to have a preliminary hearing at all: if the prosecutor secures an indictment before a grand jury, then the defendant has no right to demand a pretrial hearing.90The same is true in “most states and for most charges.” Marc L. Miller, Ronald F. Wright, Jenia I. Turner & Kay L. Levine, Criminal Procedures: Prosecution and Adjudication 188 (6th ed. 2019) (discussing preliminary hearings).

It thus has remained true under conventional criminal procedure that “[i]f a defendant claims innocence or has a defense,” including self-defense, “the proper body to decide the issue is the petit jury.”91Beale et al., supra note 83. Recent reform efforts, however, characterize self-defense not as a “defense” but as an “immunity,” calling to mind exceptions to the general rule—a category of traditional immunities and other matters that are adjudicated pretrial. The next Section addresses such pretrial issues in relation to self-defense.

C.  Immunities from Prosecution

Recent legislation declaring that self-defense is an immunity from prosecution has led judges and commentators to treat self-defense as a “true immunity” comparable to others.92Rogers v. Commonwealth, 285 S.W.3d 740, 753 (Ky. 2009) (“[T]he General Assembly has made unmistakably clear its intent to create a true immunity, not simply a defense to criminal charges.”). This classification invites questions about how other prosecutorial immunities operate, why they exist, and whether they share anything in common with self-defense.93In considering these questions, I build on Cynthia Ward’s observation that self-defense immunity “seems quite different” from traditional immunities. See Ward, supra note 22, at 134–35 (“Traditionally, immunity from prosecution is offered to certain government officials, or to citizens performing important roles in the legal process (such as witness in a criminal case), where it might reasonably be argued that society’s interests in protecting such roles and functions outweighs its interest in prosecuting the individual. That seems quite different from the immunity procedure outlined in Florida’s self-defense law.” (citation omitted)).

Common immunities from prosecution include diplomatic immunity, judicial immunity, legislative immunity, executive immunity, immunity after compelled testimony, and immunity bestowed on the basis of a plea agreement.94Since my focus is on immunities from criminal prosecution, I do not address the operation of immunities geared toward civil suits and liability such as sovereign and qualified immunity. See, e.g., State v. Velky, 821 A.2d 752, 759 (Conn. 2003) (“Sovereign immunity is not applicable in criminal cases, because, at least ordinarily, the charges are not brought ‘in effect’ against the government.”); Kipps v. Caillier, 197 F.3d 765, 768 (5th Cir. 1999) (“Public officials acting within the scope of their official duties are shielded from civil liability by the qualified immunity doctrine.” (emphasis added)); Temich v. Cossette, No. 11CV958, 2015 U.S. Dist. LEXIS 76064, at *6 (D. Conn. June 12, 2015) (“The defense of qualified immunity is not germane to a criminal proceeding.”). These are “defenses” in the sense that they are asserted by a defendant as a way to avoid a conviction. But their essence goes beyond ordinary defenses because immunities operate to exempt a person from the mandate of the criminal law, not to justify otherwise criminal conduct because of the circumstances surrounding that conduct.95Immunity, Black’s Law Dictionary supra note 57 (“Any exemption from a duty, liability, or service of process; esp., such an exemption granted to a public official or governmental unit. Cf. IMPUNITY.”). Black’s Law Dictionary cross-references “impunity” in its definition of “immunity,” which similarly denotes an “[e]xemption from punishment.”96Impunity, Black’s Law Dictionary, supra note 57. The example that Black’s uses to describe impunity relates to diplomatic immunity: “because she was a foreign diplomat, she was able to park illegally with impunity.”97Id. Immunity gets asserted early in the criminal process to head off the prosecution of someone possessing such an exemption.

As such, prosecutorial immunities are a remarkable departure from the ordinary criminal process described above; moreover, they are in tension with a basic, distinctive function of criminal law. Criminal law is traditionally viewed as a means to declare “a formal and solemn pronouncement of the moral condemnation of the community.”98Henry M. Hart, Jr., The Aims of the Criminal Law, 23 Law & Contemp. Probs. 401, 405 (1958) (describing distinctions between criminal and civil wrongs); see also Paul Robinson, Criminal Law 21 (1997) (discussing the criminal law’s role in “creating and maintaining the social consensus on morality necessary to sustain norms”). The community’s role in implementing the criminal law—through a public prosecution and jury trial—is intertwined with that function. It is no coincidence that the prosecutor in a criminal case is called “The People” in many jurisdictions.99See, e.g., Law Reporting Bureau of the State of N.Y., New York Law Reports Style Manual § 8.1(a) (2012), http://www.courts.state.ny.us/reporter/new_styman.htm [perma.cc/62GF-KVATJ] (“In criminal actions, the prosecuting authority is usually described as ‘The People of the State of New York.’ ”).

Prosecutorial immunity dilutes the formal power of the public in assessing an alleged crime, and it thus raises special concerns in criminal law that might exist only to a lesser extent in the civil context, where immunity is sometimes granted, for example, primarily to avoid costs.100See generally Alexandra B. Klass, Tort Experiments in the Laboratories of Democracy, 50 Wm. & Mary L. Rev. 1501 (2009) (describing a growing conferral of tort immunity without accompanying compensatory schemes); John C.P. Goldberg, The Constitutional Status of Tort Law: Due Process and the Right to a Law for the Redress of Wrongs, 115 Yale L.J. 524 (2005) (describing and critiquing widespread tort reform); see also Protection of Lawful Commerce in Arms Act, Pub. L. No. 109-92, 119 Stat. 2095 (codified at 15 U.S.C. § 7901 et seq. (2005)) (shielding federally licensed firearm manufacturers, dealers, and sellers from civil, but not criminal, actions “resulting from the criminal or unlawful misuse” of firearms). In the criminal context, immunities tend to be justified by a narrower, more compelling rationale. As a general matter, only when avoiding the criminal justice process is a defense’s entire raison d’être is it exempted from prosecution as an “immunity.” Put differently, the public policies underlying the above-mentioned criminal law immunities necessarily require the avoidance of prosecution and trial.

Consider diplomatic immunity. A key reason why we immunize conduct by foreign diplomats in the United States is to protect American diplomats outside the United States from exposure to foreign court systems.101See U.S. Dep’t of State Off. of Foreign Missions, Diplomatic and Consular Immunity: Guidance for Law Enforcement and Judicial Authorities 5 (2018) [hereinafter Diplomatic and Consular Immunity] (“On a practical level, a failure of the authorities of the United States to fully respect the immunities of foreign diplomatic and consular personnel may complicate diplomatic relations between the United States and the other country concerned. It may also lead to harsher treatment of U.S. personnel abroad, since the principle of reciprocity has, from the most ancient times, been integral to diplomatic and consular relations.”); William F. Marmon, Jr., Note, The Diplomatic Relations Act of 1978 and Its Consequences, 19 Va. J. Int’l L. 131, 134, 142 n.64 (1978) (“[I]t is to our advantage not to expose our personnel to [foreign] court systems.” (quoting the testimony of Hampton Davis during a Senate Foreign Relations Hearing)). There is no way to satisfy that goal through an affirmative defense at trial. Consistent with the purpose of diplomatic immunity, it also does not protect diplomats from sanction upon return to their home countries.102See Vienna Convention on Diplomatic Relations, Apr. 18, 1961, 23 U.S.T. 3227, 500 U.N.T.S. 95, at art. 31(4) (“The immunity of a diplomatic agent from the jurisdiction of the receiving State does not exempt him from the jurisdiction of the sending State.”). Judicial, legislative, and executive immunities are similarly geared to specific policy rationales necessitating avoidance of a trial. Each protects “governmental officials from personal liability arising from their official duties” because of the strong interest in facilitating their ability to serve the public.103Robinson, supra note 29, at 231. The Supreme Court has explained how legislative immunity enables “representatives to execute the functions of their office without fear of prosecutions.”104Tenney v. Brandhove, 341 U.S. 367, 374 (1951). An added component of legislative and judicial immunity is to preserve the balance of power between the three branches of government by insulating legislative and judicial officers from prosecutions by the executive branch.105James Walton McPhillips, Note, “Saturday Night’s Alright for Fighting”: Congressman William Jefferson, the Saturday Night Raid, and the Speech or Debate Clause, 42 Ga. L. Rev. 1085, 1093 (2008) (observing how legislative immunity insulates legislators from an “unfriendly executive”). Again, interests that these governmental immunities serve cannot be furthered—and indeed would be undermined—if they were treated as defenses to be proved at trial. The remarkable benefit of immunity is thus granted because of strong public policy arguments that inherently entail a bar to prosecution.

How does self-defense relate to immunities? Self-defense is not about trial avoidance but exculpation.106Robinson, supra note 29, at 220 (observing that justification defenses exculpate because “by the infliction of the intermediate harm or evil, a greater societal harm is avoided or benefit gained”). Like other justification defenses and unlike immunities, it can be adjudicated in the traditional way—through trial—without undermining its rationale.107Id. at 220. “The societal benefit underlying [immunities] arises not from [the defendant’s] conduct, but from foregoing his conviction.” Id. at 232. Moreover, unlike typical immunities, self-defense furthers interests that are in fact undermined by short-circuiting a prosecution and trial.

T. Markus Funk has identified seven values served by self-defense law: protecting the state’s monopoly on force, protecting the individual attacker’s right to life, maintaining the equal standing between people, protecting the defender’s autonomy, ensuring the primacy of the legal process, maintaining the legitimacy of the legal order, and deterring attackers.108T. Markus Funk, Rethinking Self-Defence: The ‘Ancient Right’s’ Rationale Disentangled 18 (2021). Immunity arguably advances the interests in protecting a defender’s autonomy or deterring attackers. But it runs roughshod over other values, especially self-defense law’s dual roles of ensuring the primacy of the legal process and maintaining the legitimacy of the legal order. Both roles underlie the idea that “the authority to punish and condemn” remain with “the liberal state,” not with individual citizens.109Id. at 44. In his discussion of ensuring the primacy of the legal process, Funk notes that “[t]o the extent possible, . . . the justice system must promote the resolution of disputes in the courts.”110Id. at 43. Immunity, however, dilutes the state’s oversight of defensive violence and, perhaps worse still, undermines the community’s role through the jury to assess the lawfulness of violence—a point addressed in greater depth in Part III. In other words, in contrast to typical immunities, whose purposes areoverall advanced by providing an exemption from prosecution, key values underlying self-defense law are undercut by providing such an exemption.

Immunities, of course, are not the only matters that receive pretrial resolution. Some defenses—like those based on statutes of limitations, double jeopardy, and speedy trial requirements—are also adjudicated in advance of trial. Other issues, like competency to stand trial, also receive pretrial determination. In the effort to implement self-defense immunity, some have analogized self-defense to those other pretrial issues even though they are not technically “immunities.”111See, e.g., People v. Guenther, 740 P.2d 971, 977 (Colo. 1987) (en banc) (analogizing self-defense immunity to prosecutorial bars based on the statute of limitations, double jeopardy, and speedy trial requirements); Rogers v. Commonwealth, 285 S.W.3d 740, 755 (Ky. 2009) (comparing self-defense immunity hearings to competency hearings). Yet these issues, like traditional immunities, protect interests that necessarily call for avoiding trial and thus are dissimilar to self-defense. Statutes of limitations affirm the belief that “[a]fter a period of time, a person ought to be allowed to live without fear of prosecution.”112Model Penal Code § 1.07, cmt. at 16–17 (Tentative Draft No. 5, 1956); see also Toussie v. United States, 397 U.S. 112, 114–15 (1970) (observing that a limitations period “is designed to protect individuals from having to defend themselves against charges when the basic facts may have become obscured by the passage of time and to minimize the danger of official punishment because of acts in the far-distant past”). Double jeopardy protections are “designed to protect an individual from being subjected to the hazards of trial and possible conviction more than once for an alleged offense.”113Green v. United States, 355 U.S. 184, 187 (1957). Speedy trial guarantees mandate “the Government [to] move with the dispatch that is appropriate to assure [the defendant] an early and proper disposition of the charges against him.”114United States v. Marion, 404 U.S. 307, 313 (1971). And resolving competency questions must also happen before a trial since the entire point is to determine the defendant’s “ability to participate meaningfully in the trial.”115Rogers, 285 S.W.3d at 755.

In connection with competency hearings, one exception to the general rule of limiting pretrial criminal matters to those that inherently require pretrial determination involves the insanity defense. Courts tend to draw a clear line between the question of competency to stand trial, which is adjudicated in advance of trial, and insanity at the time of the offense, which is a trial issue.116See, e.g., Bishop v. Superior Court ex rel. County of Pima, 724 P.2d 23, 25–26 (Ariz. 1986) (en banc) (stating that competency and the insanity defense “are distinctly different inquiries, one leading to a determination of whether the trial can proceed at all, and the other to the trial defense of insanity”); Ricks v. State, 242 S.E.2d 604, 606 (Ga. 1978) (“The issue of the accused’s insanity at the time of the alleged crime is a question for the trial jury. The issue of the accused’s competency to stand trial is a question for a special jury upon a special plea of insanity.”). As a general matter, therefore, an insanity defense is submitted to the fact finder at trial and is not decided at a pretrial hearing.117See, e.g., Tenn. Code. Ann. § 39-11-501 (2014) (providing the defense of insanity is “a matter for the trier of fact alone”); Wis. Stat. § 971.165 (2008) (requiring a continuous, bifurcated trial for the insanity defense); State v. Fichera, 903 A.2d 1030, 1035 (N.H. 2006) (“[S]anity is a question of fact to be determined by the jury . . . .” (quoting State v. Hall, 808 A.2d 55 (N.H. 2002))); State ex rel. Smith v. Scott, 280 S.E.2d 811, 814 (W. Va. 1981) (“Consequently, we hold that a trial court judge is not under any duty to hold a hearing on the issue of criminal responsibility in advance of trial regardless of how compelling the pretrial reports may be. Criminal responsibility is a jury question . . . unless both prosecutor and judge concur that the outcome of the proceedings would be a foregone conclusion.”); Bonner v. State, 520 S.W.2d 901, 906 n.2 (Tex. Crim. App. 1975) (“The issue of insanity at the time of the commission of an offense is a defensive one, and therefore is properly raised during the course of the trial on the merits.”); People v. Ford, 235 N.E.2d 576, 578 (Ill. 1968) (“The defense of insanity at the time of the crime, like any other defense, must be raised at the time of trial and submitted to the jury who are hearing the case, and no special jury is called or pretrial hearing conducted to determine this question.”). However, such bifurcation is not universally followed. Pennsylvania law, for example, grants a judge the discretion to “hear evidence on whether the person was criminally responsible for the commission of the crime charged” so long as the judge is already conducting a competency hearing.11850 Pa. Cons. Stat. § 7404(a) (2014); see also Commonwealth v. Scott, 578 A.2d 933, 936–37 (Pa. Super. Ct. 1990) (describing procedure).

In that context, judicial economy might weigh in favor of considering evidence of both competency and insanity at a pretrial hearing. At least one other state—North Carolina—gives courts discretion to hold a pretrial insanity hearing so long as the state consents.119N.C. Gen. Stat. § 15A-959 (1973) (“Upon motion of the defendant and with the consent of the State the court may conduct a hearing prior to the trial with regard to the defense of insanity at the time of the offense.”). That exception is highly limited in that courts and prosecutors can override a defendant’s request for a hearing, making it quite different from self-defense immunity.120See infra Part II. And in Washington, a defendant may request a pretrial insanity determination, but the statute notes that any acquittal under the statute cannot be used to contest mental health detention—a possibility that distinguishes insanity and self-defense.121Wash. Rev. Code § 10.77.080 (1998) (“The defendant may move the court for a judgment of acquittal on the grounds of insanity: PROVIDED, That a defendant so acquitted may not later contest the validity of his or her detention on the grounds that he or she did not commit the acts charged.”); see also Christopher Slobogin, The Guilty but Mentally Ill Verdict: An Idea Whose Time Should Not Have Come, 53 Geo. Wash. L. Rev. 494 (1985) (discussing “not guilty but mentally ill” verdicts, by which a defendant is still incarcerated for treatment despite being found not guilty by reason of insanity).

This Section has set out the limited nature of criminal law immunities and other pretrial matters and offered a normative explanation, rooted in the criminal law’s distinctive role, for that narrow scope. Below, the Article considers additional arguments for and against expanding immunities to include self-defense.122See infra Section II.C, Part III. First, however, the Article turns to the story of how self-defense immunity arose in the first place.

II.  THE PUSH TO MAKE SELF-DEFENSE EXCEPTIONAL

In light of the American criminal law tradition of adjudicating self-defense at trial, how did self-defense immunity arise? This Part shows how self-defense immunity emerged out of Colorado in 1986, laid dormant for almost two decades, and then became a central component of gun rights advocacy in the 2000s. The Part then analyzes the thin rationales put forward for treating self-defense as deserving of exceptional treatment through prosecutorial immunity.

A.  Inauspicious Beginning in Colorado

Accounts of recent self-defense reforms tend to begin with Florida’s 2005 Stand Your Ground legislation.123See, e.g., Elizabeth Chuck, Florida Had First Stand Your Ground Law, Other States Followed in “Rapid Succession,” NBC News (July 18, 2013, 7:03 AM), https://www.nbcnews.com/news/us-news/forida-had-first-stand-your-ground-law-other-states-followed-flna6c10672364 [perma.cc/QX22-DB36]. Indeed, Florida’s law served as a model that influenced legal changes across the country.124See infra notes 162–69 and accompanying text (describing the influence of Florida’s self-defense reform). But the first example of a self-defense immunity statute was not Florida’s but rather a last-minute compromise bill from Colorado twenty years earlier.125See Colo. Rev. Stat. § 18-1-704.5 (1986); Dirk Johnson, “Make My Day”: More Than a Threat, N.Y. Times (June 1, 1990) (noting that “[n]o other state [was] believed to have such a law” providing immunity from criminal prosecution for lawful self-defense).

The Colorado law did not, at first, provide for prosecutorial immunity. Rather, the bill initially added a legal presumption to self-defense law to enhance the scope of lawful self-defense against home intruders.126William Wilbanks, The Make My Day Law: Colorado’s Experiment in Home Protection 31 (1990). To be sure, homeowners already had an expanded right to self-defense through the “Castle Doctrine,” which generally removed a person’s duty to retreat before using lethal defensive force in the home.127See Blackstone, supra note 52, at *223 (“[T]he law of England has so particular and tender a regard to the immunity of a man’s house, that it stiles it his castle, and will never suffer it to be violated with impunity.”); 1 Matthew Hale, The History of the Pleas of the Crown 486 (1680) (writing that when a man is assailed in his own house he “need not fl[y] as far as he can, as in other cases of se defendendo, for he hath the protection of his house to excuse him from flying, for that would be to give up the protection of his house to his adversary by flight”). All American jurisdictions accept some version of the Castle Doctrine. Sanford H. Kadish, Stephen J. Schulhofer & Rachel E. Barkow, Criminal Law and Its Processes: Cases and Materials 924 (2017); see also People v. Tomlins, 107 N.E 496 (N.Y. 1914) (“It is not now and never has been the law that a man assailed in his own dwelling is bound to retreat. If assailed there, he may stand his ground and resist the attack.”). However, Colorado policymakers wanted to do more, so they borrowed from a California statute that a person confronting a home intruder is legally “presumed” to fear for their life.128See Cal. Penal Code § 198.5 (1984) (“Any person using force intended or likely to cause death or great bodily injury within his or her residence shall be presumed to have held a reasonable fear of imminent peril of death or great bodily injury to self, family, or a member of the household when that force is used against another person, not a member of the family or household, who unlawfully and forcibly enters or has unlawfully and forcibly entered the residence and the person using the force knew or had reason to believe that an unlawful and forcible entry occurred.”). That presumption would satisfy one requirement of lethal defensive force—that the defender reasonably perceives a threat of death or serious bodily injury129See Colo. Rev. Stat. § 18-1-704(2)(a) (“Deadly physical force may be used only if a person reasonably believes a lesser degree of force is inadequate and . . . [t]he actor has reasonable ground to believe, and does believe, that he or another person is in imminent danger of being killed or of receiving great bodily injury.”).—thereby relieving the defendant of the need to produce evidence of such heightened fear.

Prosecutors objected because they “believed that it would be very difficult, if not impossible, to rebut the presumption in favor of the homeowner.”130Wilbanks, supra note 126, at 42. The presumption would result in a helpful jury instruction for the defendant and could help a defendant avoid taking the stand to demonstrate a fear of death or great bodily injury. The presumption would not shift the burden of proof, however, since the prosecution already had to disprove self-defense beyond a reasonable doubt. See Martin v. Ohio, 480 U.S. 228, 236 (1987) (“[A]ll but two of the States, Ohio and South Carolina, have abandoned the common-law rule and require the prosecution to prove the absence of self-defense when it is properly raised by the defendant.”). There was little public debate regarding the subsequent compromise that became the nation’s first law providing immunity from prosecution for self-defense.131Wilbanks, supra note 126, at 38 (noting the compromise negotiations were “held behind closed doors” and “were unannounced . . . and lacked formality”). Yet the law appears to have imported a civil immunity provision enacted in Colorado in 1982 into the criminal law.132See infra notes 133–35 and accompanying text (describing Colorado’s civil immunity law).

By way of background, in 1981, a Colorado jury awarded a plaintiff more than $300,000 in damages from a defendant for gunshot injuries incurred while the plaintiff was burglarizing the defendant’s shop.133Wilbanks, supra note 126, at 21–23. The public outcry was swift and the shop owner’s lawyer helped to draft a bill immunizing people like his client from civil damages.134Id. at 23. The resulting law barred payouts for personal injuries “sustained during the commission of or during immediate flight from” a felony if the person inflicting the injury reasonably believed that physical force was “reasonable and appropriate” to prevent both injury and the commission of the felony.135Id. at 24. The wisdom of such civil immunity is beyond the scope of this Article; more important for present purposes is that it did not address immunity from criminal liability. As discussed above, criminal liability is geared toward vindicating public harms in a way that civil liability is not.136See supra notes 92–98 and accompanying text. The Colorado shop owner case demonstrates another distinction between civil and criminal cases in that the prosecutor declined to prosecute. Wilbanks, supra note 126, at 22. In criminal cases, a prosecutor with legal experience weighs the viability of a case before pressing charges and then must prove the case beyond a reasonable doubt. In re Winship, 397 U.S. 358, 364 (1970) (“[T]he Due Process Clause protects the accused against conviction except upon proof beyond a reasonable doubt of every fact necessary to constitute the crime with which he is charged.”). In contrast, civil plaintiffs are frequently not lawyers (even if they have representation by one) and face a lesser burden of proof: they have to prove their case by a preponderance of the evidence, not beyond a reasonable doubt. See Addington v. Texas, 441 U.S. 418, 423–24 (1979) (observing that in “the typical civil case involving a monetary dispute between private parties[, s]ince society has a minimal concern with the outcome of such private suits, plaintiff’s burden of proof is a mere preponderance of the evidence,” whereas “[i]n a criminal case, on the other hand, the interests of the defendant are of such magnitude that . . . the state [must] prove the guilt of an accused beyond a reasonable doubt”). This presents a risk of over-litigation in the civil context that is generally absent from the criminal context. Nonetheless, the criminal immunity bill that later passed in Colorado in 1986 mirrored the earlier civil immunity law. The law provided that a person “shall be immune from criminal prosecution” if the person used defensive force and four conditions were met relating to an unlawful home intrusion.137Colo. Rev. Code § 18-1-704.5(3) (1986) (emphasis added). The four conditions were that (1) the defendant was an “occupant of a dwelling”; (2) another person “made an unlawful entry into the dwelling”; (3) “the occupant ha[d] a reasonable belief that such other person . . . committed a crime in the dwelling in addition to the uninvited entry, or [wa]s committing or intend[ed] to commit a crime against a person or property in addition to the uninvited entry”; and (4) “the occupant reasonably believe[d] that such other person might use any physical force, no matter how slight, against any occupant.” Id.

The 1986 law’s legislative sponsors and the negotiating prosecutors appeared to have different beliefs about what the new law actually accomplished. The sponsors appreciated that they had achieved “greater protection [for defendants] than a presumption for the homeowner as part of an affirmative defense at trial.”138Wilbanks, supra note 126, at 46. The negotiating prosecutors, in contrast, believed that they gave up nothing. Denver’s district attorney, for example, publicly commented that the “compromise is just a clarification of existing law.”139Id. at 45 (quoting Norman Early).

In that vein, some prosecutors tried to argue in subsequent litigation that the new provision could not possibly grant true immunity for self-defense.140People v. Guenther, 740 P.2d 971, 975 (Colo. 1987). Among other things, they pointed out that the provision appears alongside other affirmative defenses in Colorado’s criminal code.141Id. When the issue reached the Colorado Supreme Court, however, the justices rejected the prosecutors’ interpretation that self-defense remained an ordinary defense to be proved at trial, noting that “[i]t must be presumed that the legislature has knowledge of the legal import of the words it uses.”142Id. at 976. The plain meaning of “shall be immune from criminal prosecution” in the statute, they concluded, was “to bar criminal proceedings against a person for the use of force under the circumstances set forth” in the law.143Id. at 975. In the course of reaching that holding, the justices acknowledged what went unsaid during the legislative hearings: that “the immunity created by [the law] is an extraordinary protection which, so far as we know, has no analogue in Colorado statutory or decisional law.”144Id. at 980. In fact, immunity for self-defense in criminal cases does not appear to have existed anywhere else in the country.145See Johnson, supra note 125 (“No other state is believed to have such a law.”).

Perhaps because of its unusualness, or because it was an eleventh-hour deal seemingly unrooted in any principle other than compromise, Colorado’s self-defense immunity law was not immediately enacted elsewhere. In 1987, for example, Oklahoma’s governor vetoed legislation similar to Colorado’s, which subsequently passed after the immunity provision was removed.146Wilbanks, supra note 126, at 50–51. Nonetheless, Colorado’s immunity provision was on the books, providing a template for future efforts.

B.  Auspicious Effort by Gun Rights Advocates

The Colorado self-defense immunity law was not instituted at the behest of gun rights advocates or other lobbyists, but rather, it arose as a compromise with prosecutors after a locally elected leader perceived a need for expanding self-defense protections against home intruders.147Id. at 31 (“Rep. Armstrong says that the idea and initiative for the original bill was her own as she did not contact any lobbyists (the Colorado District Attorneys Council, the National Rifle Association, homeowners associations) to seek help in drafting the initial bill.”). In more recent times, however, gun rights advocates and the NRA in particular have led a campaign to expand not only the right to have and carry guns but also to brandish and shoot them when gun owners feel threatened.148For accounts of the NRA’s recent focus on self-defense law, and especially Stand Your Ground, see Mary Anne Franks, The Cult of the Constitution 85 (2019) and Caroline E. Light, Stand Your Ground: A History of America’s Love Affair with Lethal Self-Defense 161–62 (2017). Most public attention to this campaign has centered around Stand Your Ground, but looking closely at testimony and commentary reveals a deeper ambition: immunizing defensive gun use from prosecution.

The parallels between the NRA’s lobbying for gun rights and its lobbying for self-defense immunity is striking. Gun rights advocates frequently claim that the right to keep and bear arms is being disrespected in the courts and therefore that the Second Amendment needs more protection.149Joseph Blocher and I explore that rhetorical move in Eric Ruben & Joseph Blocher, “Second-Class” Rhetoric, Ideology, and Doctrinal Change, 110 Geo. L.J. 613, 613 (2022); see also Joseph Blocher & Eric Ruben, No, Courts Don’t Treat the Second Amendment as a ‘Second-Class Right,’ Wash. Post (Nov. 17, 2021, 6:00 AM), https://www.washingtonpost.com/outlook/2021/11/17/no-courts-dont-treat-second-amendment-second-class-right [https://perma.cc/S9QU-UHDD] (discussing “allegations of widespread mistreatment” of the right to keep and bear arms). The claim with self-defense is similar: as one gun rights advocate put it, self-defenders are “victimized . . . in court.”150Self-Defense Amendments: Hearing on H.B. 227 Before the Senate Natural Resources, Agriculture, and Environment Comm., 2021 Gen. Sess., at 35:30 (Utah 2021), https://le.utah.gov/
av/committeeArchive.jsp?timelineID=182900 [https://perma.cc/8K2K-MH86] (statement of Clark Aposhian).
The executive director of the NRA’s Institute for Legislative Action lamented that “people who defend themselves are more likely to be charged with crimes and, as the old sayings go, be forced to ‘tell it to the judge’ and ‘let the jury sort it out.’ ”151Chris W. Cox, “Castle Doctrine” Legislation: Protecting Your Right to Protect
Yourself, NRA-ILA (Apr. 1, 2012), https://www.nraila.org/articles/20120401/castle-doctrine-legislation-protecting-your-right-to-protect-yourself [https://perma.cc/7M2Q-PW9V].
That creates a problem, he explained, because “a murder trial puts the defendant at risk of a long prison sentence—or worse.”152Id. The NRA lobbyist most directly involved with Florida’s landmark Stand Your Ground bill in 2005 was likewise moved by this notion.153Mike Spies, The N.R.A. Lobbyist Behind Florida’s Pro-Gun Policies, New Yorker (Feb.
23, 2018), https://www.newyorker.com/magazine/2018/03/05/the-nra-lobbyist-behind-floridas-pro-gun-policies [https://perma.cc/ND4Z-RRE2] (describing Marion Hammer’s role in the enactment of Florida’s 2005 law and subsequent amendments).
A basic problem, in her view, was that people were “being arrested” and “prosecuted . . . for exercising self-defense that was lawful.”154Id. (quoting Marion Hammer).

An answer to that feeling of disregard for self-defense was to transform it from an affirmative defense to an immunity. The NRA devised a self-defense immunity law155Id. and found legislative sponsors in Florida who agreed with the complaint that, as one put it, “law-abiding citizens” who “protect themselves [are] in a posture that they have to defend themselves from their own government.”156Talk of the Nation, Opinion, Why I Wrote “Stand Your Ground” Law, NPR (Mar. 26, 2012, 1:00 PM), https://www.npr.org/2012/03/26/149404276/op-ed-why-i-wrote-stand-your-ground-law [https://
perma.cc/AKP6-KD45] (interview of State Rep. Dennis Baxley (R-Fla.)).
The measure passed in 2005 and went even further than Colorado’s, extending prosecutorial immunity to all self-defense—not just self-defense in the context of home invasions.157See id. In particular, the law provided that someone using lawful self-defense is “immune from criminal prosecution,” with “criminal prosecution” defined to “include[] arresting, detaining in custody, and charging or prosecuting the defendant.”158See Fla. Stat. § 776.032 (2005). The self-defense immunity provision adopted in Florida in 2005 is as follows:

Immunity from criminal prosecution and civil action for justifiable use of force.—

(1) A person who uses force as permitted in s. 776.012, s. 776.013, or s. 776.031 is justified in using such force and is immune from criminal prosecution and civil action for the use of such force, unless the person against whom force was used is a law enforcement officer, as defined in s. 943.10(14), who was acting in the performance of his or her official duties and the officer identified himself or herself in accordance with any applicable law or the person using force knew or reasonably should have known that the person was a law enforcement officer. As used in this subsection, the term “criminal prosecution” includes arresting, detaining in custody, and charging or prosecuting the defendant.

(2) A law enforcement agency may use standard procedures for investigating the use of force as described in subsection (1), but the agency may not arrest the person for using force unless it determines that there is probable cause that the force that was used was unlawful.

(3) The court shall award reasonable attorney’s fees, court costs, compensation for loss of income, and all expenses incurred by the defendant in defense of any civil action brought by a plaintiff if the court finds that the defendant is immune from prosecution as provided in subsection (1).

Id. The law, formally called, “An act relating to the protection of persons and property,” 2005 Fla. Laws 199, also enacted Stand Your Ground, 2005 Fla. Laws 202, and two presumptions making it easier to defend deadly defensive force in a person’s home and cars, see id. (creating Fla. Stat. § 776.013(1), (4)).

After some Florida judges placed the burden on the defendant to prove self-defense at a pretrial hearing, legislators stepped in to strengthen the immunity provision by clarifying that the burden of proof is on the prosecutor to disprove self-defense before trial by clear and convincing evidence.159See Love v. State, 286 So. 3d 177, 180 (Fla. 2019) (recounting the history of the burden shift for self-defense immunity in Florida). That standard is much higher than the probable cause standard that prosecutors must satisfy to indict, which, as discussed above, is the primary focus of traditional and modern pretrial screening.160See supra Section I.B (discussing pretrial screening and the probable cause standard). And there have been efforts to increase the burden even more, such as by requiring the prosecutor to disprove self-defense beyond a reasonable doubt—the same burden borne by the prosecutor at trial.161See Lizette Alvarez, Florida Poised to Strengthen ‘Stand Your Ground’ Defense, N.Y. Times (Mar. 15, 2017), https://www.nytimes.com/2017/03/15/us/stand-your-ground-florida.html [https://
perma.cc/HF98-Z8EM] (describing effort to increase the burden for disproving self-defense at immunity hearings to the beyond-a-reasonable-doubt standard).

Unlike Colorado’s law, which failed to attract buy-in elsewhere, Florida’s law was aggressively promoted by the NRA and the conservative American Legislative Exchange Council (“ALEC”),162See NRA Presents ALEC Model Legislation in Grapevine, Texas, NRA Inst. Legis. Action (Aug. 12, 2005), https://www.prwatch.org/files/NRA_2005.png [https://perma.cc/MK8P-8AAY] (“At the recent Annual Meeting of the American Legislative Exchange Council (ALEC) in Grapevine, TX, Marion Hammer presented the ALEC Criminal Justice Task Force with proposed model legislation based on Florida’s landmark “Castle Doctrine” law, that passed in Florida earlier this year.”); Press Release, ALEC Statement on “Stand Your Ground” Legislation (Mar. 26, 2012), https://www.alec.org/
press-release/alec-statement-on-stand-your-ground-legislation-32612 [https://perma.cc/T8Q2-8X58] (“Florida’s ‘Stand Your Ground’ law was the basis for the American Legislative Exchange Council’s model legislation, not the other way around.”)
which described the need to “[p]rotect[] citizens from prosecution or liability if they use a firearm in self defense [sic] inside or outside their homes.”163See, e.g., ALEC, 2007 Legislative Scorecard, http://www.alec.org/am/pdf/2007
alecscorecard.pdf [https://web.archive.org/web/20081106044025/http://www.alec.org/am/pdf/2007
alecscorecard.pdf].
Similar laws were introduced in states across the country,164Id. (tracking where ALEC model legislation had been successfully introduced or enacted); see also Adam Weinstein, How the NRA and Its Allies Helped Spread a Radical Gun Law Nationwide, Mother Jones (June 7, 2012), https://www.motherjones.com/politics/2012/06/nra-alec-stand-your-ground [https://perma.cc/34DP-N7NG]. and the NRA-promoted sentiment that civilians asserting self-defense should have a path to immunity was frequently invoked. When legislators debated Iowa’s self-defense law, one objected that a person must “spend eternity in prison trying to defend themselves” after being put “in that untenable situation where they have to make that snap decision and defend themselves or another from an aggressor.”165Iowa House of Representatives Floor Debate on HF 517 During the 87th General Assembly, Iowa Legislature, at 1:15:45 PM (Mar. 7, 2017), https://www.legis.iowa.gov/perma/093020194217 [https://web.archive.org/web/20230421065522/https://www.legis.iowa.gov/dashboard?view=video&chamber=H&clip=H20170307124009459&dt=2017-03-07&offset=1793&bill=HF%20517] (statement of Rep. Matt Windschitl); see also id. at 1:52:00 PM (“We want to make absolutely certain that, if someone ever does find themselves in that situation where they’ve used Stand Your Ground or not retreated, that we provide to them the protections from criminal and civil actions against them.”). In Ohio, a legislative witness inveighed that “[t]he mere fact of acting justly in self-defense should not result in dragging folks who used defensive force in accordance with Ohio law through the mud, costing them valuable time and resources.”166Memorandum of Support for Senate Bill 215 from Ohio Gun Owners to the Ohio Senate Veterans and Public Safety Committee (Oct. 5, 2021) (statement of Rob Knisley, Ohio Gun Owners). In South Carolina, a self-defense bill’s sponsor argued that “the State should have to prove you did something wrong before they can send you to jail” to await trial in homicide cases.167WCBD News 2, Stand Your Ground in South Carolina, YouTube (May 19, 2016), https://www.youtube.com/watch?v=RptJ8dKVWJg [https://perma.cc/HQ7R-2MCQ] (interviewing House Rep. Greg Delleney, Jr., regarding H. 4703). And in Utah, an advocate complained that people should not have to “go through the crucible of a self-defense trial.”168Self-Defense Amendments: Hearing on H.B. 227 Before the Senate Natural Resources, Agriculture, and Environment Comm., 2021 Gen. Sess. (Utah 2021) [hereinafter Hearing on H.B.
227], https://le.utah.gov/av/committeeArchive.jsp?timelineID=182900 [https://perma.cc/8K2K-MH86] (statement of Mitch Vilos).
Ultimately, after the passage of Florida’s law, more than twenty other states passed some sort of self-defense reform, such as Stand Your Ground,169See The Effects of Stand Your Ground Laws, Rand Corp. (Apr. 22, 2020), https://
http://www.rand.org/research/gun-policy/analysis/stand-your-ground.html#fn3 [https://perma.cc/TA4X-5R64] (counting twenty-four states that passed self-defense reform in the decade after Florida’s 2005 enactment).
with at least thirteen enacting self-defense immunity.170See Ala. Code § 13A-3-23(d) (2016), Colo. Rev. Stat. § 18-1-704.5(3) (1985), Fla. Stat. § 776.032 (2005), Ga. Code Ann. § 16-3-24.2 (2014), Kan. Stat. Ann. § 21-5231 (2011), Ky. Rev. Stat. Ann. § 503.085 (West 2006), Okla. Stat. tit. 21 § 1289.25(F) (2018), S.C. Code Ann. § 16-11-450 (2006), Mich. Comp. Laws § 780.961(1) (2006), Idaho Code § 19-202A(1) (2018); Utah Code Ann. § 76-2-309 (2021), S.D. Codified Laws § 22-18-4.8 (2021), Iowa Code § 704.13 (2017), N.C. Gen. Stat. § 14-51.3 (2011).

But the fact that people who lawfully defend themselves are sometimes prosecuted and forced to argue self-defense is unexceptional. It is a truism that self-defense sometimes exculpates—that is precisely why it is an available defense to criminal charges. Singling out self-defense for special treatment as an immunity should have a compelling rationale similar to the ones that justify other prosecutorial immunities. The next Section searches for such a rationale in the legislative debates and commentary.

C.  Searching for a Rationale

A common assertion among advocates for self-defense immunity is that awaiting trial is “not giving the right to self-defense the consideration it deserves.”171Hearing on H.B. 227, supra note 168, at 8:40 (statement of Mitch Vilos). But why not? After all, awaiting trial is the traditional process and the one afforded other defenses. In his systematic analysis, Paul Robinson identifies dozens of other affirmative defenses that bar conviction.172Robinson, supra note 29, at 203 n.7. What is the basis for treating self-defense differently than these other defenses? Though legislative debates offer no consistent rationale, four can be teased out: restoring procedural protections for self-defense lost to history, stopping politically motivated prosecutions of self-defenders, vindicating the notion that self-defense is a “natural right,” and reducing defense costs for gun owners. None of these is as strong as the rationale for traditional immunities—an inherent need for pretrial adjudication.173See supra Section I.C (discussing traditional prosecutorial immunities). Moreover, each is unpersuasive on its own terms.

Some advocates argue that prosecutorial immunity restores self-defense to an exalted place from a bygone era. In Florida, for example, a witness testified that making the prosecutor disprove self-defense before trial “recover[s] a right that we as citizens lost to defend ourselves from criminals.”174Mark Obbie, The Politician Who Brought America ‘Stand Your Ground’ Is Pushing to Make Self-Defense Claims More Bulletproof, Trace (Sept. 27, 2015), https://www.thetrace.org/2015/09/stand-your-ground-florida-bill-baxley [https://perma.cc/2HEH-HTVM] (quoting testimony of Eric Friday). In Utah, a witness testified that “Utah used to have a robust preliminary hearing procedure” as it relates to self-defense, and that immunity “restores some much-needed balance.”175Self-Defense Amendments: Hearing on H.B. 227 Before the Senate Natural Resources, Agriculture, and Environment Comm., 2021 Gen. Sess., at 2:37 (Utah 2021) (testimony of Mark Moffatt), https://le.utah.gov/av/committeeArchive.jsp?timelineID=182900 [https://perma.cc/8K2K-MH86].

A related move has been to couple self-defense immunity with Stand Your Ground and then defend both on the basis of Stand Your Ground history. For example, the NRA has said that Stand Your Ground laws, such as Florida’s (which includes an immunity provision), “focus on the narrow issue of whether and to what extent a person who would otherwise have a right to self-defense forfeits that right by not first attempting to flee the confrontation.”176Stand Your Ground, NRA Inst. Legis. Action (Feb. 1, 2014), https://www.nraila.org/
articles/20140201/stand-your-ground [https://perma.cc/WKE5-VCKB].
With omnibus bills like Florida’s so purportedly reduced, the NRA then asserted that removing the duty to retreat has “a pedigree in American law dating back over 150 years.”177Id. Other advocates have similarly ignored everything in recent self-defense legislation other than Stand Your Ground and then defended the entirety on the basis of Stand Your Ground history.178A legal scholar with the Cato Institute, which also supports Florida-style self-defense laws, similarly downplayed their ambition. As he put it, “[Stand Your Ground] laws are a tremendously misunderstood aspect of the debate over firearms regulation and criminal-justice reform” because “[a]ll they do is allow people to assert their right to self-defense in certain circumstances without having a so-called ‘duty to retreat.’ ” Ilya Shapiro, Testimony Before the U.S. Senate Judiciary Committee’s Subcommittee on the Constitution, Civil Rights, and Human Rights: Hearing on “‘Stand Your Ground’ Laws: Civil Rights and Public Safety Implications of the Expanded Use of Deadly Force” 1 (Oct. 29, 2013), https://www.cato.org/sites/cato.org/files/pubs/pdf/syg_senate_
testimony_-_shapiro_with_attachments.pdf [https://perma.cc/NVT7-T2Y7]; see also id. (arguing that “there’s nothing particularly novel” about Stand Your Ground laws). The misdirection might be unwittingly assisted by opponents of immunity legislation who adopt a similar Stand Your Ground framing. See, e.g., ABA Task Force, supra note 17.

Nostalgia is a staple of gun rights advocacy,179See Ruben & Blocher, supra note 149, at 632 (describing rhetorical appeals to an imagined past in gun rights advocacy). so it is unsurprising to see appeals to history when it comes to self-defense immunity. Yet, as shown in Section I.A, there is no basis in Anglo-American legal tradition for immunizing private defensive violence. Treating self-defense as exceptional through immunity is a thoroughly modern innovation.

An alternative rationale is that people exercising lawful self-defense are targeted for “political” prosecutions.180See, e.g., Tucker Carlson, Kyle Rittenhouse’s Trial Is the Most Bizarre Court Proceeding Ever Caught on Camera, Fox News (Nov. 10, 2021), https://www.foxnews.com/opinion/tucker-carlson-kyle-rittenhouse-trial [https://perma.cc/9N8K-GCRX] (saying Kyle Rittenhouse’s prosecutor “didn’t want to know what happened that night” and was “under enormous political pressure” to “declare Kyle Rittenhouse a murderer”). Indeed, it has become an article of faith on the political right that people exercising self-defense with firearms are targeted for political prosecutions. See, e.g., Kyle’s Law, supra note 13 (“Too often, rogue prosecutors bring felony criminal charges against people who were clearly doing nothing more than defending themselves, their families, or others from violent criminal attack.”). Prosecutors have vigorously rejected that narrative, and advocates for immunizing self-defense have failed to offer convincing evidence of political prosecutions, let alone the sort of systemic abuses that would justify a radical change to self-defense law. Advocates for both of the first immunity statutes—in Colorado (1986) and Florida (2005)—could not point to a single example of an improper prosecution.181See Wilbanks, supra note 126, at 54 (“[T]he sponsors of the bill were not able to point to any case in the past where they viewed the prosecutor to have incorrectly (in their view of the homeowner’s right of self-defense) charged a homeowner.”); Spies, supra note 153 (“Hammer and the Republican sponsors of Stand Your Ground could not point to a single instance in which a person had been wrongfully charged, tried, or convicted after invoking Florida’s traditional self-defense law.”). Rather, the chief NRA lobbyist for the Florida law ultimately contended that whether bad prosecutions have been brought is “not relevant.”182Spies, supra note 153; see also Daniella Rivera, ‘It’s Not Working’: KSL Investigates Unintended Consequences of New Utah Self-Defense Law, KSL.com (Nov. 16, 2021, 12:17 PM), https://www.ksl.com/article/50284891/its-not-working-ksl-investigates-unintended-consequences-of-new-utah-self-defense-law [https://perma.cc/R34W-ZCRG] (“Lisonbee said the [Utah immunity] law was intended to address politically motivated prosecutions but could not provide examples of that happening in Utah.”).

In subsequent efforts to immunize self-defense, advocates have invoked the prosecutions of George Zimmerman for the shooting death of Trayvon Martin and Rittenhouse for the Kenosha incident as exemplars of political prosecutions justifying self-defense immunity.183See, e.g., Kyle’s Law, supra note 13 (naming the Zimmerman and Rittenhouse prosecutions as evidence of political prosecutions that rationalize the adoption of self-defense immunity). Looking to Zimmerman’s prosecution is somewhat ironic given that it took place in Florida after Florida adopted its 2005 immunity provision and Zimmerman opted not to have a pretrial immunity hearing.184See Lizette Alvarez & Cara Buckley, Zimmerman Is Acquitted in Trayvon Martin Killing, N.Y. Times (July 13, 2013), https://www.nytimes.com/2013/07/14/us/george-zimmerman-verdict-trayvon-martin.html [https://perma.cc/LWH5-H78G] (noting that the shooting occurred on February 26, 2012, and the trial took place in 2013). Furthermore, in both cases the juries reached verdicts only after extensive deliberation. The lead homicide investigator in the Zimmerman case recommended charges but was initially overruled.185Matt Gutman, Trayvon Martin Investigator Wanted Manslaughter Charge, ABC News (Mar. 27, 2012, 8:18 AM), https://abcnews.go.com/US/trayvon-martin-investigator-wanted-charge-george-zimmerman-manslaughter/story?id=16011674 [https://perma.cc/8TGG-8BQJ]. Many perceived the declination of charges as reflecting racial bias, as Martin was an unarmed Black teenager.186See Markovitz, supra note 22, at 879–80 n.32 (recounting how many thought “the criminal justice system was indifferent to Trayvon Martin’s death, and was disinclined to try to provide justice”). A special prosecutor ultimately brought charges and a trial was held.187See Alvarez & Buckley, supra note 184. The law considered by Zimmerman’s jury did not include how initial aggressors have a limited right to self-defense, since the judge declined to instruct the jury on the initial aggressor doctrine;188See Alafair Burke, What You May Not Know About the Zimmerman Verdict: The Evolution of a Jury Instruction, HuffPost (July 15, 2013), https://www.huffpost.com/entry/george-zimmerman-jury-instructions_b_3596685 [https://perma.cc/BDD9-7TGS]. perhaps that would have made a difference in the verdict. Others have argued that prosecutors in both cases made strategic errors that may have affected the outcomes.189Some legal scholars have asserted that the Zimmerman prosecution made a tactical error by pursuing a murder theory rather than solely a manslaughter theory. David G. Savage & Michael Muskal, Zimmerman Verdict: Legal Experts Say Prosecutors Overreached, L.A. Times (July 14, 2013,
12:00 AM), https://www.latimes.com/nation/la-xpm-2013-jul-14-la-na-zimmerman-legal-20130715-story.html [https://perma.cc/8BM3-XDAN]. That, of course, is different than saying Zimmerman should not have been prosecuted at all. Various commentators have also critiqued the strategy and tactics deployed in the Rittenhouse prosecution. See, e.g., Ashley Collman, Did Prosecutors Bungle the Kyle Rittenhouse Case? Legal Experts’ Reviews Are Mixed, Insider (Nov. 16, 2021, 12:29 PM), https://www.insider.com/legal-experts-say-kyle-rittenhouse-prosecution-made-some-mistakes-2021-11 [https://perma.cc/T6T7-97DW].
In the Zimmerman trial, half of the jurors reportedly wanted to convict but changed their minds.190Richard Luscombe, George Zimmerman: Half of Jurors ‘Initially Favored Conviction,’ Guardian (July 16, 2013, 7:22), https://www.theguardian.com/world/2013/jul/16/george-zimmerman-jurors-trayvon-martin [https://perma.cc/S4E7-4GY3].Deliberations in both cases extended over multiple days before the jurors returned not guilty verdicts.191Id.; see Bosman, supra note 7 (noting that the Rittenhouse jury deliberated for three days before reaching its verdict).

Of course, in an ideal world, prosecutors would have perfect clarity into guilt and innocence, and prosecutions that result in acquittals after trial would never be brought. That, of course, is not realistic and is the reason why affirmative defenses and trials exist.192Cf. Ward, supra note 22, at 136 (“The adjudication process itself is a recognition of human imperfection—because we can never have perfect knowledge, we subject our suspicions to the test of a criminal trial (or at least the prospect of a criminal trial) before punishing someone suspected of a crime.”). Moreover, in light of the radical nature of the change wrought by singling out self-defense for immunity, if political prosecutions are the justification, then advocates should put forth more and better examples.

Another rationale that advocates raise is that self-defense is philosophically or morally distinct as a natural or human right.193See, e.g., Self-Defense Amendments: Hearing on H.B. 227 Before the H. Judiciary Comm., 64th Leg., 2021 Gen. Sess. (Utah 2021), https://le.utah.gov/av/committeeArchive.jsp?timelineID=
180423 [https://perma.cc/XB2Y-GLUD] (testimony of Clark Aposhian, Utah Shooting Sports Council, noting “[s]elf-defense is a basic human right”).
The Republican Party platform refers to the right of self-defense as “God-given.”194See Republican Nat’l Convention, Republican Party Platform of 2016, at 12 (2016), https://prod-cdn-static.gop.com/static/home/data/platform.pdf [https://perma.cc/S4TQ-NA62]. And the argument that self-defense is a justification and not an excuse is often explained by referencing moral philosophy.195See Reznik, supra note 20, at 26–27. But these understandings of self-defense as a natural, divine, or human right have long existed in harmony with adjudication at trial. Blackstone, for example, referred to self-defense as a natural right,1961 William Blackstone, Commentary on the Laws of England *139–40 (1765). but he believed, as described above, that self-defense is squarely a jury question.197See supra notes 52–53 and accompanying text (discussing Blackstone’s account of the process for raising affirmative defenses). Saying that self-defense is a natural right does not rationalize treating it as an immunity any more than it rationalizes erasing the common law elements of necessity and proportionality that have long guided self-defense decision-making.198See, e.g., Isaacs v. State, 25 Tex. 174, 177 (1860) (stating that the right to self-defense “is founded on the . . . law of nature” but that the common law requirement of “necessity of the case, and that only . . . justifies a killing”).

That leaves the fourth explanation, which perhaps arises most often: that gun owners should not have to pay typical criminal defense costs if they have a claim of self-defense. The NRA’s former executive director noted that “the legal fees . . . can easily top $50,000.”199Cox, supra note 151. A representative of a gun rights advocacy group in Wyoming expressed a similar view: “We don’t want to have a gun owner bankrupted by the criminal process just because he had to use a firearm in self-defense.”200Arno Rosenfeld, Senate Removes Immunity from ‘Stand Your Ground’ Law, Cody Enter. (Feb. 28, 2018), https://www.codyenterprise.com/news/local/article_d303bdba-1cc8-11e8-8673-776a19213ae2.html [https://perma.cc/6TPX-XS8P] (quoting Aaron Dorr of Wyoming Gun Owners discussing Senate File 71). And in Utah, an advocate said, “I have people calling me all the time [and saying] I’m afraid it will ruin me if I have to defend myself.”201Self-Defense Amendments: Hearing on H.B. 227 Before the H. Judiciary Comm., 64th Leg., 2021 Gen. Sess. (Utah 2021), https://le.utah.gov/av/committeeArchive.jsp?timelineID=180423 [https://perma.cc/EGF2-RRLM] (statement of Mitch Vilos). The legislative sponsor of the Utah bill recounted how a person leaving a gun carry class remarked, “I would rather die than financially ruin my family” by using a gun in self-defense.202House of Representatives Floor Debate on H.B. 227 During the 2021 General Session, Utah State Legislature, at 1:00:11 (Feb. 22, 2021), https://le.utah.gov/av/floorArchive.jsp?
markerID=114533 [https://perma.cc/YF2H-3WSJ] (statement of State Rep. Karianne Lisonbee).

The cost of criminal defense is a concern for all defendants, not just those asserting that violent conduct was justified as self-defense, and cost typically is not a sufficient rationale for prosecutorial—as opposed to civil—immunity.203See supra notes 100–01 and accompanying text (comparing rationales for civil and prosecutorial immunities); cf. Ward, supra note 22, at 135–36 (questioning the trial hardship rationale for self-defense immunity procedures). If self-defense, alone among affirmative criminal law defenses, is to be immunized, it warrants a much stronger rationale than cost saving for gun owners. This is especially true in light of the costs incurred as a result of self-defense immunity that are discussed in the next Part.

III.  THE COSTS OF IMMUNIZING PRIVATE VIOLENCE

The previous Section showed how the usual arguments put forth to support self-defense immunity are thin. It also is important to consider whether immunizing private violence has costs that further undercut exceptional treatment of defensive force. This Part contends that it does: immunizing self-defense can lead to more unlawful violence with less legal oversight; diminish the jury, thereby inviting less accurate and less legitimate outcomes; and introduce inefficiency into the criminal justice process.

A.  More Unlawful Violence (and Increased Impunity)

The message that self-defense immunity sends is troubling: that people can engage in defensive violence that they believe is lawful with less legal oversight. Both logic and data suggest that this message could bring about more assaults and homicides because of the impunity it signals—and in fact provides. Frederick Schauer has observed that “[q]uite often, officials who are immune for one reason or another from formal legal sanctions violate the law with some frequency.”204Frederick F. Schauer, The Force of Law 90 (2015). One can expect the same result from self-defense immunity, except for a much larger swath of the population; relatively few people receive official immunity, but everyone is entitled to assert self-defense when defending against criminal charges.205Moreover, officials often are constrained by other forms of oversight that could compensate for the negative effects of granting immunity. See id. at 86 (discussing internal punishment that can play a role “in ensuring official obedience to law”).

Rafi Reznik has recently argued that the modern understanding of self-defense as a justification, not an excuse, can signal societal acceptance of the alleged offense conduct in a way that promotes more violence;206See Reznik, supra note 20, at 68 (“[W]e should not want to tell self-defenders that they have done the right thing, nor provide them with the powers that justification confers, vindicate the values that justificatory self-defense stands for, or accept the socio-political conditions that self-defense laws create or perpetuate.”). immunity sends an even more powerful signal. As Reznik describes, in the dominant view, a justification indicates that “the wrongfulness of the act is negated.”207Id. at 26. Excuses, on the other hand, do not negate the wrongfulness of the conduct but “negate the blameworthiness of the actor.”208Id. at 27. The upshot is that “[j]ustifying self-defense,” as opposed to excusing it, “can . . . amount to an encouragement and it can even amount to an imperative.”209Id. at 33; see also Markovitz, supra note 22, at 875–76 (observing how “legal determinations of self-defense are, in effect, reflective of policy determinations about socially acceptable forms of violence”). Reznik argues that self-defense should be considered an excuse, which it was under English common law.210See Reznik, supra note 20, at 65; see also Darrell A. H. Miller, Self-Defense, Defense of Others, and the State, 80 Law & Contemp. Probs. 85, 87–95 (2017) (tracing the intellectual history of self-defense from an excuse to a justification). On the ground, however, the trend is going in the opposite direction: jurisdictions are granting immunity to self-defenders, which goes even further down the path toward encouraging the use of violence than considering self-defense a justification.211Cf. Schauer, supra note 204, at 7 (noting that “[s]ometimes the law” creates positive incentives “by granting immunity from otherwise applicable and legally enforced obligations”).

This trend is especially problematic because people are often wrong about the lawfulness of defensive force. One study found, for example, that a majority of self-reported defensive gun uses are likely illegal.212David Hemenway, Debra Azrael & Matthew Miller, Gun Use in the United States: Results From Two National Surveys, 6 Inj. Prevention 263, 266 (2000). People “view [a] hostile encounter from their own perspective; in any mutual combat both participants may believe that the other side is the aggressor and that they themselves are acting in self-defense.”213Id. A particular incident from the summer of the Rittenhouse shooting is exemplary.

Two months before the Rittenhouse shooting, Mark and Patricia McCloskey stood outside their St. Louis, Missouri, mansion as racial justice protesters marched nearby.214Tom Jackman, St. Louis Couple Who Aimed Guns at Protesters Charged with Felony Weapons Count, Wash. Post (July 20, 2020, 8:33 PM), https://www.washingtonpost.com/nation/2020/07/20/st-louis-couple-who-aimed-guns-protesters-charged-with-felony-weapons-count [https://perma.cc/5PW7-6PG3]. The protestors do not appear to have entered the McCloskeys’ property, though they marched on the sidewalk in a gated community in which the McCloskeys lived. See Jessica Lussenhop, Mark and Patricia McCloskey: What Really Went on in St Louis that Day?, BBC (Aug. 25, 2020), https://www.bbc.com/news/election-us-2020-53891184 [https://perma.cc/C53B-FGL2] (reporting that, while protestors walked into the private neighborhood, video from the event “does not show the protestors cross[ed] onto the McCloskeys’ property, remaining instead on the sidewalks and in the roadway”). Both were captured on video screaming angrily and wielding firearms: Mr. McCloskey, an AR-15–style rifle, and Ms. McCloskey, a handgun that she pointed at one protester after another.215See Jackman, supra note 214; see also KMOV St. Louis, Charges Filed Against Mark and Patricia McCloskey, YouTube (Jul. 20, 2020), https://www.youtube.com/watch?v=sUMfKFLGDcE [https://perma.cc/QP6H-CH8Q]. In Missouri, it is a crime to “exhibit[], in the presence of one or more persons, any weapon readily capable of lethal use in an angry or threatening manner.”216Mo. Rev. Stat. § 571.030 (2022). A local prosecutor charged the couple with violating that statute.217See Jackman, supra note 214. In their defense, the couple asserted that their conduct was justified to protect themselves and their property.218Id.

Speaking at the 2020 Republican National Convention (the McCloskeys, like Rittenhouse, became celebrities on the political right for their gun use),219Caitlin O’Kane, St. Louis Couple Who Pointed Guns at Black Lives Matter Protesters to Speak at Republican National Convention, CBS News (Aug. 18, 2020, 2:06 PM), https://www.cbsnews.com/news/republican-national-convention-mark-patricia-mccloskey-to-speak [https://perma.cc/ATS7-DQH7]. Mr. McCloskey subsequently announced his candidacy for a
U.S. Senate seat, prominently displaying on his campaign website a photograph of himself and
Ms. McCloskey holding their guns during the racial justice protest. McCloskey for
Senate, https://www.mccloskeyforsenate.com [https://web.archive.org/web/20211106143557/https://
http://www.mccloskeyforsenate.com].
Mr. McCloskey, a lawyer, expressed outrage that the prosecutor “actually charged [them] with felonies for daring to protect [their] home.”220CNBC, Couple Who Pointed Guns at BLM Protesters Speaks at RNC, YouTube (Aug. 24, 2020), https://www.youtube.com/watch?v=gK8P0vUQ4lg [https://perma.cc/AU64-926A]. Then, in a remarkable move, Missouri’s attorney general urged dismissal of the local charges on the basis of the sentiment underlying immunity: “Missourians should not fear exposure to criminal prosecution when they use firearms to defend themselves and their homes from threatening intruders.”221Amicus Brief of Attorney General Eric Schmitt Supporting Dismissal of the Case, State v. McCloskey, No. 2022-CR01300, at *1 (Cir. Ct. Mo. Jul. 20, 2020). In the end, however, the couple effectively conceded that they were not lawfully defending themselves when they pled guilty to the crimes of assault and harassment, thereby waiving any claim for self-defense.222Meryl Kornfield, St. Louis Couple Who Pointed Guns at Protesters Plead Guilty, Will Give Up Firearms, Wash. Post (June 17, 2021, 7:07 PM), https://www.washingtonpost.com/nation/
2021/06/17/st-louis-couple-guns [https://perma.cc/EFL6-859X]; see Hagan v. State, 836 S.W.2d 459, 461 (Mo.1992), overruled on other grounds by State v. Heslop, 842 S.W.2d 72 (Mo. 1992) (“The general rule in Missouri is that a plea of guilty voluntarily and understandably made waives all non-jurisdictional defects and defenses.” (citation and quotation marks omitted)). Mr. McCloskey nonetheless showed no remorse, saying of the criminal conduct, “I did it, and I’d do it again.” Joel Currier, St. Louis
Gun-Waving Couple Plead Guilty to Misdemeanor Charges, St. Louis Post-Dispatch (June
17, 2021), https://www.stltoday.com/news/local/crime-and-courts/st-louis-gun-waving-couple-plead-guilty-to-misdemeanor-charges/article_5b02e25b-0034-58a3-8181-f0a724ffa323.html [https://perma.
cc/J5NN-A3MY]. The Supreme Court of Missouri suspended both Mark and Patricia McCloskeys’ law licenses because of their convictions for offenses involving moral turpitude. See In re Mark T. McCloskey, Order, No. SC99301 (Mo. Feb. 8, 2022); In re Patricia McCloskey, Order, No. SC99302 (Mo. Feb. 8, 2022).
In other words, despite their confident assertions that they were legally justified in their actions, they ultimately admitted that they had no legal justification for their conduct.223The case did not end there. The Missouri governor, who asserted that the prosecution was “political” and “out of control,” pardoned the couple. Meryl Kornfield, Missouri Governor Pardons
Mark and Patricia McCloskey, Who Pointed Guns at Protestors, Wash. Post (Aug. 3, 2021, 10:25
PM) https://www.washingtonpost.com/nation/2021/08/03/mccloskey-pardon [https://perma.cc/UP68-AY2Q]; Marc Cox Show, Interview of Governor Mike Parson, Facebook (July 17, 2020), https://www.facebook.com/watch/?v=273414383946013 [https://perma.cc/L63N-AUWR].

Unlawful defensive force imposes an especially troubling risk to Black men and women, like many of those marching in front of the McCloskey house, who are mistaken as threats all too frequently. Data has consistently shown that Black people are more likely to be misperceived as a threat than white people.224L. Song Richardson & Phillip Atiba Goff, Self-Defense and the Suspicion Heuristic, 98 Iowa L. Rev. 293, 307–14 (2012) (discussing data). According to L. Song Richardson and Phillip Atiba Goff, this is in part because Black people “serve as our mental prototype (i.e., stereotype) for the violent street criminal.”225Id. at 310. A prosecution and trial can separate out biased and unreasonable threat perceptions from unbiased and reasonable ones better than any individual can in the moment.226To be sure, I am not saying that juries are never biased. The point, rather, is that a jury with representation from a cross-section of the community as required by law should reflect more diverse voices than a lone defendant (or judge), which would make it better suited to discern biased and unreasonable threat assessments. I discuss virtues of the jury in greater detail below. See infra Section III.B. And getting it right is important for ensuring a fair and just implementation of criminal law.

Well-intentioned people can have flawed perceptions of lawfulness, but encouraging restraint for defensive violence through the threat of prosecution and punishment is even more important for those who are ill-intentioned. For some people, “genuine and sanction-independent obedience [to the law] is rare.”227Schauer, supra note 204, at 75; see id. at 59 (noting how law serves to “constrain[] moral outliers”). In that circumstance, “coercion through the threat of sanctions emerges as the principal mechanism for securing the obedience that turns out to be so often necessary.”228Id. at 75. Immunity lessens the law’s constraining force and risks that someone prone to violence will construe immunity as a license to commit violence.229See generally Dan M. Kahan, Gentle Nudges vs. Hard Shoves: Solving the Sticky Norms Problem, 67 U. Chi. L. Rev. 607 (2000) (discussing the criminal law’s ability to shape norms and behavior).

In this regard, it is notable that a study of the effects of Colorado’s 1986 immunity law found that those invoking immunity “used force (sometimes deadly force) as much out of anger as self-defense.”230Wilbanks, supra note 126, at 322. Moreover, the legal change primarily benefited defendants other than the intended beneficiaries—homeowners confronting stranger intruders.231Id. In the years immediately following the enactment, the only “strangers” who intruded into homes and faced defensive force triggering immunity were police officers.232Id. at 321.

Unfortunately, more recent empirical studies on the impact of changes to self-defense law do not distinguish between the effect of various simultaneous changes, such as Stand Your Ground, presumptions, and immunity. Several such studies have shown that self-defense reforms that include an immunity provision correlate with more violent crime.233See, e.g., Alexa R. Yakubovich, Michelle Degli Esposti, Brittany C. L. Lange, G. J. Melendez-Torres, Alpa Parmar, Douglas J. Wiebe & David K. Humphreys, Effects of Laws Expanding Civilian Rights to Use Deadly Force in Self-Defense on Violence and Crime: A Systematic Review, Am. J. Pub. Health (Mar. 10, 2021) (reviewing the literature). One study found that in the decade following Florida’s 2005 legislation, “monthly rates of homicide increased by 24.4% and monthly rates of homicide by firearm by 31.6%.”234David K. Humphreys, Antonio Gasparrini & Douglas J. Wiebe, Evaluating the Impact of Florida’s “Stand Your Ground” Self-Defense Law on Homicide and Suicide by Firearm: An Interrupted Time Series Study, 177 JAMA Internal Med. 44, 49 (2017). Another found that the law was associated with a 44.6% increase in adolescent firearm homicides.235Michelle Degli Esposti, Douglas J. Wiebe, Jason Gravel & David K. Humphreys, Increasing Adolescent Firearm Homicides and Racial Disparities Following Florida’s ‘Stand Your Ground’ Self-Defence Law, 26 Inj. Prevention 187 (2020). In February 2020, the U.S. Commission on Civil Rights released a report finding no evidence of crime deterrence and an increase in homicide rates in states that adopted such laws.236U.S. Comm’n on C.R., Examining the Race Effects of Stand Your Ground Laws and Related Issues 6 (2020). A commissioner recommended rejecting self-defense immunity because it “remove[s] incentives to mitigate or reduce the use of deadly force by protecting the claimant regardless of the collateral consequences.”237Id. at 26 (statement of Michael Yaki). Yet, as noted, the power of these studies as regards the impact of self-defense immunity is limited and, hopefully, future empirical studies will seek to isolate the effect of self-defense immunity.

A corollary to the signals sent by self-defense immunity is that sometimes immunity can in fact hinder or prevent a conviction of someone who engages in unlawful violence. The analysis of cases soon after Colorado passed its self-defense immunity law in 1986 found that the statute likely led to an acquittal in one case that would otherwise have been a probable conviction, as well as decisions not to prosecute in others.238Wilbanks, supra note 126, at 321–24. The district attorney for a single county in Kansas has reported “declin[ing] to file charges against thirty-three people based on self-defense immunity,” thirty of which were deemed homicides by the coroner.239Report of District Attorney Marc Bennett 18th Judicial District of Kansas
43 (Jan. 18, 2022), https://www.sedgwickcounty.org/media/60604/final-c-lofton-january-18-2022.pdf [https://perma.cc/6N6Y-5KC2].
Three additional cases were charged by the district attorney but dismissed on self-defense immunity grounds by a judge.240Id. at 45.

Those arguing in support of self-defense immunity do not contest, and implicitly concede, much of this analysis. They acknowledge that the risk of having to defend against a prosecution causes gun owners to hesitate before deploying lethal force, and they seek to reduce such hesitation.241See, e.g., supra notes 202–03 and accompanying text (expressing gun owner concerns about the cost of defending against a prosecution). However, a cost of immunizing self-defense is to transform the signals sent by conventional self-defense law in a way that likely leads to more unlawful, and at times discriminatory, violence. Furthermore, immunizing self-defense erects an obstacle to achieving a basic goal of the criminal justice system: punishing those who commit crimes of violence.

B.  Fewer Juries in Matters of Community Importance

Another consequence of granting a defendant immunity is to disempower a jury from deciding facts surrounding a properly charged crime. The institution of the jury has long played a central role in self-defense cases. The jury is well-equipped to resolve disputes about the lawfulness of violence. Moreover, and importantly in the context of self-defense, the community’s involvement through the jury legitimates the law and promotes acceptance of outcomes as well as community healing.

Today, the jury is most often discussed solely in the context of defendants’ rights,242See U.S. Const. amend. VI (granting defendants the right to “an impartial jury”); Duncan v. Louisiana, 391 U.S. 145, 155 (1968) (“A right to jury trial is granted to criminal defendants in order to prevent oppression by the Government.”). but the jury’s importance to society is actually far deeper. At the nation’s founding, Anti-Federalists were adamant about protecting the institution of the jury because, even more than protecting the defendant, the jury integrated “the people in the administration of government.”243Herbert J. Storing, What the Anti-Federalists Were For, in 1 The Complete Antifederalist 19 (Herbert J. Storing ed. 1981). As Laura I. Appleman has put it, “the right of the jury trial” is about “the participation of the citizenry in [the] rule of law.”244See Laura I. Appleman, The Lost Meaning of the Jury Trial Right, 84 Ind. L.J. 397, 413 (2009); see also id. (noting that juries play an invaluable role for “the local community and to the people at large”); accord Stephen A. Siegel, The Constitution on Trial: Article III’s Jury Trial Provision, Originalism, and the Problem of Motivated Reasoning, 52 Santa Clara L. Rev. 373 (2012); Meghan J. Ryan, Juries and the Criminal Constitution, 65 Ala. L. Rev. 849, 882 (2014); George C. Harris, The Communitarian Function of the Criminal Jury Trial and the Rights of the Accused, 74 Neb. L. Rev. 804 (1995). This feature of the jury—as a key means of community involvement in the law’s implementation—is reflected in the fact that a defendant has no federal constitutional right to waive a jury trial, even if a defendant can demand one.245U.S. Const. amend. VI (“In all criminal prosecutions, the accused shall enjoy the right to a speedy and public trial, by an impartial jury of the State and district wherein the crime shall have been committed . . . .”); Singer v. United States, 380 U.S. 24, 34 (1965) (“[T]here is no federally recognized right to a criminal trial before a judge sitting alone.”). Some states do grant defendants a right to demand a bench trial as a matter of state law. See, e.g., Md. Code Ann., Crim. § 4-246 (West 2023) (“A defendant may waive the right to a trial by jury at any time before the commencement of trial . . . .”). Prosecutors and courts generally can demand jury trials even over the defendant’s objection.246See, e.g., Singer, 380 U.S. at 24–26 (upholding Rule 23(a) of the Federal Rules of Criminal Procedure, which requires the government to consent to and the court to approve a defendant’s waiver of a jury trial); Fla. R. Crim. P. 3.260 (“A defendant may in writing waive a jury trial with the consent of the state.”); Ky. R. Crim. P. 9.26 (“Cases required to be tried by jury shall be so tried unless the defendant waives a jury trial in writing with the approval of the court and the consent of the Commonwealth.”); State v. Greenwood, 297 P.3d 556, 558–59 (Utah 2012) (holding that a trial court erred when granting a defendant’s request for a bench trial over the prosecution’s objection); State v. Burks, 674 N.W.2d 640, 647 (Wis. Ct. App. 2003) (permitting the trial judge to insist on a jury trial even when both the defense and prosecution prefer a non-jury trial). Today, as in the past, there is a “strong preference for jury trials on all elements of a criminal case.”247Rodgers v. Commonwealth, 285 S.W.3d 740, 755 (Ky. 2009) (emphasis added).

Accuracy is one important interest served by this longstanding commitment to juries, because “[j]uries . . . are considered the best deciders of fact.”248See Ryan, supra note 244, at 872; see also Paul F. Kirgis, The Right to a Jury Decision on Sentencing Facts After Booker: What the Seventh Amendment Can Teach the Sixth, 39 Ga. L. Rev. 895, 905 (2005) (“As our system has implicitly recognized for centuries, juries are simply the best actors to decide fact questions.”); Jenia Iontcheva, Jury Sentencing as Democratic Practice, 89 Va. L. Rev. 311, 339–343 (2003) (discussing the virtues of the jury as a deliberative democratic body); Colleen P. Murphy, Integrating the Constitutional Authority of Civil and Criminal Juries, 61 Geo. Wash. L. Rev. 723, 745 (1993) (“The Founders considered the jury to be superior to a single judge in finding facts because it embodied the common sense of twelve individuals with a variety of experiences and knowledge.”). This is in no small part because juries “are more representative of their communities than judges . . . . They better represent various races, socio-economic classes, various levels of formal education, differing religions, and a broader spectrum of political engagement than do judges.”249Ryan, supra note 244, at 878; see also Laura Gaston Dooley, Our Juries, Our Selves: The Power, Perception, and Politics of the Civil Jury, 80 Cornell L. Rev. 325, 325 (1995) (“[T]he modern jury is the most diverse of our democratic bodies.”). This is especially true when the task is assessing “matters reflecting their communities’ values,”250Ryan, supra note 244, at 878. See generally Andrew Guthrie Ferguson, Why Jury Duty Matters (2012) (discussing the value of juries and jury duty in the American democracy). like self-defense.

Self-defense is inherently fact-based, calling for answering difficult questions about the reasonableness of a defendant’s perception of—and violent response to—a threat. Evaluating the lawfulness of self-defense calls for an assessment of whether defensive force was reasonably necessary and proportionate to a reasonably perceived threat.251See Ruben, supra note 19, at 81–89 (discussing elements of necessity and proportionality in self-defense law); United States v. Peterson, 483 F.2d 1222, 1229 (D.C. Cir. 1973) (“ ‘[T]he law of self-defense is a law of necessity’; the right of self-defense arises only when the necessity begins, and equally ends with the necessity . . . .”). A counterargument to the claim that a jury is best placed to decide on self-defense reasonableness might be that judges already decide questions of reasonableness for other purposes, especially determining the lawfulness of searches and seizures under the Fourth Amendment, so why not do so for self-defense, too? However, judicial determination of reasonableness in the Fourth Amendment context is itself heavily criticized, not least because “judges are not representative of the societal standards upon which [such] questions are based, thus likely skewing judges’ conclusions.” Ryan, supra note 244, at 874; see also id. at 877 n.177 (collecting scholarship critical of judicial determinations of reasonableness in the Fourth Amendment context). Criminal law scholars devise complex classifications in an attempt to capture the permutations of defensive confrontations and how they intersect with the law of self-defense,252See, e.g., Larry Alexander, Recipe for a Theory of Self-Defense: The Ingredients, and Some Cooking Suggestions, in The Ethics of Self-Defense 20, at 21–28 (Christian Coons & Michael Weber eds., 2016) (categorizing those who might be involved in self-defense situations and affect the application of law to facts as the victim, a nonthreatened third party, a culpable aggressor, a culpable person, a culpable faker, an innocent aggressor, an anticipated innocent aggressor, and an innocent bystander). but it is impossible to resolve self-defense claims through any sort of rote analysis. It is necessary to apply community values and experiences to assess reasonableness, and judges, unlike juries, are often removed from both.253See Ryan, supra note 244, at 874 (noting that judges “are not representative of society, nor are they usually representative of the individual communities that they serve”); id. at 874–77 (surveying literature on judicial diversity). Simply because a jury is comprised of a cross-section of the community, the jury will incorporate perspectives and experiences that lead to a fair resolution of disputed facts more so than a single judge who is likely insulated from the circumstances that gave rise to the violence.

Moreover, importantly, community resolution of the difficult factual questions that go into self-defense can legitimate the law and promote acceptance of outcomes.254See Funk, supra note 108, at 49 (“[W]idely rejected self-defence decisions can adversely impact the broader public’s view of the legitimacy of the legal order.”); id. (“Self-defence outcomes that are broadly rejected as immoral threaten to incrementally erode the justice system’s moral credibility, undermine compliance with the law, and reduce cooperation with legal authorities.”). Precisely because “juries have the power to incorporate societal norms and values into their decisions . . . citizens can view these determinations as legitimate and as not influenced by the political leanings of government-employed judges.”255Ryan, supra note 244, at 881. That sense of legitimacy, in turn, can help a community accept a case’s outcome and move past the trauma of community violence.

For example, after the killing of Trayvon Martin, the quick decision not to prosecute George Zimmerman led to mass protests across the country.256Patrik Jonsson, George Zimmerman Charged in Trayvon Martin Case: Why Now, and What Next?, Christian Sci. Monitor (Apr. 11, 2012), https://www.csmonitor.com/USA/Justice/2012/0411/
George-Zimmerman-charged-in-Trayvon-Martin-case-Why-now-and-what-next [https://perma.cc/
VZ8T-7P28] (describing protests).
Many thought that the declination of charges suggested that “the criminal justice system was indifferent to Trayvon Martin’s death and was disinclined to try to provide justice.”257Markovitz, supra note 22, at 879–80 n.32. The fact that Martin was a Black teenager triggered speculation that race was part of the reason for not immediately prosecuting Zimmerman.258Id. This speculation is consistent with data: as one researcher found, “[w]ith respect to race, controlling for all other case attributes, the odds a white-on-black homicide is found justified is 281 percent greater than the odds a white-on-white homicide is found justified.” John K. Roman, Race, Justifiable Homicide, and Stand Your Ground Laws: Analysis of FBI Supplementary Homicide Report Data 9 (2013). If the homicide occurred in a state with a Stand Your Ground law, like Florida, that “increases the odds of a justifiable finding by 65 percent.” Id. at 9–10; see also Nicole Ackermann, Melody S. Goodman, Keon Gilbert, Cassandra Arroyo-Johnson & Marcello Pagano, Race, Law, and Health: Examination of “Stand Your Ground” and Defendant Convictions in Florida, 142 Soc. Sci. & Med. 194 (2015) (finding a defendant was two times as likely to be convicted for killing a white victim as a non-white victim under Florida’s 2005 self-defense law). When a special prosecutor subsequently charged Zimmerman, the move brought great relief. Martin’s mother commented that “[w]e simply wanted arrest, nothing more, nothing less, and we got it.”259Jonsson, supra note 256 (quoting Trayvon’s mother, Sybrina Fulton). Although many people who wanted a prosecution may have been disappointed by the jury verdict of not guilty, that the process was followed, and that the decision was rendered by a jury certainly lowered the temperature of the earlier protests.

Conversely, a prosecution’s dismissal because of immunity sends a very different signal to the community. Victims and family members can never know how a jury of their peers would decide on the legality of defensive force. Indeed, a homicide case in Utah elicited the opposite reaction after the defendant was discharged because of self-defense immunity.260See Rivera, supra note 182 (describing the discharge of Troy James Pexton). A family member of the victim of the alleged homicide exclaimed in court: “We all feel the justice system has no doubt failed us.”261Id. (quoting from court audio recordings). Another said: “This has forever changed my outlook on the system and the faith that I once had that justice would prevail.”262Id. Similarly, in Kansas, after a prosecutor declined to bring homicide charges against juvenile detention officers citing a self-defense immunity law, the victim’s family viewed the decision as “yet another instance of an unarmed Black teenager killed by law enforcement with impunity” and without “even an ounce of accountability.”263Ryan Newton, Laura McMillan & Stephanie Nutt, Sedgwick County Prosecutor: No Charges in Cedric Lofton’s Death, KSN.com (Jan. 18, 2022), https://www.ksn.com/news/local/watch-live-da-holds-news-conference-unknown-subject [https://perma.cc/LP62-X55D] (quoting statement from Cedric Lofton’s family). Likewise, a community partnership expressed “outrage[]” at the declination of charges, viewing it as a “blatant disregard for the life” of the victim.264Id. (quoting statement by the Progeny youth/adult partnership).

The denouncements above demonstrate that self-defense immunity can not only prevent a community from healing, but can also undermine the rule of law and faith in the judiciary. In this regard, it is notable that the criticism in such cases is not at the legislature for passing a self-defense immunity bill, or at the governor for signing it, but rather at the “justice system” that “no doubt failed.”265Rivera, supra note 182. Moreover, under the law of self-defense, the harm caused by defensive violence is supposed to “remain[] a legally recognized harm which is to be avoided whenever possible,”266Robinson, supra note 29, at 213. and the conduct underlying self-defense is supposed to “remain[] generally condemned and prohibited.”267Id. at 220. Immunity dilutes the force of such legal values and erodes trust that the judicial system will enforce them.

C.  Inefficient Mini-Trials

One counterargument to concerns about self-defense immunity is that it will only weed out rare, egregious prosecutions. In some places where self-defense immunity is already enacted, the defendant has the burden of proving self-defense at an immunity hearing,268See, e.g., People v. Guenther, 740 P.2d 971, 977 (Colo. 1987) (en banc). or, in the alternative, the prosecutor must only show probable cause that self-defense did not justify the defendant’s violence.269See, e.g., Rodgers v. Commonwealth, 285 S.W.3d 740, 754 (Ky. 2009); State v. Hardy, 390 P.3d 30, 39 (Kan. 2017). In those places, most self-defense cases might still proceed to trial. This, however, raises a question about judicial economy.

To be sure, the likely trajectory for self-defense immunity is for legislators to strengthen it, similar to how Florida recently placed the burden on prosecutors to disprove self-defense by clear and convincing evidence at a pretrial hearing.270See supra notes 159–61 and accompanying text. Since Florida has led the way for NRA-backed initiatives to be subsequently passed elsewhere,271See David Cole, Engines of Liberty: The Power of Citizen Activists to Make Constitutional Law 105 (2016) (“Florida has generally been the NRA’s starting line for legislative gun rights campaigns . . . .”). it is no surprise that when Utah passed its self-defense immunity law in spring 2021, a legislative sponsor said the law “basically copie[d] and paste[d]” the clear and convincing evidence standard “from Florida[’s] statute.”272Rivera, supra note 182 (quoting Rep. Karianne Lisonbee, R-Clearfield, on the floor of Utah’s House of Representatives); Utah Code Ann. § 76-2-309 (West 2021) (setting out clear and convincing evidence standard). Furthermore, even in jurisdictions with lesser prosecutorial immunity standards currently, immunity still sends troubling signals that could increase violence.273See supra Section III.A.

Setting aside these concerns and focusing narrowly on the argument that immunity will have little impact on prosecutions outside of rare cases, a question arises: Why undertake an expensive immunity hearing that will mirror the eventual trial at all? Two goals of the rules governing criminal procedure are to “secure simplicity of procedure” and “to eliminate unjustifiable expense.”274Fed. R. Crim. P. 2. Self-defense immunity runs counter to those goals.

In this regard, it is helpful to contrast self-defense with other pretrial issues discussed above,275See supra Section I.C. which generally implicate evidence that is both clear-cut and distinct from proof of guilt or innocence. Whether too much time has passed between criminal conduct and a prosecution so as to violate a statute of limitations, for example, may call only for simple arithmetic unrelated to the alleged offense conduct.276See Toussie v. United States, 397 U.S. 112, 114–15 (1970) (“The purpose of a statute of limitations is to limit exposure to criminal prosecution to a certain fixed period of time following the occurrence of those acts the legislature has decided to punish by criminal sanctions.”). The same could be said for speedy trial issues.277See United States v. Loud Hawk, 474 U.S. 302, 312 (1986) (“[T]he Sixth Amendment’s guarantee of a speedy trial ‘is an important safeguard to prevent undue and oppressive incarceration prior to trial, to minimize anxiety and concern accompanying public accusation and to limit the possibilities that long delay will impair the ability of an accused to defend himself.’ ” (quoting United States v. Ewell, 383 U.S. 116, 120 (1966))). Determining whether a pending prosecution is substantially the same as an earlier one, thereby violating double jeopardy protections, calls for a comparison of the two prosecutions.278See Brown v. Ohio, 432 U.S. 161, 165 (1977) (“The legislature remains free under the Double Jeopardy Clause to define crimes and fix punishments; but once the legislature has acted courts may not impose more than one punishment for the same offense and prosecutors ordinarily may not attempt to secure that punishment in more than one trial.”). And determining whether diplomatic immunity attaches often only requires inquiring into the defendant’s status as a diplomat and whether the sending state has waived the immunity.279See, e.g., United States v. Khobragade, 15 F. Supp. 3d 383, 385 (S.D.N.Y. 2014) (“With several exceptions not applicable here, diplomatic officers may not be arrested, detained, prosecuted or sued unless their immunity is waived by the sending state.”); see also Diplomatic and Consular Immunity, supra note 101, at 7–8 (“Diplomatic agents . . . enjoy complete immunity from the criminal jurisdiction of the host country’s courts and thus cannot be prosecuted no matter how serious the offense unless their immunity is waived by the sending state . . . .”).

Yet proving or disproving whether self-defense exculpates requires consideration of the same witnesses and evidence that will be introduced at trial to prove the charged crime. Indeed, this is implicit in affirmative defenses (like self-defense), which contend that something happening at the time of the alleged offense justified or excused the underlying conduct. Resolving the lawfulness of self-defense ahead of trial would call for delving into the circumstances surrounding the charged offense and receiving testimony from the same witnesses of the alleged crime who will testify at trial. Self-defense immunity hearings, when they do not result in a dismissal, involve “mini-trials of the evidence in advance of the actual trial” that criminal procedure typically seeks to avoid.280See, e.g., United States v. Bazezew, 783 F. Supp. 2d 160, 166 (D.D.C. 2011) (discussing preference to avoid mini-trials in the context of evidentiary disputes).

To be sure, adding costs and inefficiencies is not always inappropriate. Many scholars agree that grand juries are ineffective at eliminating bad prosecutions281See, e.g., Roger A. Fairfax Jr., Grand Jury Innovation: Toward a Functional Makeover of the Ancient Bulwark of Liberty, 19 Wm. & Mary Bill Rts. J. 339, 341–45 (2010) (summarizing critiques); Andrew D. Leipold, Why Grand Juries Do Not (and Cannot) Protect the Accused, 80 Cornell L. Rev. 260, 265–69 (1995). The classic cliché is that a grand jury would “indict a ham sandwich.” See In re Grand Jury Subpoena of Stewart, 545 N.Y.S.2d 974, 977 n.1 (Sup. Ct. 1989), aff’d as modified, 548 N.Y.S.2d 679 (App. Div. 1989) (“[M]any lawyers and judges have expressed skepticism concerning the power of the Grand Jury. This skepticism was best summarized by the Chief Judge of this state in 1985 when he publicly stated that a Grand Jury would indict a ‘ham sandwich.’ ”). and that the plea bargain system that is used to resolve the vast majority of criminal prosecutions creates injustices.282See Jenia I. Turner, Transparency in Plea Bargaining, 96 Notre Dame L. Rev. 973, 974 (2021) (“Today, over ninety-five percent of convictions at the state and federal levels are the product of guilty pleas.”); Jenia I. Turner, Plea Bargaining, in 3 Reforming Criminal Justice: Pretrial and Trial Processes 73, 80–88 (Erik Luna ed., 2017) (reviewing critiques of plea bargaining). Some scholars and advocates have thus suggested reforms that would be costly, like enhancing internal prosecutorial screening283See Ronald Wright & Marc Miller, The Screening/Bargaining Tradeoff, 55 Stan. L. Rev. 29, 30–35 (2002) (“By prosecutorial screening we mean a far more structured and reasoned charge selection process than is typical in most prosecutors’ offices in this country.”); see also Allen et al., supra note 76, at 1039 (“In a system that resolves a huge majority of cases without trials, the choice of how best to screen prosecutors’ charging decisions is critically important to the quality of justice the system delivers.”). or devising something akin to summary judgment for criminal procedure.284Carrie Leonetti, When the Emperor Has No Clothes: A Proposal for Defensive Summary Judgment in Criminal Cases, 84 S. Cal. L. Rev. 661, 666, 685 n.105 (2011). But self-defense immunity is extrinsic to that broader conversation, which is about how to improve the pretrial process for all issues bearing on guilt and innocence, and for all defendants. Self-defense immunity grants a benefit for one defense championed by powerful lobbyists. That may explain why self-defense immunity is passing in legislatures, but it hardly rationalizes the costs.

CONCLUSION

A central goal of this Article is to show that the exceptionalism reflected in self-defense immunity laws is not rooted in history, tradition, or longstanding priorities of criminal law and procedure. Self-defense has always been an affirmative defense, embedded in a system of defenses and vindicated through the same criminal justice process as other defenses. Those pursuing self-defense immunity have thus far failed to put forward a compelling rationale for a radical departure from legal tradition. Self-defense should remain unexceptional within the system of criminal law defenses to avoid the unwarranted harms that can come from immunizing private violence.

96 S. Cal. L. Rev. 509

Download

* Associate Professor, SMU Dedman School of Law; Fellow, Brennan Center for Justice at N.Y.U. School of Law. Many thanks to Hillel Bavli, Joseph Blocher, Jake Charles, Guillermo Jose Garcia Sanchez, Chris Jenks, Cynthia Lee, Pamela Metzger, Darrell A. H. Miller, Adam Sopko, Jenia Iontcheva Turner, and Cynthia Ward, as well as participants in the U.C. Davis Law Review’s 2021 symposium, for helpful comments and suggestions. Tiereney Bowman, Robert Larkin, Maggie Gianvecchio, Darby O’Grady, Meredith Palmer, and Nick Salinaro provided excellent research assistance.

Suing SPACs

In 2020, the financial world became transfixed by a massive increase in the number of firms going public through special

Read More »

Suing SPACs

In 2020, the financial world became transfixed by a massive increase in the number of firms going public through special purpose acquisition company (“SPAC”) transactions. A SPAC is a publicly traded company formed solely for the purpose of raising money from investors and choosing a merger partner, thereby bringing the target company public. SPAC shareholders vote on the proposed transaction, but also have the option to redeem their shares for the price paid plus interest prior to the merger. SPACs have always been controversial; they make risky ventures available to unsophisticated investors, may involve acute conflicts of interest, and do not make the rigorous disclosures required in standard IPOs.

Is litigation a solution to these problems? The SPAC boom, as many commentators predicted, precipitated a deluge of lawsuits. Although several studies examine the SPAC transactions themselves, this project is the first comprehensive study of SPAC-related litigation. Using a dataset of all SPAC transactions completed since 2014 and all SPAC-related lawsuits filed since 2017, I assess the prevalence and characteristics of these lawsuits. I find that the probability that a SPAC transaction will generate a lawsuit appears to be unrelated to the returns on the deal, the size of the merger, the industry of the target, and various proxies for SPAC quality. However, I find a negative association between the likelihood of litigation and redemption rate. This is surprising because it means that the SPAC transactions more likely to generate lawsuits are those in which the SPAC shareholders choose to keep, rather than redeem, their shares, presumably signaling greater confidence in the quality of the deal. I argue that many of these lawsuits are opportunistic and may be of questionable quality. I further argue that these lawsuits are an inadequate substitute for the liability that firms face in connection with standard IPOs.

INTRODUCTION

Beginning in 2020, the financial world has been transfixed by a striking rise in the number of firms going public through special purpose acquisition company (“SPAC”) transactions. Although SPACs have been in and out of vogue for several decades, the most recent explosion is unprecedented.1See Max H. Bazerman & Paresh Patel, SPACs: What You Need to Know, Harv. Bus. Rev. July/Aug. 2021, at 104. In 2020, 53% of all initial public offerings (“IPOs”) were SPACs, and in the first quarter of 2021, that percentage rose to 62%,2Distribution of traditional IPOs and special purpose acquisition company (SPAC) IPOs in the United States from 2016 to 2021, Statista (July 4, 2022), https://www.statista.com/statistics/1234111/
number-traditional-spac-ipo-usa [https://perma.cc/YKA9-CYQA].
when SPACs issued more than $30 billion per month in equity.3Ortenca Aliaj & Miles Kruppa, The SPAC Machine Sputters Back to Life After Dramatic Meltdown, Fin. Times (Nov. 21, 2021), https://www.ft.com/content/d1723a8e-c146-4d48-8475-01cc9947a5d6 [https://perma.cc/Z3ZF-CJN5]; see also John C. Coates, SPAC Law and Myths 2 (Feb. 11, 2022) (unpublished manuscript), https//ssrn.com/abstract=4022809 [https://perma.cc/7L32-9QEU].

A SPAC is essentially a publicly traded shell company whose sole purpose is to merge with a private company, thereby bringing it public. The SPAC raises money from investors in an IPO and has a limited period (often about two years) to search for a target. Once the SPAC identifies a merger partner, the target merges into the SPAC in a “de-SPAC” transaction, thus going public while itself avoiding the time-consuming and expensive IPO process. Upon the announcement of a merger, SPAC shareholders are given the opportunity to redeem their shares for the issuing price plus any interest accrued—usually about ten dollars. Low redemption rates are thought to signal confidence in the quality of the merger—shareholders think the transaction is a good one, and thus elect to retain their shares. High redemption rates, by contrast, may signal that shareholders and the market in general are skeptical of the deal.

SPAC transactions have been controversial for most of their existence. They allow young, risky firms to access the public markets—and potentially, unsophisticated investors.4Although recent studies have found that most investors in SPAC IPOs are institutional investors, see Michael Klausner, Michael Ohlrogge, & Emily Ruan, A Sober Look at SPACs, 39 Yale J. on Reg., 228, 298 (2022), they are known as “poor-man’s private equity” because they allow the person on the street access to riskier, though potentially higher-reward transactions, see Usha Rodrigues & Michael Stegemoller, Redeeming SPACs 1 (Univ. of Ga. Sch. of L., Research Paper No. 2021-09, 2021), https://papers.ssrn.com/sol3/papers.cfm?abstract_id=3906196 [https://perma.cc/6DM8-B5HZ]. Commentators have also argued that retail investors may be more likely to lose out in de-SPAC transactions. See Bobby Reddy, The SPACtacular Rise of the Special Purpose Acquisition Company: A Retail Investor’s Worst Nightmare 3 (Univ. of Cambridge Legal Stud., Research Paper No. 32/2021, 2021), http://ssrn.com/abstrat=3968983 [https://perma.cc/UV7A-M7DL]. De-SPAC transactions are conducted on tight timelines by management who will lose everything if the merger does not close. They may settle for subpar targets or terms, skimp on diligence, or even engage in outright fraud rather than risk losing a deal and returning all the IPO proceeds to the shareholders. Moreover, shareholders that see the transaction through face a risk of significant dilution that they may not understand.5Klausner et al., supra note 4, at 253.

Is litigation a solution to these problems? It would not be crazy to think that surely the heightened risks that investors face in these transactions should be fully disclosed and that SPACs should face dire consequences if these disclosures are incomplete and inaccurate. But by design, the heavy hammer of strict liability under section 11 of the Securities Act and Exchange Act (“section 11”), which penalizes misstatements in standard IPOs, is generally inapplicable to SPACs. This has not deterred plaintiffs, however; top legal blogs6See, e.g., Douglas A. Rappaport, Jacqueline Yecies & Stephanie Lindmuth, Recent SPAC Shareholder Suits in New York State Courts: The Beginning Wave of SPAC Litigation, Harv. L. Sch. F. on Corp. Governance (Apr. 23, 2021), https://corpgov.law.harvard.edu/2021/04/23/recent-spac-shareholder-suits-in-new-york-state-courts-the-beginning-wave-of-spac-litigation [https://perma.cc/
8ATG-7ZWH]; Ross Todd, A Wave of SPAC Litigation is Coming. Here’s What to Watch For, AmLaw Litig. Daily (May 11, 2021), https://www.law.com/litigationdaily/2021/05/11/a-wave-of-spac-litigation-is-coming-heres-what-to-watch-for/?slreturn=20220917141038 [https://perma.cc/AVW4-EBJ6].
and press articles7See, e.g., Alison Frankel, The New ‘Deal Tax’: SPAC Defendants Are Paying Plaintiffs Lawyers to Drop N.Y. State Suits, Reuters (May 5, 2021), https://www.reuters.com/business/legal/new-deal-tax-spac-defendants-are-paying-plaintiffs-lawyers-drop-ny-state-suits-2021-05-05 [https://perma.
cc/8CMC-E7WK?type=image]; Kevin LaCroix, Trouble Brewing in SPAC-Land?, D&O Diary (Apr. 13, 2021), https://www.dandodiary.com/2021/04/articles/securities-litigation/trouble-brewing-in-spac-land [https://perma.cc/W8BU-DMVS].
began to bubble with anecdotal reports of a deluge of SPAC-related lawsuits in the spring of 2021. But while numerous commentators have examined the characteristics of the SPACs constituting the boom, so far there has been no comprehensive study of the litigation they have generated. What kinds of claims comprise this flood, and what is their role?

In a first attempt to answer this question, I analyze a sample of 230 SPAC transactions and 150 lawsuits (generated by 62 of those transactions). My sample encompasses the SPAC boom through the second quarter of 2021, which, by all accounts, seems to be the height of the frenzy. I find that the majority of claims arising from de-SPAC transactions are, perhaps unsurprisingly, so-called “merger objection” claims. These are brought either under state law, claiming violation of managers’ fiduciary duties, or under section 14 of the Securities and Exchange Act, alleging fraud in the proxy materials by which management solicited shareholders to vote for the merger. Collectively, these merger objection claims comprise over 60% of the claims in my sample, and most are filed before the merger closes.8I find 110 merger objection claims out of a total of 181 claims of the main types that I tabulate. See infra Table 3. The second main class of claims comprising the SPAC litigation deluge are broader claims under Securities and Exchange Commission (“SEC”) Rule 10b-5,917 C.F.R. § 240.10b-5. For simplicity, this Article will refer to this Rule as simply “Rule 10b-5.” the all-purpose workhorse of securities litigation, which punishes material misstatements or omissions in connection with the purchase or sale of a security. These are typically brought after the closing of the de-SPAC transaction and comprise roughly 30% of the claims in my sample.10There are 53 Rule 10b-5 claims out of 181 claims of the main types that I tabulate. See infra Table 3.

Do these lawsuits target, improve, or deter bad SPACs? Although these lawsuits are in their early days, based on my sample, they appear in fact to target quite good SPACs. In examining these transactions and claims, my main finding is an intuitively odd one: that the likelihood that a de-SPAC transaction will generate a lawsuit—any kind of lawsuit—is negatively related to redemption rate. Transactions with higher redemption rates are commonly thought to be qualitatively worse mergers and might seem likely to generate more lawsuits. My sample, however, shows the reverse: that the probability that a transaction will generate litigation rises as redemption rates decline. This finding is robust to the inclusion of a panoply of controls. An equally puzzling finding is that based on my sample, the probability of being sued appears to be unrelated to deal size, target industry, returns of the merged company, managerial savvy, or various other proxies for deal quality.

What explains this surprising relationship between lawsuits and redemption rates? There are two potential explanations. One cheery possibility is that SPAC-related lawsuits may target fraudulent transactions in which misrepresentation by the management has had the effect of inducing investors to keep, rather than redeem their shares. If this is true, SPAC-related lawsuits may be fulfilling precisely the function for which they are designed: if brought after the merger, they are vindicating the interests of shareholders who did not redeem their shares because they were lied to. And if brought before the merger, these lawsuits may halt bad deals or prompt managers to improve transactions, thus inducing shareholders to keep their shares. If these stories are true, SPAC-related lawsuits are spectacularly successful and should be encouraged.

But there are also less rosy explanations for the negative relationship between litigation probability and redemption rate. Lower redemption rates—even if they legitimately signal a stronger transaction—could mean larger classes, more purported damages, and higher lawyers’ fees, and may thus be tempting targets for lawsuits after the merger. And in the notoriously dysfunctional ecosystem of merger litigation, challenges to SPACs brought before the deal closes may be filed indiscriminately. Even more damning, they may target fraudulent transactions but nonetheless fail to induce shareholders to redeem. I argue that this is likely because these lawsuits are of low quality, as many merger challenges infamously are.11See Matthew D. Cain, Jill E. Fisch, Steven Davidoff Solomon & Randall S. Thomas, The Shifting Tides of Merger Litigation, 71 Vand. L. Rev. 603, 604–05 (2018) [hereinafter Cain et al., Shifting Tides]; Matthew D. Cain, Jill Fisch, Steven Davidoff Solomon & Randall S. Thomas, Mootness Fees, 72 Vand. L. Rev. 1777, 1777 (2019) [hereinafter Cain et al., Mootness Fees]. I find that the eventual returns on deals that generated merger objection claims are no better than those that did not, and all of the mergers in my sample closed. There thus appears to be no evidence that these lawsuits are improving de-SPAC transactions or weeding out the bad ones. Roughly half of the merger objection cases in my sample were voluntarily dismissed without litigation within the sample period. The few settlements that are publicly disclosed are for paltry fees. And they are generally filed prolifically and sloppily by a handful of entrepreneurial plaintiffs’ firms composed of fewer than ten lawyers each.

The merger litigation in my sample is the product of a deliberate swap whereby SPACs substitute liability under the merger regime for the heavy cudgel of section 11 liability. In general, we are most concerned about misrepresentations precisely when firms are new to the market and have no history for shareholders to assess; this explains the rigidity of section 11 of the Securities Act, which provides virtually strict liability for misstatements and omissions in a registration statement, which firms file when they conduct an IPO. Because they are shells when they go public, SPACs have nothing to disclose in a registration statement and thus avoid exposure under section 11. To assess the effects of this “SPAC litigation swap,” I compare pre-closing merger objection claims in my SPAC sample to section 11 claims in a sample of IPOs over the same period. I find that in substituting liability under the Securities Act for liability under the merger regime, SPACs have, in terms of sheer volume, fallen from the frying pan into the fire. The number of merger objection claims in 2020 and 2021 is equal to nearly 70% of the number of de-SPAC transactions (as opposed to less than 9% of the IPOs that drew section 11 claims across my sample). But while the merger challenges in my sample have so far disclosed only a few settlements with a mean of roughly $200,000, the mean section 11 settlement in my sample—including cases that were dismissed or are ongoing—is roughly $4.5 million. All of this suggests that merger litigation, especially before the deal closes, may not perform the same policing function as section 11 liability. These cases are pervasive and cheap to settle, meaning that shareholders may not take them seriously as a signal of misconduct, and thus do not redeem their shares. Moreover, the penalties these lawsuits impose are likely too low to induce managers to be truthful.

If SPACs have problems in need of correction, it is unclear whether private litigation is up to the job. Pre-closing merger objection cases, which constitute much of the SPAC litigation deluge, seem to reflect an effort to collect many small fees for small plaintiffs’ lawyers and are unlikely to result in better deals. And although their ultimate impact is unclear, even some lawsuits brought after the merger, largely under Rule 10b-5, may be calculated to procure a high fee for plaintiffs’ counsel, rather than targeting true misconduct. If the SPAC structure constitutes a deliberate bargain by which public investors receive the opportunity to invest in potentially high-risk-high-reward ventures normally unavailable to them in return for less information, perhaps this is no cause for concern. On this view, SPAC litigation may be working as designed; expensive section 11 lawsuits are virtually unknown and are replaced with merger objection claims. Many of these claims, though brought in high volume, are low impact, and the nominal fees to plaintiffs’ counsel are a “tax” that all deal participants are willing to pay. However, if we view SPACs as vehicles for debuting firms to the public market that should disclose information of the same quality to their investors as is required in other IPOs, SPAC-related litigation currently appears unlikely to adequately fulfill this role. While government intervention could be an avenue to remedying any deficiencies in SPACs, reform of shareholder litigation—most urgently, pre-closing merger challenges—is necessary to help police future market innovations.

This Article proceeds as follows. Part I provides some brief background on SPACs and the litigation they have generated. Part II describes my data. Part III discusses descriptive statistics and empirical analysis, and Part IV investigates potential explanations for the odd inverse relationship between litigation probability with redemption rate. Part V compares merger liability in SPACs to section 11 liability in IPOs. Part VI assesses policy implications.

I. BACKGROUND

A special purpose acquisition company, or SPAC, is a public company formed by a sponsor that, following its IPO, has no operations except to search for a non-public firm to merge with and thereby bring public.12Minmo Gahng, Jay R. Ritter & Donghang Zhang, SPACs 1 (July 13, 2022) (unpublished manuscript), https://ssrn.com/abstract=3775847 [https://perma.cc/XFX4-GQ3L]. SPACs originated in the 1980s as “blank check” companies, which often made speculative investments and were considered penny stocks.13See 17 C.F.R. § 230.419(a)(2)(ii) (2021). The SEC adopted tighter controls on blank check companies in 1990 with the Penny Stock Reform Act and Rule 419.14William K. Sjostrom, Jr., The Truth About Reverse Mergers, 2 Entrepreneurial Bus. L.J. 743, 756–57 (2008) (describing blank check companies as “vehicles frequently used by boiler rooms in the 1980s for ‘pump and dump’ schemes”). While SPACs have evolved in various ways (in some cases, specifically to avoid the strictures of Rule 419), many of their defining characteristics remain the same. In an IPO, SPACs typically sell units priced at $10. Most units consist of a share and a warrant entitling the holder to buy some percentage of a share for $11.50 at a date five years after the completion of the merger.15Gahng et al., supra note 12. SPACs hold the proceeds from the IPO in an escrow account, in which they accumulate interest.16The escrow funds are typically invested in short-term treasury bonds or qualifying money-market funds while the SPAC searches for a merger target. This feature has prompted recent arguments that SPACs are in fact illegal investment funds that should be registered under the Investment
Company Act of 1940. See Over 60 of the Nation’s Leading Law Firms Respond to Investment Company
Act Lawsuits Targeting the SPAC Industry, Ropes & Gray (Aug. 27, 2021), https://www.ropesgray.
com/en/newsroom/alerts/2021/August/49-of-the-Nations-Leading-Law-Firms-Respond-to-Investment-Company-Act-Lawsuits [https://perma.cc/7G3L-WQU4].
The sponsors of the SPAC have no access to the escrow account.17Gahng et al., supra note 12. Sponsors usually retain 20% of the shares (called the “sponsor promote”) as compensation, which are contingent on completing a merger.18Id. at 1–2.

SPACs cannot form already having identified a merger target, and usually set an 18-to-24-month deadline to find and merge with a target company.19Id. at 2. Once a target is identified, shareholders of the SPAC are informed and must vote to approve the merger. Independently however, shareholders must also decide whether to keep or redeem their shares; those who redeem are entitled to the price paid for the units, plus a pro rata share of any interest accrued in the trust.20See id. Notably, shareholders who choose to redeem their shares can nonetheless both vote for the transaction and keep their warrants (which are unbundled from the shares to trade separately following the IPO).21Id. at 1–2; see also Redemption Rights at SPACs, Greenberg Traurig (June 28, 2021), https://www.gtlaw.com/en/insights/2021/6/published-articles/redemption-rights-bij-spacs [https://perma.
cc/EG63-JB3L].
If, however, the SPAC managers fail to identify a merger target in the specified time period, the trust must be liquidated and paid out to shareholders, and sponsors receive nothing (although extensions of time are sometimes negotiated with shareholders).22Gahng et al., supra note 12, at 2. Since it is not certain how many shareholders will choose to redeem their shares prior to the merger, de-SPAC transactions are negotiated to include a minimum amount of cash, and to meet this need, SPAC sponsors frequently arrange for private investment in public equity (“PIPE”) funding.23Id. Private investment in public entity (“PIPE”) investors are generally private equity funds, hedge funds, and other accredited investors that buy a minority stake in a SPAC. Because one purpose of PIPE investment is to lock in a certain amount of capital in case of heavy redemptions, PIPE investors do not have redemption rights and generally receive their shares at a discount on the market price.

The popularity of the SPAC skyrocketed in 2020, raising as much cash in that year as in the entire preceding decade,24Klausner et al., supra note 4, at 230. and the first quarter of 2021 alone saw more SPACs created and more money invested than in the entirety of 2020.25Bazerman & Patel, supra note 1. This meteoric rise has slowed since the second quarter of 2021,26See, e.g., Alex Rankine, The Stock Market’s “Spac” Boom Is Slowing, MoneyWeek (July 16, 2021), https://moneyweek.com/investments/stockmarkets/603564/spac-boom-is-slowing [https://perma.
cc/CAG4-R5AT]; Paul R. La Monica, Welcome to the Big SPAC Slowdown, CNN Bus. (Aug. 25, 2021), https://www.cnn.com/2021/08/25/investing/spacs-slowdown/index.html [https://perma.cc/R5SX-8AWR].
although many previously formed SPACs are still shopping for merger partners, and new SPACs are continuing to form. The slowdown is likely the result not only of increased regulatory attention,27Rankine, supra note 26; La Monica, supra note 26. but of heightened overall scrutiny of SPAC performance.

A. Accounts of the 2020–2021 SPAC Boom

Several expert accounts have painted damning portraits of the SPACs that are the product of the most recent bubble. Using a sample of all forty-seven SPACs that merged between January 2019 and June 2020, Michael Klausner, Michael Ohlrogge, and Emily Ruan found that the structure of SPACs creates substantial costs, misaligned incentives, and on the whole, losses for investors who own shares at the time of SPAC mergers (that is to say, those who do not redeem their shares). 28Klausner et al., supra note 4, at 233–34. These authors evaluated claims about the advantages SPACs offer over traditional IPOs: that they provide a less expensive, faster, and more certain route to the public markets; that they provide a path to being a public company that is often unavailable to firms engaged in complex or uncertain businesses that may be difficult to value; and that they provide more equal access to ordinary investors than alternatives such as private equity.29Id. at 230–35. The authors found that SPACs fail on all fronts, largely because of the dilution built into the SPAC structure, and that SPACs tend to drop by at least one third of their value within a year of the merger.30See id.

Minmo Gahng, Jay R. Ritter, and Donghang Zhang, in examining SPACs from 2010 to 2020, similarly found that SPAC returns tend to be negative, though cash-weighted returns are less so.31See Gahng et al., supra note 12, at 4. They also found that going public via a SPAC transaction is much more expensive for a private firm than undertaking a traditional IPO.32Id. at 6. However, they found that sponsors take large haircuts and underwriters of SPACs surrender substantial commissions for weak deals, meaning when a SPAC experiences high redemption rates after announcing a merger.33Id. at 6–7.

Finally, in a concise overview of the SPAC market as of the summer of 2021, Max H. Bazerman and Paresh Patel also suggested that the SPACs of 2020–2021 are not monolithic in type and have in fact been evolving over the course of the boom. The authors suggested that although not all SPACs are successful, they offer financing opportunities “that compete with later-stage venture capital, private equity, direct listings, and the traditional IPO process [and provide] an infusion of capital to a broader universe of start-ups and other companies, fueling innovation and growth.”34Bazerman & Patel, supra note 1, at 104. They noted that although SPACs had a “questionable start” as blank check companies and have existed as a “niche” “cottage industry” for most of the intervening years, the format has gone mainstream.35Id. at 105.

B. The SPAC Litigation Deluge

The structure of a SPAC lends itself to various critiques, and thus, legal claims. De-SPAC transactions occur, by definition, under significant time pressure. Moreover, neither the management (via the sponsor promote) nor the underwriter (via commissions) of the SPAC are compensated at all unless a transaction actually occurs. This setup creates incentives for sponsors to close a transaction—perhaps any transaction—rather than liquidate the trust. Additionally, the twenty-four-month time limit means that the SPAC may lack the means or opportunity to conduct sufficient diligence on the target or adequately evaluate the full universe of merger options. Moreover, as the end of the time limit approaches, targets may have increased leverage in negotiations with the SPAC, and though it may serve the sponsors and managers to close the deal before the deadline, shareholders might be better served by extending the deadline or waiting for a more attractive deal.36See Press Release, Paul Munter, Acting Chief Acct., U.S. Sec. & Exch. Comm’n, Financial Reporting and Auditing Considerations of Companies Merging with SPACs (Mar. 31, 2021), https://www.sec.gov/news/public-statement/munter-spac-20200331 [https://perma.cc/ZUK8-54H3]; Press Release, U.S. Sec & Exch. Comm’n Div. Corp. Fin., Staff Statement on Select Issues Pertaining to Special Purpose Acquisition Companies (Mar. 31, 2021), https://www.sec.gov/news/public-statement/
division-cf-spac-2021-03-31 [https://perma.cc/23SP-KP5N]; Caitlyn M. Campbell, Surge in SPACtivity Leads to Litigation and Regulatory Risks, Nat’l L. Rev. (Apr. 5, 2021), https://www.natlawreview.
com/article/surge-spactivity-leads-to-litigation-and-regulatory-risks [https://perma.cc/9749-YA6H].
PIPE financing may give rise to additional conflicts of interest by allowing a merger to close at the cost of substantial dilution for the original SPAC shareholders.37See Campbell, supra note 36.

Failure to disclose these conflicts could lead to lawsuits under the securities laws.38Id. Moreover, SPACs are vulnerable to lawsuits alleging that sponsors breached their fiduciary duties in pursuing the merger, and they may face greater judicial scrutiny of these claims because the SPAC directors may not be disinterested.39See Michael Klausner & Michael Ohlrogge, SPAC Governance: In Need of Judicial Review 3 (N.Y.U. Working Paper No. 22-07, 2021), https://ssrn.com/abstract=3967693 [https://perma.cc/TX83-67Y7]; see also In re MultiPlan Corp. Stockholders Litigation, 268 A.3d 784, 784–85 (Del. Ch. 2022); Christopher Kercher, Ellison Ward Merkel, Andrew Rossman & R. Brian Tommons, Litigation Risk in the SPAC World, JDSupra (Oct. 1, 2021), https://www.jdsupra.com/legalnews/litigation-risk-in-the-spac-world-88058 [https://perma.cc/EFF3-P8EE] (noting that the business judgment rule is unlikely to apply, and the duties of SPAC management in de-SPAC transactions are likely to be reviewed under an “entire fairness” standard). The de-SPAC transaction may be subject to lawsuits both before and after the merger under Rule 14a-9, which governs the content of proxy disclosures. Broadly, SPACs (and their targets, once the transaction closes) are also public companies and therefore subject to the reporting requirements and fraud prohibitions of the federal securities laws, and thus, false statements and inadequate periodic disclosures may give rise to liability.40Rodrigues & Stegemoller, supra note 4, at 46.

Practitioner reports raising alarm about rising SPAC-related litigation began in earnest in spring 2021.41See, e.g., id.; Campbell, supra note 36; Glenn A. Kopp, Jason Linder, Glenn K. Vanzura
& Bradley A. Cohen, Mitigating SPAC Enforcement and Litigation Risks, Mayer Brown (Apr.
26, 2021), https://www.mayerbrown.com/en/perspectives-events/publications/2021/04/mitigating-spac-enforcement-and-litigation-risks [https://perma.cc/S8YR-2AMR]; Edvard Pettersson & Crystal Tse, Liability for SPAC Flops: Waitr Could Be Test Case for Disgruntled Investor Lawsuits, Ins. J. (Feb. 16, 2021), https://www.insurancejournal.com/news/national/2021/02/16/601362.htm [https://perma.cc
/3ELD- D8A4]; Rappaport et al., supra note 6; Todd, supra note 6; Client Memorandum from Paul
Weiss, The SPAC Litig. Boom: What SPAC Sponsors, Dirs. & Officers Can Do to Mitigate Their Exposure (Mar. 9, 2021), https://www.paulweiss.com/media/3980884/the_spac_litigation_boom_what_
spac_sponsors_directors_and_officers_can_do_to_mitigate_their_exposure.pdf [https://perma.cc/5DFB-2A57]; Varant Yegparian & Schiffer Hicks Johnson, How to Prepare for the Deluge of SPAC Litigation, JDSupra (Apr. 22, 2021), https://www.jdsupra.com/legalnews/how-to-prepare-for-the-deluge-of-spac-2261276 [https://perma.cc/76GY-BNPW]; SPAC Litigation and Enforcement Update, Baker Botts (Apr. 23, 2021), https://www.bakerbotts.com/thought-leadership/publications/2021/april/spac-litigation-and-enforcement-update-spring-2021 [https://perma.cc/7K2G-QH2Q]; Jeffrey I. Lang & Luke Appling, The Growth of SPAC-Related Litigation, Cohen & Gresser (May 27, 2021), https://www.cohengresser.
com/app/uploads/2021/05/The-Growth-of-SPAC-Related-Litigation.pdf [https://perma.cc/A2SU-BRB9].
This coincided with multiple announcements that the SEC would scrutinize SPACs more closely.42See, e.g., Press Release by Paul Munter, supra note 36; Press Release by Sec. & Exch. Comm’n. Div. Corp. Fin., supra note 36. Various journalists and industry participants have reported informal tallies,43See Frankel, supra note 7; LaCroix, supra note 7; Leo Cho, SPAC Related Filings on the Rise, stan. l. sch. Sec. Class Action Clearinghouse (June 7, 2021), https://securities.stanford.edu/news-reports/20210607-SPAC-Related-Filings-on-the-Rise.pdf [https://perma.cc/A2U5-4LD8] (reporting an increase in SPAC-related litigation as a percentage of securities class actions). but there has so far been no deep dive into the characteristics and drivers of these lawsuits. In the recent surge of SPAC litigation, which claims predominate, and why? What are the characteristics of de-SPAC transactions that generate lawsuits versus those that do not? Who are the plaintiffs and firms bringing these lawsuits? More importantly, does this surge in litigation reflect SPAC quality, and does it offer a remedy to any of the potential negative outcomes arising from the controversial features of the SPAC format? This Article makes a first attempt at answering these questions.

II. DATA

To assess the recent increase in SPAC litigation, I compose two samples: a sample of lawsuits and a sample of transactions. For my sample of lawsuits, I search Bloomberg Law for dockets in federal district courts, Delaware Chancery Court, and New York Supreme Court44I include this jurisdiction following press reports of an explosion in SPAC-related litigation in New York Supreme Courts. See Frankel, supra note 7; Rappaport et al., supra note 6. containing any references to special purpose acquisition companies. I then manually screen the complaints and dockets of these lawsuits to determine whether the lawsuit arises from a de-SPAC transaction,45I omit, for instance, bankruptcy cases and lawsuits brought by potential SPAC targets where the SPAC ultimately selected a different target. and if so, what claims are brought.46I omit from my coding all control person claims, which are almost universally brought with any claims under the federal securities laws. I also gather information on plaintiff law firms and outcomes (if any) from these dockets. My search extends from January 1, 2017, to June 30, 2021.

Next, I compose a sample of all SPAC transactions from January 1, 2014, to July 31, 2021, from SPACInsider and the SEC EDGAR website. I use this range to account for several factors. First, although most claims in my sample are securities claims that have relatively short limitations periods, there are a non-trivial number of contract claims whose statute of limitations is significantly longer. However, my 2017 docket search captured very few lawsuits involving SPACs, and only one involved a transaction that occurred before 2014. I include lawsuits that were filed before June 30, 2021, but many of the cases in my sample are merger objection cases that are filed before the closing of the transaction; accordingly, I extend my transaction sample an additional month. The SPACInsider database furnishes redemption rate, IPO proceeds, implied enterprise value, and warrants issued in the IPO for the majority of these transactions (for those that are omitted, I use SEC EDGAR). I search the websites of the target firm for each transaction to find the target’s age at the time of the de-SPAC. To assess the quality of the sponsors, I code the number of SPACs each management team has participated in at the time of the de-SPAC transaction, which I source from SPACInsider or internet searches.47In unreported specifications, I also use alternative measures of sponsor quality (or at least, notoriety) by following in the spirit of Klausner, Ohlrogge, and Ruan. See Klausner et al., supra note 4, at 251–52. Accordingly, I code whether the manager or owner of the sponsor was previously a high-level manager or founder of a U.S. or Global Fortune 500 company or a company commonly considered a household name (for example, MGM Resorts, Allergan, and Virgin Media) that previously appeared in the Fortune 500. I also code whether the sponsor is affiliated with a fund with more than $1 billion in assets under management. Id. Finally, I take data on returns and dividends for all merged companies from Compustat Daily.48Pricing data downloaded August 21, 2021.

III. DESCRIPTIVE STATISTICS AND EMPIRICAL ANALYSIS

In this Part, I first compare the characteristics of de-SPAC transactions that generated lawsuits to those that did not. I then examine the attributes of the lawsuits in my sample. Finally, I examine the associations between lawsuits and various transaction characteristics.

A. Comparing Transactions that Generated Lawsuits to Those that Did Not

De-SPAC transactions that are subject to at least one lawsuit differ along a few key dimensions from those that are not. These descriptive statistics are reported in Table 1. First, the redemption rates of the transactions that generated lawsuits are significantly lower. The average percentage of shareholders that opted to redeem their shares in de-SPAC transactions that generated lawsuits is roughly 21%; the rate is more than double in de-SPAC transactions that generated no lawsuits at nearly 45%. Perhaps unsurprisingly, the de-SPAC transactions generating lawsuits have significantly higher implied enterprise values: roughly $2.15 billion as opposed to roughly $1.4 billion for de-SPAC transactions that did not draw lawsuits. Transactions generating lawsuits also originated from SPAC IPOs with higher proceeds than those which did not generate lawsuits, with averages of roughly $369 million and $256 million, respectively. Transactions generating lawsuits offer a smaller percentage of a warrant (that can later be used to purchase a share) with each unit sold in the IPO: sued transactions offer an average of 0.44 warrants per unit, as opposed to unsued transactions, which offer an average of nearly 0.55 warrants per unit. Consistent with industry commentary, the age of the target firm is lower for de-SPAC transactions that generate lawsuits with an average of twelve years versus nearly forty-three. However, the median target firm ages are nine and ten years, respectively, and therefore this difference is not significant.49I note that firm age is also largely (though not entirely) likely to track whether the firm is pre-revenue, which may also indicate a riskier target. Finally, de-SPAC transactions generating lawsuits take about one month longer on average to complete, at 22.4 versus 21.36 months. Although I compare and report the average three-day, seven-day, fourteen-day, thirty-day, and ninety-day post-merger returns for de-SPAC transactions that were sued versus those that were not, there is no statistically significant difference.

Table 1.  Characteristics of Sued and Unsued De-SPAC Transactions

 

Sample of Sued De-SPACs

Sample of Unsued De-SPACs

 
 

Mean

Median

Standard Deviation

Obs.

Mean

Median

Standard Deviation

Obs.

t-stat

Redemption Rate (%)

21.41995

.4683396

31.41641

62

44.83333

50.95

38.0758

168

4.3270

Implied Enterprise Value ($m)

2150.54

1283.5

2739.276

62

1399.343

925

1814.378

165

-2.3949

IPO Proceeds ($m)

369.3452

250

347.367

62

255.9107

229.5

178.7456

168

-3.2349

Warrants

.4442623

.5

.2057031

61

.5477912

.5

.2635458

166

2.7721

Target age at SH vote (years)

12.09677

9

11.54849

62

42.92216

10

218.1146

167

1.1107

Months to complete

22.40323

24

2.58913

62

21.3631

24

3.745537

168

-2.0148

3-day returns

-.0467387

-.0716895

.1475693

28

-.0051028

-.0234741

.1691097

81

1.1585

7-day returns

-.049506

-.1012345

.2070382

53

-.0304111

-.054142

.3861426

142

0.3418

14-day returns

-.0338234

-.0839047

.2580448

54

-.0118041

-.0599034

.653251

141

0.2403

30-day returns

-.0358618

-.1304813

.3650061

29

-.0495576

-.0575954

.2386171

77

-0.2258

90-day returns

-.0484598

-.1514094

.4531122

36

-.0102963

-.0514618

.4307399

84

0.4379

Notes: The sample of sued transactions includes all lawsuits involving a de-SPAC transaction filed between January 1, 2017, and June 30, 2021, located on a Bloomberg Law search of all federal, Delaware Chancery, and New York Supreme Court dockets. The full sample includes all de-SPAC transactions completed between January 1, 2014, and July 31, 2021, sourced from SPACInsider.

            

In Table 2, I report the most common industries represented among the target firms of sued and unsued de-SPAC transactions, based on Fama-French 48 industry classification. The business services industry supplies the target firms for many de-SPAC transactions that are sued and many that are not: 26.67% and 24.85% respectively. While 13.3% of the target firms in sued transactions come from the automotive industry, only 5.45% of the target firms that do not generate lawsuits are in this industry. Similarly, a higher percentage of target firms that draw lawsuits are involved in electrical equipment (8.33%) than those that do not (1.82%). Conversely, transactions involving petroleum and natural gas targets comprise 4.85% of the unsued transactions, but none of the transactions generating lawsuits. While 10.3% of unsued transactions involve pharmaceutical targets, only 3.3% of the sued transactions do. It is interesting to note that across almost all industries reported in this table, redemption rates are higher, sometimes significantly, for the transactions that are not sued than for those that are (the exception is the electrical equipment industry, which furnished just 1.82% of the target firms for unsued de-SPAC transactions).

Table 2.  Industries of Target Firms in Sued and Unsued De-SPAC Transactions Based on Fama-French 48 Classification

 

Sample of Sued De-SPAC Transactions

Sample of Unsued De-SPAC Transactions

Industry

Freq.

%

Mean Redem. Rate

Mean Ent. Value

Mean IPO Proc.

Mean Target Age

Freq.

%

Mean Redem Rate

Mean Ent. Value

Mean IPO Proc.

Mean Target Age

Pharmaceutical Products

2

3.33

7

405

93

3

17

10.30

50

615

124

7

Business Services

16

26.67

36

2571

463

14

41

24.85

43

1832

324

69

Automobiles and Trucks

8

13.33

4

2820

512

7

9

5.45

18

2161

258

29

Retail

4

6.67

16

1213

278

19

7

4.24

71

507

131

15

Petroleum and Natural Gas

0

0

    

8

4.85

32

955

312

7

Entertainment

3

5

52

760

149

4

6

3.64

53

1137

266

15

Electrical Equipment

5

8.33

41

1291

294

26

3

1.82

20

964

185

18.3

Banking

3

5

27

2689

397

9

6

3.64

43

3367

241

15

Other

19

31.67

    

68

41.2

    

Notes: This table includes only industries that comprise roughly 5% or more of the target firms in either sample.

 

B. Incidence and Attributes of SPAC-Related Lawsuits

In Tables 3, 4 and 5, I report the incidence and characteristics of the lawsuits in my sample. Table 3 reports lawsuits by claim, court, and year. The most prominent observation is the dearth of SPAC-related lawsuits in 2017, 2018 and 2019, the dramatic rise from 2019 to 2020 (from nine lawsuits to forty-six), and the near-doubling of that number in the first half of 2021 alone (to eighty-nine).50I count each separately filed complaint as a lawsuit, even though some are eventually consolidated. However, I do not count twice lawsuits that are simply transferred to other jurisdictions. Lawsuits filed in federal and New York Supreme Courts are largely responsible for this spike (although Delaware has seen a surprising rise—from zero to thirteen lawsuits—in the first half of 2021). The rise in lawsuits generally corresponds with the increase in total SPAC transactions, although the lawsuit curve is shorter and steeper in 2020 and 2021. Notably, the increase in lawsuits corresponds with a sizable decrease in the average redemption rate for all de-SPAC transactions in 2020 and the first half of 2021.

I further tabulate the claims for each lawsuit (I note here that many lawsuits have more than one claim). I report only the most common claims, which are Rule 10b-5 claims, Rule 14a-95117 C.F.R. § 240.14a-9. For simplicity, this Article will refer to this Rule as simply “Rule 14a-9”). claims, state law claims for breach of fiduciary duty, and contract claims.52For comparative purposes, I also report claims under section 11 of the Securities Act (“section 11”). I omit less common claims from this tabulation, such as unjust enrichment, waste, and common law fraud. I also do not report control person liability claims, which are almost universally brought along with any claim under the federal securities laws. The most dramatic jump has been in claims brought under state fiduciary law in 2020 and 2021, followed by Rule 10b-5 claims. Rule 14a-9 claims experienced a less pronounced jump, and contract claims have remained low and relatively stable over all the years of the sample. section 11 claims have predictably been virtually nonexistent.

Table 3.  Lawsuits for Sued Transactions by Main Claim Types (January 1, 2017–June 30, 2021)

Year

10b5 Claims

Sec. 11 Claims

14a Claims

State Fid. Duty Claims

Contract Claims

Total Lawsuits

Federal Court

Del. Ch.

Court

NY Sup.

Court

Total de-SPACs

Mean Redem.

Rate of Total de-SPACs (%)

2017

1

0

0

1

2

4

1

1

2

13

47.39

2018

0

0

0

0

1

2

1

1

0

22

54.05

2019

5

2

8

0

0

9

9

0

0

28

64.96

2020

9

0

11

30

8

46

21

0

25

64

37.45

2021 (as of June 30)

38

1

18

42

4

89

56

13

20

85*

22.98

Total

53

3

37

73

15

150

88

15

47

212

 

Notes: Each individual lawsuit may have more than one main claim type. All are tabulated here. Less common claim types, such as unjust enrichment and corporate waste, have been omitted from this table. *Reports transactions closed as of July 31, 2021

Figure 1 illustrates the timing of these claims. Eighty percent (88) of the claims brought under Rule 14a-9 and state fiduciary duties are brought before the de-SPAC merger, meaning that these claims seek injunctions.53Generally, these complaints also purport to seek damages in the event that the transaction is consummated. The remaining 20% (22) of these claims are brought after the merger for damages. Conversely, 92% (49) of the Rule 10b-5 claims in my sample are brought after the merger.

Figure 1. SPAC Litigation Timeline

I also report in Table 4 the transaction characteristics associated with each type of claim. Transactions associated with contract claims have by far the highest average redemption rate at 35.44%, the lowest average SPAC IPO proceeds at $247.87 million, and take the longest to complete at 23.4 months. Redemption rates for other claim types range from 13.44% to 18.73%, and IPO proceeds from $384.68 million to $412.18 million. Transactions targeted with contract and state fiduciary claims are associated with the highest implied enterprise values, at $2.612 billion and $2.521 billion; Rule 14(a) and Rule 10b-5 claims target transactions with median $2.146 billion and $2.048 billion respectively. Warrants range between 0.4 and 0.49 per unit, and average target age from 9.55 to 13 years.

Table 4.  Transaction Characteristics by Claim Type

 

10b5

14a

State Fiduciary Duty

Contract

 

Mean

Med.

Obs.

Mean

Med.

Obs.

Mean

Med.

Obs.

Mean

Med.

Obs.

Redemption Rate (%)

14.5

.1

20

18.732

.1

25

13.44197

.1

39

35.44367

24.11834

10

Implied Enterprise Value ($m)

2048.775

1046.5

20

2146.336

1267

25

2521.595

1570

39

2612.36

1504

10

IPO Proceeds ($m)

412.18

236.9

20

402.452

280

25

384.6821

280

39

247.87

230

10

Warrants

.4710526

.5

19

.49

.5

25

.3970085

.333

39

.4

.5

10

Target age at SH vote (years)

9.55

7.5

20

10.6

5

25

13

10

39

12.9

9

10

Months to complete

21.75

24

20

22.8

24

25

22.69231

24

39

23.4

24

10

Finally, in Table 5, I tabulate the plaintiff law firms associated with the total number of lawsuits, as well as with each type of claim. I classify plaintiff firms as top plaintiff firms, entrepreneurial SPAC firms, entrepreneurial emerging firms, and top defense firms. Top plaintiff firms are those that appear on the Legal 500 list for securities plaintiff litigation.54Securities Litigation: Plaintiff, Legal 500 (2022), https://www.legal500.com/c/united-states/dispute-resolution/securities-litigation-plaintiff [https://perma.cc/KR7X-RU39]. These are Berman Tobacco, Bernstein Litowitz, Grant & Eisenhofer, Labaton Sucharow, Pomerantz, Quinn Emmanuel, and Robbins Geller. Entrepreneurial SPAC firms are those that brought 5% or more of the lawsuits in my sample. These are Brodsky & Smith, Monteverde & Associates, Moore Kuehn, Pomerantz, Rigrodsky Law, and Robbins Geller. Entrepreneurial emerging firms in my sample are those identified by Klausner and Heglund.55Guest Post by Michael Klausner & Jason Hegland, Guest Post: Deeper Trends in Securities Class Actions 2006–2015, D&O Diary (June 23, 2016), https://www.dandodiary.com/2016/06/articles/
securities-litigation/guest-post-deeper-trends-in-securities-class-actions-2006-2015 [https://perma.cc/
U5H7-KLY4]. I note that their list includes Pomerantz, which I omit because it has appeared for the last two years in the Legal 500.
The emerging entrepreneurial firms in my sample are Glancy Prongay & Murray, Kahn Swick & Foti, and the Rosen Law Firm. Finally, top defense firms in my sample are King & Spaulding, McDermott Will & Emery, and Williams & Connolly.

The entrepreneurial SPAC firms lead the pack, with more than twice as many lawsuits as any other category. Notably, the primary main claims driving this result are Rule 14a-9 proxy fraud claims at twenty-one and, even more pronounced, claims under state fiduciary law at fifty-three (hereinafter I refer to these collectively as “merger objection claims”). Rule 10b-5 claims are relatively stable across the three categories of traditional plaintiff firms, with nineteen by top plaintiff firms, sixteen by entrepreneurial SPAC firms, and thirteen by other entrepreneurial emerging firms. The firms traditionally serving defendants, by contrast, are almost exclusively involved in SPAC-related lawsuits by virtue of contract claims, which form a small minority of the main claim types that I tabulate. These claims typically involve disputes among management, or between sponsors/management and PIPE investors, which likely accounts for the involvement of non-plaintiff firms.

Table 5.  Lawsuits by Firm

 

Total Lawsuits

10b5

14a

State Fiduciary Duty

Contract

Top Plaintiff Firms

27

19

4

4

1

Entrepreneurial SPAC Firms

82

23

23

55

0

Entrepreneurial Emerging Firms

14

13

4

3

0

Top Defense Firms

7

0

0

1

5

C. Empirical Analysis

Table 6 shows the results of OLS regressions in which the dependent variable is a dummy equal to one if the de-SPAC transaction generates at least one lawsuit, and the independent variable is redemption rate. I control throughout for industry using a dummy variable equal to one if the target firm of the de-SPAC transaction is in an industry found by other studies to be particularly vulnerable to securities litigation, including the biotechnology, computer hardware, electronics, retail, or computer software industries.56See Douglas J. Skinner, Earnings Disclosures and Stockholder Lawsuits, 23 J. Acct. & Econ. 249, 256 n.5 (1997); Jonathan L. Rogers & Phillip C. Stocken, Credibility of Management Forecasts, 80 Acct. Rev. 1233, 1257 (2005); Francois Brochet & Suraj Srinivasan, Accountability of Independent Directors: Evidence from Firms Subject to Securities Litigation, 111 J. Fin Econ. 430, 448 (2014). I also control for the log of implied enterprise value of the transaction to assess whether lawsuits are driven by the estimated value of the deal. I control for the target age in years to test anecdotal claims that plaintiff lawyers target de-SPAC transactions involving younger firms. I also control for several proxies for the quality of the de-SPAC merger, including the log of the IPO proceeds of each transaction57See id.; see also Klausner et al., supra note 4, at 240 (using IPO proceeds in alternative specifications as a measure for SPAC quality). and the number of warrants per unit issued in the IPO.58Bazerman & Patel, supra note 1, at 106 (characterizing SPACs that “raised relatively small amounts of capital and offered higher-than-average warrants as an incentive to entice investors” as “indications of lower-quality sponsor teams”). I note in addition that the warrants per share may impact redemption rate; the reason is that if shareholders retain less upside without their shares, they may be less likely to redeem. The pairwise correlation between redemption rate and warrants is 0.31. In any case, I control for both variables, thereby addressing concerns that the redemption variable may be capturing the warrants per share. Finally, I use the number of SPAC transactions the management team had completed prior to the closing of the transaction as a proxy for quality of the management team.59As an alternate measure of sponsor quality, in unreported specifications I use a dummy equal to one if the sponsor owner or manager was the owner or manager of a well-known company and a dummy equal to one if the sponsor was affiliated with a fund with over $1 billion in assets under management. The coefficient on redemption rate is similar and also significant at the 5% level. All regressions include robust standard errors. To control for the possibility that plaintiffs simply brought more lawsuits at the same time due to some unobserved trend, all regressions include year fixed effects.

The coefficient on redemption rate is negative and statistically significant in all specifications. The coefficient on redemption rate is relatively stable across specifications and is significant at the 5% level in the final specification which includes the most controls (it is significant at the 1% level in several other specifications). The likelihood of a lawsuit rises roughly 0.26% for every 1% decrease in redemption rate. In other words, a de-SPAC transaction with a 25% redemption rate is 13% more likely to generate at least one lawsuit than a transaction in which 75% of the shareholders redeem their shares in advance of the merger.

Table 6.  Probability of a Lawsuit

 

(1)

(2)

(3)

(4)

(5)

(6)

 

Lawsuit Dummy

Lawsuit Dummy

Lawsuit Dummy

Lawsuit Dummy

Lawsuit Dummy

Lawsuit Dummy

Redemption Rate

-0.00309***

-0.00270**

-0.00281**

-0.00277**

-0.00262**

-0.00262**

 

(-4.05)

(-3.18)

(-3.30)

(-3.23)

(-3.01)

(-3.00)

Vulnerable Industry Dummy

0.0153

0.0414

0.0472

0.0472

0.0553

0.0575

 

(0.25)

(0.64)

(0.73)

(0.73)

(0.84)

(0.86)

Log Enterprise Value ($m)

 

0.0431

0.0465

0.0312

0.0259

0.0274

  

(1.45)

(1.56)

(0.75)

(0.61)

(0.63)

Target Age (years)

  

-0.000277**

-0.000257*

-0.000222

-0.000222

   

(-2.68)

(-2.24)

(-1.92)

(-1.91)

Log IPO Proceeds

   

0.0298

0.0313

0.0338

    

(0.51)

(0.51)

(0.55)

Warrants

    

-0.138

-0.142

     

(-1.14)

(-1.17)

Repeat Player

     

-0.00703

      

(-0.29)

Year Fixed Effects

Yes

Yes

Yes

Yes

Yes

Yes

_cons

0.211**

-0.657

-0.714

-0.956

-0.784

-0.850

 

(3.01)

(-1.08)

(-1.17)

(-1.23)

(-0.89)

(-0.94)

r2_a

0.0658

0.0714

0.0758

0.0724

0.0750

0.0710

N

230

227

226

226

223

223

Notes: T statistics in parentheses – * p < 0.05, ** p < 0.01, *** p < 0.001.

I then bifurcate the sample based on claim type. Table 7 shows the results of OLS regressions in which the dependent variable is a dummy equal to one if the de-SPAC transaction generates at least one merger objection claim (either a state fiduciary duty claim or a Rule 14a-9 claim) after dropping transactions that generate lawsuits but no merger objection claims.

Table 8 shows the results of OLS regressions in which the dependent variable is a dummy equal to one if the de-SPAC transaction generated at least one claim under Rule 10b-5 after dropping transactions that generate lawsuits but no Rule 10b-5 claims. I use identical controls as in Table 6,60Vulnerable Industries Dummy equals one if the merged firm belongs to the biotechnology, computer hardware, electronics, retail, or computer software industries. These industries have been found by previous studies to be particularly vulnerable to securities litigation. See Skinner, supra note 56; Rogers & Stocken, supra note 56; Brochet & Srinivasan, supra note 56. Repeat Player is the number of SPAC transactions in which the management team previously participated. All regressions include robust standard errors and year fixed effects. and all regressions include year fixed effects. Although the samples are small, the coefficients on redemption rate are very similar to the one I find in Table 6. A 1% increase in redemption rate corresponds with a 0.25% decrease in the likelihood of a merger objection claim and a 0.22% decrease in the likelihood of a Rule 10b-5 claim.

Table 7.  Probability of a Merger Objection Claim

 

(1)

(2)

(3)

(4)

(5)

(6)

 

Merger Objection Dummy

Merger Objection Dummy

Merger Objection Dummy

Merger Objection Dummy

Merger Objection Dummy

Merger Objection Dummy

Redemption Rate

-0.00294***

-0.00267**

-0.00275***

-0.00266**

-0.00254**

-0.00254**

 

(-4.07)

(-3.31)

(-3.38)

(-3.24)

(-3.08)

(-3.06)

Vulnerable Industry Dummy

-0.0458

-0.0244

-0.0199

-0.0199

-0.0122

-0.0122

 

(-0.80)

(-0.40)

(-0.33)

(-0.33)

(-0.20)

(-0.19)

Log Enterprise Value ($m)

 

0.0312

0.0343

0.00555

0.00668

0.00670

  

(1.09)

(1.19)

(0.14)

(0.16)

(0.16)

Target Age (years)

  

-0.000200*

-0.000159

-0.000144

-0.000144

   

(-2.45)

(-1.69)

(-1.51)

(-1.51)

Log IPO Proceeds

   

0.0581

0.0499

0.0500

    

(1.02)

(0.84)

(0.84)

Warrants

    

-0.103

-0.104

     

(-0.87)

(-0.87)

Repeat Player

     

-0.0000783

      

(-0.00)

Year Fixed Effects

Yes

Yes

Yes

Yes

Yes

Yes

_cons

0.213**

-0.415

-0.468

-0.967

-0.769

-0.769

 

(3.07)

(-0.71)

(-0.79)

(-1.29)

(-0.88)

(-0.87)

r2_a

0.0952

0.0965

0.0992

0.0990

0.101

0.0963

N

215

212

211

211

209

209

Notes: T statistics in parentheses – * p < 0.05, ** p < 0.01, *** p < 0.001.

 

Table 8.  Probability of a Rule 10b-5 Claim

 

(1)

(2)

(3)

(4)

(5)

(6)

 

10b-5 Dummy

10b-5 Dummy

10b-5 Dummy

10b-5 Dummy

10b-5 Dummy

10b-5 Dummy

Redemption Rate

-0.00237***

-0.00223***

-0.00231***

-0.00232***

-0.00218**

-0.00221**

 

(-3.74)

(-3.42)

(-3.48)

(-3.49)

(-3.21)

(-3.22)

Vulnerable Industry Dummy

-0.0277

-0.0134

-0.00745

-0.00756

-0.00170

0.00214

 

(-0.64)

(-0.30)

(-0.16)

(-0.17)

(-0.04)

(0.04)

Log Enterprise Value ($m)

 

0.0166

0.0188

0.0263

0.0252

0.0287

  

(0.77)

(0.86)

(0.97)

(0.91)

(1.03)

Target Age (years)

  

-0.000181*

-0.000190*

-0.000167*

-0.000167*

   

(-2.48)

(-2.41)

(-2.19)

(-2.19)

Log IPO Proceeds

   

-0.0141

-0.00571

-0.00207

    

(-0.27)

(-0.11)

(-0.04)

Warrants

    

-0.0531

-0.0629

     

(-0.52)

(-0.62)

Repeat Player

     

-0.0150

      

(-1.25)

Year Fixed Effects

Yes

Yes

Yes

Yes

Yes

Yes

_cons

0.170**

-0.165

-0.201

-0.0896

-0.190

-0.306

 

(2.88)

(-0.38)

(-0.46)

(-0.13)

(-0.23)

(-0.37)

r2_a

0.0956

0.0952

0.0974

0.0926

0.0867

0.0851

N

188

185

184

184

181

181

Notes: T statistics in parentheses – p < 0.05, ** p < 0.01, *** p < 0.001.

In Table 9, I drop de-SPAC transactions that generated no lawsuits and regress the number of lawsuits per transaction on redemption rate, including the same controls. There is no statistically significant association between the number of lawsuits and redemption rate, although interestingly, the number of lawsuits is negatively related to the vulnerable industry dummy at the 10% significance level in every specification. Less surprisingly, the number of lawsuits per transaction is positively related to the log of the implied enterprise value of the merger at the 10% significance level in the specification including all controls. It seems, accordingly, that while the likelihood of being sued at all is unrelated to deal size, in the subset of transactions that draw at least one lawsuit, large deals draw more litigation than small deals. I note, however, that the number of observations is small, and therefore the predictive power of these regressions may be limited.

Table 9.  Number of Lawsuits in Sued De-SPAC Transactions

 

(1)

(2)

(3)

(4)

(5)

(6)

 

Lawsuits

Lawsuits

Lawsuits

Lawsuits

Lawsuits

Lawsuits

Redemption Rate

-0.00740

-0.00256

-0.00156

-0.00190

-0.00162

-0.00146

 

(-1.18)

(-0.42)

(-0.29)

(-0.35)

(-0.31)

(-0.29)

Vulnerable Industry Dummy

-0.905*

-0.894*

-0.858*

-0.849*

-0.862*

-0.823*

 

(-2.34)

(-2.50)

(-2.40)

(-2.33)

(-2.33)

(-2.32)

Log EnterpriseValue ($m)

 

0.525**

0.578***

0.626*

0.648*

0.675**

  

(3.28)

(3.76)

(2.49)

(2.66)

(2.75)

Target Age (years)

  

-0.0132

-0.0117

-0.0127

-0.0143

   

(-0.64)

(-0.47)

(-0.47)

(-0.51)

Log IPO Proceeds

   

-0.100

-0.210

-0.128

    

(-0.21)

(-0.42)

(-0.25)

Warrants

    

-0.590

-0.710

     

(-0.64)

(-0.72)

Repeat Player

     

-0.109

      

(-0.83)

_cons

2.788***

-8.318*

-9.301**

-8.370

-6.413

-8.418

 

(7.55)

(-2.47)

(-2.96)

(-1.42)

(-1.02)

(-1.22)

r2_a

0.0485

0.123

0.113

0.0982

0.0792

0.0693

N

62

62

62

62

61

61

Notes: T statistics in parentheses – * p < 0.05, ** p < 0.01, *** p < 0.001

In Table 10, I regress three, seven, fourteen, thirty, and ninety-day returns for the merged firms in my sample on number of lawsuits, controlling for the age of the target firm to assess whether there is a relationship between number of lawsuits and the ultimate success of the merger.61I also collect one-year and three-year returns, but my sample yields insufficient observations for these timeframes. There is no statistically significant association in any specification. Finally, to assess whether the likelihood of a 10b-5 lawsuit is related to returns on the merger, in unreported results, I regress a dummy equal to one if the transaction generated a 10b-5 claim on three, seven, fourteen, thirty, and ninety-day returns, controlling for firm age. There is no statistically significant association with returns in any specification.62I emphasize here that the sample of 10b-5 claims is small, so these results may lack predictive power.

Table 10.  Returns

 

(1)

(2)

(3)

(4)

(5)

 

Day 3 Returns

Day 7 Returns

Day 14 Returns

Day 30 Returns

Day 90 Returns

Total Lawsuits

-0.0117

0.00131

0.0176

0.0110

-0.0109

 

(-1.18)

(0.10)

(1.12)

(0.47)

(-0.43)

Target Age (years)

0.00000827

0.0000567*

0.0000455*

0.000174

0.0000264

 

(0.69)

(2.21)

(2.28)

(0.21)

(0.45)

_cons

-0.00853

-0.0520**

-0.0628***

-0.0583

-0.0170

 

(-0.50)

(-3.09)

(-3.85)

(-1.71)

(-0.40)

r2_a

-0.00569

-0.00888

0.00496

-0.0155

-0.0154

N

108

194

194

105

120

Notes: T statistics in parentheses – * p < 0.05, ** p < 0.01, *** p < 0.001. Returns are winsorized at the 1% level.

 

IV. THE ODDNESS OF THE SPAC LITIGATION DELUGE

The punchline result from my sample is an intuitively paradoxical one: that the likelihood of being sued increases for SPACs with lower redemption rates. One might more naturally expect to see lawsuits arising from de-SPAC transactions of lower quality; a high redemption rate would signal that the SPAC investors lack confidence in the proposed merger, and thus that the deal is likely to be worse. Intuitively, it seems that this should generate more lawsuits, not fewer.

There are several possible explanations for this result. First, it is possible that the transactions drawing lawsuits do, in fact, involve fraud. The fraud could induce investors not to redeem, meaning that the low redemption rates reflect the fact that shareholders were lied to, rather than the quality of the deal. This thesis is complicated somewhat by the fact that roughly half of the claims in my sample are merger objection claims that were brought before the transaction closed and shareholders redeemed their shares. In this scenario, it is possible that the lawsuit revealed the fraud and actually induced truthful disclosures that strengthened investors’ confidence in the deal, thereby inducing them not to redeem. Finally, it is possible that that plaintiffs are not targeting SPACs involving fraud, and the negative association between litigation likelihood and redemption rate arises for reasons unrelated to the quality of de-SPAC deals as measured by redemption rate. I assess each of these possibilities below.

A. SPAC Litigation Targets Fraudulent Transactions

One possible explanation for the negative association between the likelihood of a lawsuit and redemption rate is that the transactions that are challenged involve misrepresentations. If the management lies to its shareholders about the quality of its target and diligence, the shareholders may be confident that the deal is a good one and decline to redeem their shares—but this confidence is misplaced. The discovery of such misrepresentations could give rise to lawsuits even where shareholder redemption rates are low.

1. Post-Closing Lawsuits: (Mostly) Rule 10b-5

This seems like a straightforward explanation for SPAC-related lawsuits that are brought after the transaction closes. The majority of such claims in my sample are brought under Rule 10b-5.63Only twenty-two merger objection claims in my sample are brought after the merger. Of these twenty-two claims, eleven are bundled with 10b-5 claims, leaving only eleven standalone post-closing merger objection claims. By contrast, all forty-nine out of fifty-three Rule 10b-5 claims in my sample are brought after the merger. These claims generally allege a sequence of misconduct that began before the merger and continued through the de-SPAC transaction. Although the recency of the litigation makes it difficult to assess, there are some early signs that these cases may have merit. While half of the merger objection lawsuits in my sample were voluntarily dismissed within the sample period, only a single lawsuit containing a 10b-5 claim was dismissed (out of fifty-three 10b-5 claims). The 10b-5 lawsuits appear to target the de-SPAC transactions with the youngest target companies and the shortest completion timelines, potentially indicating risky business combinations or subpar diligence. Moreover, this is the area in which highly ranked plaintiffs’ law firms have become involved in SPAC-related litigation. The top ranked firms in my sample filed nineteen Rule 10b-5 claims, as compared with four 14(a) claims and four fiduciary duty claims. However, because so few outcomes are available as a result of the longer litigation timeline for these cases, it is difficult to evaluate their quality. The Rule 10b-5 claims in my sample also usually involve defendant SPACs with the highest IPO proceeds (often associated with higher quality deals).64See Bazerman & Patel, supra note 1, at 110. The average redemption rate of deals that draw 10b-5 lawsuits is only 14.5%. And finally, I find no statistically significant association between 10b-5 claims and returns on deals in any period. While this finding could be affected by the size of the sample and the availability of information, it may suggest that such lawsuits do not focus on the deals in which investors made out the worst.

There also appear (perhaps unsurprisingly) to be links between Rule 10b-5 claims and public enforcement regarding SPACs.65See Emily Strauss, Is Everything Securities Fraud?, 12 U.C. Irvine L. Rev. 1331, 1338 (2022). Rule 10b-5 claims began to climb in earnest in the first half of 2021, when the SEC announced that it would be scrutinizing SPAC deals more carefully.66See, e.g., Mark Schoeff Jr., Booming SPAC Market Draws SEC Scrutiny, InvestmentNews (Mar. 11, 2021), https://www.investmentnews.com/booming-spac-market-draws-sec-scrutiny-203844 [https://perma.cc/MF4N-WL6Y]; Jody Godoy & Chris Prentice, U.S. Regulator Opens Inquiry into Wall Street’s Blank Check IPO Frenzy—Sources, Reuters (Mar. 24, 2021), https://www.reuters.com/
article/us-usa-sec-spacs-exclusive/exclusive-u-s-regulator-opens-inquiry-into-wall-streets-blank-check-ipo-
frenzy-sources-idUSKBN2BH09F [https://perma.cc/Q4XE-A3XR]; Chris Katje, New SEC Chair Gary Gensler Could Push For SPAC Regulation, Yahoo! Fin. (Jan. 20, 2021), https://finance.yahoo.
com/news/sec-chair-gary-gensler-could-202602323.html [https://perma.cc/93TK-M4US]; Press Release, John Coates, Acting Dir., Div. Corp. Fin., U.S. Sec. & Exch. Comm’n, SPACs, IPOs & Liability Risks under the Securities Laws, (Apr. 8, 2021), https://www.sec.gov/news/public-statement/spacs-ipos-liability-risk-under-securities-laws [https://perma.cc/W2FC-AD76].
This may simply indicate that plaintiffs’ lawyers are jumping on enforcement guidance to make their claims more plausible. However, of the three de-SPAC transactions that have so far generated SEC enforcement actions, two had previously generated multiple Rule 10b-5 lawsuits. The merger between SPAC Diamond Peak Holdings and Lordstown Motors drew SEC and Department of Justice probes in July 2021.67Ben Foldy, Lordstown Motors Discloses Justice Department Investigation as Truck Launch Looms, Wall St. J. (July 16, 2021), https://www.wsj.com/articles/electric-truck-startup-lordstown-motors-discloses-justice-department-investigation-11626443066 [https://perma.cc/7SZN-4FM6]. The deal, which was completed in October of the previous year, had already generated six Rule 10b-5 lawsuits by the end of the sample period. Similarly, the merger between SPAC VectoIQ and Nikola, which culminated in fraud charges against the combined company’s CEO in July 2021,68Press Release, U.S. Sec. & Exch. Comm’n, SEC Charges Founder of Nikola Corp. with Fraud (July 20, 2021), https://www.sec.gov/news/press-release/2021-141 [https://perma.cc/L4Q8-599Y]. had drawn three Rule 10b-5 lawsuits during the sample period since its closing in June 2020.69The third SPAC deal scrutinized by the SEC, between Stable Road Acquisition Company and Momentus, did not close during the sample period or prior to the SEC’s charges of misleading disclosures ahead of the merger, and accordingly generated only a single Rule 14a-9 lawsuit in my sample. See Press Release, U.S. Sec. & Exch. Comm’n, SEC Charges SPAC, Sponsor, Merger Target, & CEOs for Misleading Disclosures Ahead of Proposed Business Combination (July 13, 2021), https://www.sec.gov/
news/press-release/2021-124 [https://perma.cc/L4Q8-599Y].
Because the SEC is resource constrained, it is commonly thought that the charges it pursues have a high likelihood of merit.70James D. Cox, Randall S. Thomas & Dana Kiku, SEC Enforcement Heuristics: An Empirical

Inquiry, 53 Duke L.J. 737, 763 (2003). That Rule 10b-5 lawsuits arose in significant quantity for these deals before the SEC inquiries suggests that plaintiffs’ lawyers in those specific cases were onto something and that at least in some instances, these lawsuits actually penalize misconduct.71In both cases, the misstatements were initially brought to light by short-seller reports. See The Lordstown Motors Mirage: Fake Orders, Undisclosed Production Hurdles, and a Prototype Inferno, Hindenberg Rsch. (Mar. 12, 2021), https://hindenburgresearch.com/lordstown [https://perma.cc/
24XK-Y6UW]; Ben Foldy, Nikola Internal Review Confirms Some Claims in Short Seller’s Report, Wall St. J. (Feb. 26, 2021), https://www.wsj.com/articles/nikola-internal-review-confirms-some-claims-in-short-sellers-report-11614350745 [https://perma.cc/N9YP-8SDV].

Though they comprise a minority of claims brought after the transaction, I note that there are also some early signs that at least some of the post-closing merger objections in my sample may have teeth. These claims consist of Rule 14a-9 claims alleging proxy fraud and state fiduciary duty claims (often, they are bundled together). Traditionally, post-closing lawsuits for breach of state fiduciary duty are brought by target shareholders alleging that they received an insufficient price for their shares. Analogous claims brought by SPAC shareholders, however, are acquirer shareholder claims, alleging that SPAC directors breached their fiduciary duties by allowing the SPAC to overpay for the target. In the absence of conflicts, such lawsuits are subject to the business judgment rule. This may explain why relatively few of the post-closing fiduciary duty lawsuits in my sample are couched in these terms. Rather, most of the post-closing fiduciary duty claims in my sample allege that the SPAC directors breached their duty of oversight by permitting the fraudulent statements alleged in the proxy.

One notable exception is In re Multiplan Stockholder Litigation72See generally In re MultiPlan Corp. S’holders Litig., 268 A.3d 784 (Del. Ch. 2022). in the Delaware Chancery Court. The case arose out of a transaction between Churchill, the SPAC, and Multiplan, a healthcare data analytics provider.73Id. at 792. The crux of the SPAC shareholders’ allegations was that Multiplan’s largest customer, on which it was dependent, was forming an in-house competitor and planned to shift its business away from Multiplan, and that Churchill’s board deliberately failed to disclose this information in the proxy.74Id. at 797. The complaint alleged that the SPAC directors, who were all appointed by and had previous ties with the SPAC sponsor, violated their fiduciary duties by “issuing a false and misleading proxy, harming stockholders who could not exercise their redemption rights on an informed basis.”75Id. at 800. The fewer shareholders redeem their shares, the more valuable the sponsor shares will be after the merger, because (1) the resulting entity will be more liquid; and (2) the sponsor shares will be less diluted. See Klausner & Ohlrogge, supra note 39, at 6. In a widely cited opinion, Vice Chancellor Lori Will denied the directors’ motion to dismiss the claims against the SPAC board and sponsor, finding that the claims were direct, rather than derivative, and therefore plaintiffs need not plead demand futility, and that the board conflicts meant that the transaction would be reviewed under the demanding entire fairness standard.76MultiPlan, 268 A.3d at 805, 818. If these determinations hold after trial, they will likely be powerful tools for SPAC investors alleging that they were lied to in connection with a de-SPAC merger. The entire fairness standard only applies where there is a conflict of interest, but SPAC sponsors may often be conflicted, and where sponsors appoint boards with which they have financial ties, as they often do,77See Klausner & Ohlrogge, supra note 39, at 5–6. the threat of entire fairness review is likely to have bite. Moreover, unlike Rule 10b-5 lawsuits, which often take years to resolve, fiduciary duty cases in the Delaware Chancery Court are likely to move relatively quickly, suggesting that these may be the cases most likely to curb SPAC misconduct in the immediate future.

2. Pre-Closing Lawsuits: Merger Objection Claims

Pre-closing merger objection claims are a different beast from most Rule 10b-5 litigation. Much merger objection litigation is disposed of before the closing of the transaction. Thus, the effects of pre-closing merger objection litigation on de-SPAC transactions may, in theory, be quite direct; specific deals may be halted, or management may release additional information pertaining to specific aspects of the deal, giving shareholders greater confidence (or not, as the case may be). Rule 10b-5 litigation, by contrast, is notoriously slow, often taking years to be resolved.78See Laarni T. Bulan & Laura E. Simmons, Cornerstone Rsch., Securities Class Action Settlements: 2021 Review and Analysis 13 (2021), https://www.cornerstone.com/wp-content/uploads/2022/03/Securities-Class-Action-Settlements-2021-Review-and-Analysis.pdf [https://
perma.cc/4VPS-DAY8] (finding that more than 40% of securities class actions take between three and four years to settle).

The characteristics of transactions that drew the Rule 14a-9 and state fiduciary duty claims in my sample are reported in Table 4. I note that state fiduciary duty claim defendants boast the lowest mean redemption rate of the claim types in my sample at 13.44% (Rule 14a-9 claims correspond with slightly higher redemption rates, at 18.73%). De-SPAC transactions that are targeted by state fiduciary duty lawsuits have a higher mean enterprise value but lower IPO proceeds than those of Rule 14a-9 claims. They also issue fewer warrants, and the target companies are slightly older. All this indicates that along most metrics (redemption rate, warrants, and target age), lawsuits alleging breaches of state fiduciary duties tend to target de-SPAC transactions that generally appear to be higher quality deals.

If fraud explains the association between rising litigation likelihood and falling redemption rates, the merit of the pre-closing claims in my sample is complicated to assess. The redemption rate in de-SPAC transactions is tabulated very near the closing of the transaction, so redemption rates should, in theory, incorporate any information revealed in a pre-closing lawsuit. Accordingly, these lawsuits could be having one of two effects. First, it is possible that the pre-closing merger objection claims in my sample are halting some bad transactions and lowering redemption rates in deals that close by giving investors greater confidence in the deal. This could be the result of the supplemental disclosures that many defendants issue in response to merger objection litigation. These supplemental disclosures could correct any misrepresentations or omissions and make investors feel good about retaining their shares. In the best case, these disclosures could prompt heightened diligence in management’s pursuit of the deal. The second possibility is less rosy; merger objection claims could target fraudulent transactions, but nonetheless fail to induce investors to redeem, either because investors do not perceive these lawsuits to credibly signal fraud, or because SPAC managers, undeterred by the monetary and reputational consequences of these lawsuits, continue to lie to their investors. While merger objection claims are theoretically a promising means of improving de-SPAC transactions before they close, I argue that they do not appear to be fulfilling this role. If fraud causes investors to sue but prevents shareholders from redeeming, I argue that the more plausible explanation for the negative association between the likelihood of pre-closing merger litigation and redemption rate is that these lawsuits fail to induce investors to redeem their shares, or SPACs to be more truthful in their disclosures.79I note that all sued de-SPAC transactions in my sample closed, meaning that pre-closing merger challenges did not halt any deals, bad or otherwise.

Merger objection litigation over the last decade has been aptly described as “schizophrenic.”80See Cain et al., Mootness Fees, supra note 11, at 1779. For discussion of shareholder litigation challenging director conduct in mergers in prior years, see Robert B. Thompson and Randall S. Thomas, The New Look of Shareholder Litigation: Acquisition-Oriented Class Actions, 57 Vand. L. Rev. 133, 134–41 (2004). At its peak in 2013, a whopping 96% of announced mergers gave rise to litigation by investors, and 60% of the lawsuits were filed in the Delaware Chancery Court.81Cain et al., Mootness Fees, supra note 11, at 1779. These lawsuits drew widespread criticism on the ground that they largely lacked merit and that settlements were generally for pre-merger supplemental disclosures that did not benefit shareholders, but did generate lucrative fees for plaintiffs’ lawyers.82See Cain et al., Shifting Tides, supra note 11, at 605; Gregory A. Markel & Gillian G. Burns, Assessing a Judicial Solution to Abusive Merger Litigation, Law360 (Nov. 19, 2015, 9:59 AM), https://
http://www.law360.com/articles/728061/assessing-a-judicial-solution-to-abusive-merger-litigation [https://
perma.cc/BT7Q-KG3T].
In January 2016, the Delaware Chancery Court cracked down on these lawsuits in what proved to be a landmark decision, In re Trulia.83In re Trulia, 129 A.3d 884, 899 (Del. Ch. 2016). In that decision, the court declared its refusal to approve settlements in merger cases that do not provide “[m]eaningful [b]enefit” to shareholders.84Id.

The literature has demonstrated that the effect of Trulia, in addition to other concurrent cases raising the bar that plaintiffs must meet in merger objection cases,85See e.g., Corwin v. KKR Fin. Holdings LLC, 125 A.3d 304, 305–06 (Del. 2015); In re Volcano Corp., 143 A.3d 727, 750 (Del. Ch. 2016). was a plummet in the number of merger objection cases filed in Delaware.86See Cain et al., Shifting Tides, supra note 11, at 608; see also James D. Cox & Randall S. Thomas, Delaware’s Retreat: Exploring Developing Fissures and Tectonic Shifts in Delaware Corporate Law, 42 Del. J. Corp. L. 323, 326 (2018). To prevent plaintiffs from merely fleeing to other state courts, Delaware’s legislature also allowed for the adoption of forum selection bylaws prohibiting lawsuits in other states.87Cox & Thomas, supra note 86, at 325. However, these provisions do not extend to Exchange Act claims in federal courts, and accompanying the drop in Delaware cases was a marked increase in merger-related lawsuits filed in the federal courts under Rule 14a-9. By 2018, one account found that only 9% of merger-related lawsuits were brought in Delaware, while 87% were brought in federal court.88See Cain et al., Shifting Tides, supra note 11, at 608.

In addition to circumventing Delaware’s strictures on merger litigation jurisdictionally, plaintiffs’ lawyers have also found creative ways to avoid the constraints on settlements for window-dressing disclosures and lawyers’ fees. Following Trulia dicta, many defendants have begun to make voluntary supplemental disclosures in response to merger objection lawsuits, thus rendering the claims moot, and paying plaintiffs’ counsel a mootness fee as long as those disclosures were of benefit to shareholders (even if not material to the vote).89Cain et al., Mootness Fees, supra, note 11, at 1780 (alteration in original). The authors note that mootness fees are not always disclosed by the parties, and therefore their data may be underinclusive. Id. at 1791 n.41; see also In re Xoom Corp., S’holder Litig., No. 11263-VCG, 2016 Del. Ch. LEXIS 117, at *14–15 (Del. Ch. Aug. 4, 2016) (awarding a $50,000 mootness fee and holding that Trulia’s materiality requirement did not apply to mootness fees). Various commentators, including courts, have expressed skepticism about this practice, claiming that the supplemental disclosures in contemporary merger litigation confer no real benefit on shareholders. 90See, e.g., In re Walgreen Co., 832 F.3d 718, 721, 725–26 (7th Cir. 2016). Nonetheless, a recent study has found that at least 63% of merger cases were disposed of with a mootness fee and that such fees appear to have entirely replaced formal settlements in merger objection cases in federal court.91Cain et al., Mootness Fees, supra note 11, at 1782.

To investigate whether pre-closing merger objection claims improve de-SPAC transactions, I run unreported regressions in which the dependent variables are the three-day, seven-day, fourteen-day, thirty-day, and ninety-day returns on the deal,92Returns are winsorized at the 1% level. and the independent variable is a dummy equal to one if the transaction generated at least one pre-closing merger objection claim, controlling for target age. There is no statistically significant association in any specification. Thus, there is no evidence that transactions that draw a pre-closing merger objection claim perform better in the first three months than transactions that draw no lawsuits.

To assess the role of the supplemental disclosures that may be prompted by merger challenges, I use SEC EDGAR and internet searches to see whether defendant SPACs voluntarily released additional disclosures in response to pre-closing lawsuits. In my sample, defendant SPACs issued supplemental disclosures in response to pre-closing merger objection lawsuits 46.15% of the time. The average redemption rate of the de-SPAC transactions that were sued prior to the merger that did not issue supplemental disclosures is 19.11%, while the average redemption rate for those that did issue supplemental disclosures is 6.78%. However, this difference is not statistically significant.93The t-statistic is 1.4127. I note that the mean redemption rate for transactions in which no supplementary disclosures were issued may be driven by two particularly high-redemption deals, the Blackridge Acquisition Corporation/Ourgame International merger (88.9%) and the Boxwood Merger Corporation/Atlas TC Holdings merger (94.9%). Without these outliers, the mean redemption rate of SPACs that did not issue supplemental disclosures is 14.26%, and the t-statistic is 1.047. The rate of supplemental disclosures in my sample appears to be substantially lower than that which other studies have documented regarding deal litigation in federal courts.94Cain et al., Mootness Fees, supra note 11, at 1782 (finding mootness fees paid in 63% of litigated deal cases in federal court in 2018; the mootness fees are purported compensation for obtaining supplemental disclosures). Half of the merger objection cases in my sample were voluntarily dismissed by the end of the sample period. Roughly half of those dismissals involved defendant SPACs that appear to have made no supplemental disclosures at all.

I also use SEC EDGAR and internet searches to examine the fees paid by SPACs in connection to these lawsuits. These fees are generally immaterial to the defendant companies that pay them and therefore are often not widely disclosed.95Gregory A. Markel, Vincent A. Sama, Catherine B. Schumacher & Daphne Morduchowitz, Over 50 M&A Deals Have Been Challenged this Year by a Single Group of Lawyers, Lexology (June 12, 2020), https://www.lexology.com/library/detail.aspx?g=f275b650-6822-4bc1-9e56-0314b51dd760 [https://perma.cc/EN7Y-FEX3]. They also involve dismissal before certification of a putative class and are therefore not subject to approval by courts96Id. (unsurprisingly, none of the merger objections in my sample involve on-the-docket settlements). Very few of the transactions in my sample disclosed mootness fees paid in connection with lawsuits; the average amount was $200,000. This is in line with estimates from other studies, which have found that mootness fees range in amount from $200,000 to $450,000 but have declined in more recent years.97Cain et al., Mootness Fees, supra note 11, at 1803.

There are two likely overlapping possibilities for why these lawsuits might not induce SPAC shareholders to redeem their shares, even if they target transactions that involve misrepresentations. First, shareholders may not believe that these lawsuits credibly signal fraud. Their ubiquity may mean that investors simply do not take these lawsuits seriously. The disclosures they prompt may not seem to add anything important. Previous literature has concluded that, even when they are made, the supplemental disclosures of yore did not influence shareholder voting outcomes,98See Cain et al., Shifting Tides, supra note 11, at 605. and the current disclosures required in exchange for mootness fees need not even be material.99Cain et al., Mootness Fees, supra note 11, at 1780. The amounts that SPAC defendants pay to make these lawsuits go away may not seem to investors like a penalty to remedy serious wrongdoing, but an added tax to make sure the deal closes on time.

There are other reasons that these pre-closing merger objections may not signal to investors that something is truly amiss. More than half (57%) of the merger objection lawsuits brought in New York Supreme Court are against SPACs that have a forum selection clause in the charter or bylaws delegating Delaware as the forum for any fiduciary duty litigation.100I also note that some of the SPACs drawing pre-closing merger objections are, like many SPACs, incorporated in the Caymans, meaning that they would not be subject to the fiduciary duties of any state. Of these, 76% were voluntarily dismissed within the sample period without litigation, suggesting two possibilities: (1) the plaintiffs filed first and researched the defendant firms later, discovering the forum selection provisions after the lawsuits were filed and then voluntarily dismissed them; or (2) the plaintiffs privately demanded fees from the defendant SPACs in return for withdrawal that were lower than the cost of litigating the forum selection provision, suggesting that the fees, and thus the settlement value of these cases, were very low. Neither option inspires confidence in the quality of these lawsuits.

Furthermore, the most frequently filing law firms’ litigation patterns in my sample suggest that these cases may not represent credible signals of misconduct. I report the plaintiff firms in my sample in Table 5. The three most frequently-filing plaintiffs’ firms in my sample—Brodsky Smith, Monteverde & Associates, and Rigrodsky Law—account for just over 40% of the lawsuits in my sample. These frequent filers appear to be small plaintiffs’ firms of between five and eight lawyers. Their prolific but largely unlitigated filings have been noted in other studies,101Cain et al., Mootness Fees, supra note 11, at 1798–99 (finding each of these firms in the top six filers of federal merger lawsuits in 2017–2018). I note that most cases in my sample filed by Rigrodsky Law and Brodsky Smith are in fact fiduciary duty claims filed in the New York Supreme Court, implying that the total number of lawsuits filed by these firms is larger than reported in either study. See also Alison Frankel, Rigrodsky Puts Controversial ‘Mootness Fee’ Business Model under Scrutiny, Reuters
(Oct. 13, 2021), https://www.reuters.com/legal/transactional/rigrodsky-puts-controversial-mootness-fee-business-model-under-scrutiny-2021-10-13 [https://perma.cc/X6MC-9CKQ].
leading commentators to remark that “these law firms appear to be more interested in collecting mootness fees than in actively litigating the cases that they file.”102Cain et al., Mootness Fees, supra note 11, at 1799 (noting that the top six frequent filers settled only 1%–4% of their cases, but appeared to collect mootness fees in 65%–80%). Moreover, the complaints filed in the New York Supreme Court, as noted by other commentators,103Frankel, supra note 7; Kevin LaCroix, SPAC-Related State Court Merger Objection Litigation, D&O Diary (May 9, 2021), https://www.dandodiary.com/2021/05/articles/merger-litigation/spac-related-state-court-merger-objection-litigation [https://perma.cc/BEU4-8EMD]. purport to seek an injunction blocking the merger, but do not include the separate motions for a preliminary injunction that would actually be necessary to put off the shareholder vote.104These commentators note that the filings on the docket are only “part of the story” and that defendant firms frequently receive demand letters for fees before filing a complaint. Frankel, supra note 101. This account of entrepreneurial SPAC firms is consistent with many assessments in the literature of plaintiffs’ lawyers’ behavior in the securities and corporate context. One study postulates that the passage of the Private Securities Litigation Reform Act of 1994 (“PSLRA”), and the higher expenses involved in meeting its pleading standards, pushed many small firms away from 10b-5 class actions and toward corporate litigation, such as that challenging mergers.105Brian Cheffins, John Armour & Bernard Black, Delaware Corporate Litigation and the Fragmentation of the Plaintiffs’ Bar, 2012 Colum. Bus. L. Rev. 427, 431 (2012). Others note the rise in dismissals (used as a proxy for low-quality cases) concurrent with the rise in lawsuits brought by “emerging firms” in the securities arena generally.106Klausner & Heglund, supra note 55. Still others have noted that plaintiffs’ firms have proven resilient to attempts to crack down on nuisance litigation and have circumvented these efforts, particularly in the merger litigation context, by filing in federal courts or jurisdictions outside Delaware and by evading requirements for more useful disclosures by voluntarily dismissing cases in exchange for mootness fees.107See Cain et al., Shifting Tides, supra note 11, at 607; Cain et al., Mootness Fees, supra note 11, at 1781.

The second reason that these lawsuits may not induce shareholders to redeem even if they target truly fraudulent transactions is that the fees paid to plaintiffs’ attorneys may be unlikely to deter such fraud. Unscrupulous SPAC managers could pay $200,000, issue supplemental disclosures, and continue to lie about the quality of the target or the diligence they conducted in order to close the transaction and receive their 20% promote. This may contrast with more traditional IPO litigation under section 11, as discussed in Section III.B.

Many of the regulatory and scholarly concerns for the welfare of SPAC investors revolve around issues that might be mitigated with improved disclosure, such as conflicts of interest, speedy deal process, and dilution. Pre-merger deal litigation could theoretically be a mechanism well-suited for providing greater transparency to investors. However, there are reasons to think that even if these cases are targeting fraudulent transactions, they fail to induce SPAC shareholders to redeem. If true, this explanation is a strong condemnation of merger litigation generally, and even more so in the SPAC context: in transactions that are known to be risky to investors, that are by their nature fraught with conflicts of interest, and in which fraud could plausibly be rampant, the type of lawsuit that, in theory, could be of the most preemptive benefit to shareholders is being leveraged by a small number of small firms for small fees at investors’ expense.

B. SPAC-Related Litigation Is Unrelated to Transaction Quality

Of course, it is possible that the SPAC-related lawsuits in my sample do not target fraudulent transactions, and that the lower redemption rates of the transactions that appear to draw lawsuits are not the product of misrepresentations. If this is the case, it appears that plaintiffs may simply have chosen lower-redemption SPACs to sue. The probability of being sued appears to be related neither to the size of the deal, nor the returns, nor the industry, nor the management. Accordingly, in the absence of fraud, we can say little more than that these lawsuits do not appear to be logically related to common metrics for transaction quality, including redemption rate.

There is some reason to think that not all of the low-redemption deals which appear more likely to draw lawsuits involve fraud. My sample suggests that the boom year for redemptions was 2019, with a mean redemption rate of 64.96% across all de-SPAC transactions completed that year. However, only nine SPAC-related lawsuits were filed. The mean redemption rate declined to 37.45% in 2020 as lawsuits climbed to forty-six and even further in the first half of 2021 to 22.98%, as lawsuits exploded at a whopping eighty-nine. Significantly, the first half of 2021 was also the time when the SEC, under the then-new Biden administration, began publicly to take aim at SPACs. The end of March and early April 2021 saw a highly publicized cluster of SEC releases and statements specifically geared at SPACs, addressing, among other issues, key filing issues,108See, e.g., Press Release, U.S. Sec & Exch. Comm’n Div. Corp. Fin., supra note 36. accounting considerations,109See, e.g., Press Release, John Coates, Acting Chief Div. Corp. Fin., & Paul Munter, Acting Chief Acct., U.S. Sec & Exch. Comm’n, Considerations for Warrants Issued by Special Purpose Acquisition Companies (“SPACs”) (Apr. 21, 2021), https://www.sec.gov/news/public-statement/
accounting-reporting-warrants-issued-spacs#_ftnref3 [https://perma.cc/A8N4-Y3B6].
and liability risk under the securities laws.110See, e.g., Press Release by John Coates, supra note 66. By the end of April, reports were circulating that the SEC was considering new guidance on SPACs.111Anirban Sen, Chris Prentice & Joshua Franklin, Exclusive U.S. Watchdog Mulls Guidance to Curb SPAC Projections, Liability Shield –Sources, Reuters (Apr. 27, 2021), https://www.reuters.com/
business/exclusive-us-watchdog-weighs-guidance-aimed-curbing-spac-projections-liability-2021-04-27 [https://perma.cc/XDF4-WHYT].
If fraud is the explanation for low redemption rates, then conversely, SPACs reporting high redemption rates are the ones that told the unpromising truth about their prospects and diligence, causing investors to redeem. But according to my sample, these candid, high-redemption SPACs proliferated in earlier years, while SPACs reporting the lowest redemption rates (resulting, presumably, from lying to their shareholders) 112Some commentators have provided other explanations for the improvement in SPAC redemption rates over time. Some commentators postulate that the SPAC market is “self-correcting” because of demands from institutional investors; some structural changes include the size and nature of the sponsors’ “promote” and the length of time sponsors are required to hold their shares following the de-SPAC transaction. Going Public: SPACs, Direct Listings, Public Offerings, and the Need for Investor Protections: Hearing Before the Subcomm. on Inv. Prot., Entrepreneurship & Cap. Mkts. of the H. Comm. on Fin. Servs., 117th Cong. 10–11 (2021) (statement of Scott Kupor, Managing Partner, Andreessen Horowitz), https://financialservices.house.gov/uploadedfiles/hhrg-117-ba16-wstate-kupors-20210524.
pdf?te=1&nl=dealbook&emc=edit_dk_20210524 [https://perma.cc/E9VL-GXUE]. Others state that “SPAC sponsors today are more reputable than they have ever been, and as a result, the quality of their targets has improved, as has their investment performance.” Bazerman & Patel, supra note 1, at 105. Sponsors have also been increasing their contributions to the merger and decreased the number of warrants they offer per unit at the IPO, which makes the de-SPAC transaction less dilutive. See Gahng et al., supra note 12, at 33.
did their lying concurrently—at least in part—with the announcement that regulatory scrutiny of SPACs was significantly on the rise. It seems counterintuitive that SPACs would be honest when no one was looking and engage in fraud just as the SEC announced that it would be looking very carefully.

To be sure, there is not complete alignment of timing—the SEC’s statements did not begin until April, meaning that SPACs closing earlier in 2021 would have no reason to be more truthful with their investors (other than the ascension of a more pro-regulation administration) than the SPACs before them. However, there may be some explanations for the relationship between redemption rate and lawsuit probability that do not involve fraud.

1. Post-Closing Lawsuits: Standing and Class Size

One potential explanation for the negative relationship between the probability of a lawsuit and redemption rate may be putative class size. In a post-closing lawsuit, fewer redeeming shareholders could mean a larger potential class of investors, which would presumably result in more damages, greater pressure to settle, and a larger potential payout, and therefore a more lucrative lawsuit for plaintiffs’ counsel.

This line of reasoning operates slightly differently for the different types of claims in my sample. Most directly, shareholders have standing in Delaware to challenge a merger under laws of fiduciary duty only if they owned stock at the time of the challenged transaction and throughout the litigation.113Christopher M. Harvey, Corporate Law—Mergers and Double Derivative Actions: The New Frontiers in Derivative Standing, 38 Vill. L. Rev. 1194, 1196–97 (1993). This means that shareholders who redeem their shares before the merger are clearly out. Similarly, private plaintiffs are presumed to have standing under Rule 14a-9 if they were injured in connection with a proxy solicitation and if “reliance of some shareholders on the statement was likely to affect [how they voted].”114Thomas Lee Hazen, Federal Securities Law 94 (2d ed. 2003). Shareholders in SPACs can vote for the transaction and still redeem their shares, but in such a situation are unlikely to have been injured.115It appears to be unsettled whether the presumption of reliance on the proxy is rebuttable. See, e.g., Dowling v. Narragansett Cap. Corp., 735 F. Supp. 1105, 1120 (D.R.I. 1990) (“[T]hat presumption [of reliance on the proxy] may be rebutted by evidence that the alleged misinformation had no effect on the action taken.”); Gaines v. Haughton, 645 F.2d 761, 774 (9th Cir. 1981) (“[S]hareholders who do not rely on allegedly misleading or deceptive proxy solicitations lack standing to assert direct (as opposed to derivative) equitable actions under § 14(a).”); Philip B. Kurland, The Supreme Court, 1963 Term, 78 Harv. L. Rev., 143, 299 (1964) (“A rebuttable presumption of shareholder reliance is necessary, for it would be impossible to show the effect of such violations on those who gave the proxies.”). But see Sandberg v. Va. Bankshares, Inc., 891 F.2d 1112, 1120 (4th Cir. 1989) (“[T]he rebuttable presumption in § 10(b) actions is not applicable to § 14(a) actions.”). However, it is an interesting question whether such a presumption could be rebutted with respect to SPAC shareholders who redeem their shares but vote for the transaction.

Plaintiffs have standing under Rule 10b-5 if they purchased or sold securities in connection with manipulative or deceptive conduct.116Blue Chip Stamps v. Manor Drug Stores, 421 U.S. 723, 749 (1975). Such a class could encompass investors who bought the shares of the merged company on the secondary market in reliance on misleading statements issued before or in connection with the merger. Nonetheless, a large class of non-redeeming shareholders who bought their shares based on rosy statements about the deal could also form the basis of a lucrative class for an action under Rule 10b-5.117The average time elapsed between the merger and the end of the class period for Rule 10b-5 claims in my sample is roughly 6 months, although this falls to 4.7 months if I remove 2 extremely long outliers.

If class size is the explanation for the negative relationship between litigation probability and redemption rate, then these cases may be opportunistic, rather than targeting the worst transactions. Larger classes likely involve larger damages, create the shadow of greater potential liability, and thus are likely to induce greater pressure to settle and probably a larger payout for plaintiffs’ lawyers.118Lisa L. Casey, Reforming Securities Class Actions from the Bench: Judging Fiduciaries and Fiduciary Judging, 2003 BYU L. Rev. 1239, 1239, 1241 (2003) (“Certainly the lawsuits hold the promise of enormous potential profits for class counsel. As a general matter, the larger the company sued (as measured by market capitalization), the larger the losses suffered by the putative class, and the larger the potential settlement fund.”). This may also be an explanation for the greater involvement of top tier plaintiffs’ lawyers that does not rely on merit; these firms generally have relationships with institutional investors who are well-positioned to be lead plaintiff in Rule 10b-5 class actions if they want to be and may deliberately target these cases based on potential payoff.

2. Pre-Closing Lawsuits: Litigation as a Substitute or Complement for Redemption

One possible explanation of the negative relationship between litigation and redemption rate in de-SPAC transactions for pre-closing merger challenges that does not involve fraud may be that SPAC shareholders are increasingly using litigation as a substitute or complement for exercising their redemption rights.

On its face, using litigation as a substitute for redemption makes little sense. The SPAC shareholders’ redemption right is essentially a free put option, allowing shareholders to costlessly back out of the transaction. Moreover, many redeeming shareholders do not entirely surrender their claims to the company should the merger prove to be a good one—they may keep their warrants and vote for the transaction, even if they choose to redeem their shares.

To be sure, freeriding on a lawsuit is also a costless option. There appears to be little downside to a nonplaintiff shareholder to watching such a lawsuit unfold. In the context of pre-closing merger objection litigation, any supplemental disclosures may improve deal quality (although, as discussed, this is unlikely), and although the sample is small, my results indicate that share prices of the merged company do not appear to suffer as a result of pre-closing merger challenges. But while the occasional shareholder may actually believe that a merger objection claim will increase the value of their post-merger shares, this appears unlikely based on the supplemental disclosures issued in my sample and given the dysfunctionality of much merger litigation generally. Indeed, it appears that many of these lawsuits are driven by plaintiffs’ lawyers in exchange for nominal fees rather than by SPAC shareholders. Accordingly, it seems generally improbable that the negative relationship between the likelihood of a lawsuit and redemption rate is the result of shareholders treating these options as substitutes.

It might also be possible that sophisticated investors, who may be more likely to redeem,119This raises a related possibility for the inverse association between redemption rate and litigation probability: investor composition. Anecdotally, the composition of SPAC investors changed significantly over the course of the boom. Prior to 2019, SPACs were a niche market. Investors in them were sufficiently sophisticated both to understand what they were getting and also to exercise their redemption rights when what they got fell short of what they were promised. However, accounts from lawyers suggest that as SPACs went mainstream in 2020 and throughout 2021, the share of less sophisticated retail investors rose—investors who might not be equipped or motivated to keep such a close eye on the prospects of the SPAC or to exercise their redemption rights if those prospects were undesirable. are also more likely to sue as a means of extracting the most possible value from their shares. Although the most direct monetary gains would result if redeeming investors could bring post-closing lawsuits for damages, they generally lack standing to do this. It is also possible that sophisticated investors who redeem might bring pre-closing lawsuits in the hopes of improving the transaction and thus upping the value of their warrants (which they can retain through redemption). I note, however, that the vast majority of merger objection claims in my sample are brought by individual investors, rather than the institutional ones more likely to engage in sophisticated strategy. It is therefore not clear that litigation is being used as a complement to the redemption right intended to make money.

C. Summing Up the SPAC Litigation Deluge

If pre-closing merger challenges produce immaterial disclosures in exchange for nuisance fees, there does not appear to be a plausible interpretation of the negative relationship between lawsuit probability and redemption rate that redeems these lawsuits. If these challenges target high-quality transactions, they are unhelpful. It seems highly unlikely that they are a common or logical substitute or complement for redemption. And if they target fraudulent transactions, they seem to be a failure under the very circumstances where they could, in theory, be most useful. Proponents of merger litigation may argue that these cases provide an opportunity for plaintiffs to force SPACs to improve or abandon their acquisitions and are therefore valuable. This potential indubitably exists and may represent the optimal result for investors, who could redeem their shares, or even, in the best case, benefit from a better deal without the injury of a bad transaction necessitating after-the-fact recompense. But I argue that merger objection lawsuits as they play out on the ground are not performing this role and are therefore undesirable, irrespective of whether they target fraudulent or high-quality deals.

Lawsuits brought after the merger are somewhat more difficult to characterize, in part because the vast majority have not been resolved. If low redemption rates are the product of misrepresentations, these lawsuits are likely meritorious. They also probably are effective deterrents of bad conduct; large class actions are among the more terrifying prospects to corporate management,120Myriam Gilles & Gary B. Friedman, Exploding the Class Action Agency Costs Myth: The Social Utility of Entrepreneurial Lawyers, 155 U. Pa. L. Rev. 103, 106 (2006) (“But does anyone seriously doubt that there is immense deterrent power in the contemporary class action? Executives tempted to lie about earnings are more concerned about [plaintiffs’ lawyers] than they are about the Securities and Exchange Commission (SEC). Companies tempted to skirt fair credit reporting requirements are more concerned with ruinous liability at the hands of the class action bar than they are with the corrective measures and fines that might be meted out following a none-too-likely Federal Trade Commission (FTC) investigation.” (footnotes omitted)). and perhaps even more so these days for SPACs, which are facing increasingly narrow D&O policies to insure against such lawsuits.121Caroline Bullerjahn & Morgan Mordecai, Limiting SPAC-Related Litigation Risk: Disclosure and Process Considerations, Harv. L. Sch. Forum on Corp. Governance (Mar. 14, 2021), https://corpgov.law.harvard.edu/2021/03/14/limiting-spac-related-litigation-risk-disclosure-and-process
-considerations [https://perma.cc/9M8D-BXVX].
Class actions under Rule 10b-5 in particular are likely to be drawn-out affairs, and decreased insurance coverage may mean that the costs of such litigation may be more likely to fall on the defendant firm. But if the low redemption rates are the product of truthful disclosures and signal high-quality deals, the post-closing lawsuits challenging them may be opportunistic. Plaintiffs’ lawyers may be targeting these cases not because there is an indication of misconduct, but because a lower redemption rate means a larger class, more extensive purported damages, greater settlement pressure, and a higher potential payout.

Accordingly, it appears at this early stage that many of the lawsuits comprising the SPAC litigation deluge—the merger challenges brought before the merger—are accomplishing little other than a minimal payout for a few small plaintiffs’ firms. Claims brought after the merger closes may effectively punish fraud. But it is also possible that even some of these lawsuits may be brought for opportunistic, rather than meritorious reasons. And if this is the case, they may be creating a significant cost for the transactions that appear to be of the highest quality. Fully disentangling these competing explanations—that SPAC-related litigation targets fraudulent transactions, or in fact targets quite good transactions—is a complex and perhaps impossible task. But there is at least a credible argument that much of this litigation may be failing to improve SPAC transactions and may be brought for reasons unrelated to SPAC quality.

V. THE BIGGER PICTURE: COMPARING SPAC MERGER LITIGATION TO SECTION 11 LITIGATION

SPACs are engineered specifically to avoid the expensive and time-consuming IPO process. More specifically, they are engineered, in some respects, specifically to avoid liability exposure under the securities laws, which is particularly stringent for IPOs.122Press Release by John Coates, supra note 66. Since the targets of SPACs go public through a merger instead of an IPO, they are governed by the regime for mergers instead of IPOs. In this section, I explore how, at this early stage, these different liability regimes stack up against each other.

A. Problems Avoided by SPACs: Securities Act Liability and the Forward-Looking Statement Safe Harbor

There are two main reasons commonly cited for the reduced liability exposure faced by SPACs. These are liability under section 11 and liability for forward-looking statements.

The Securities Act of 1933 governs liability for primary offerings, including IPOs. A company issuing securities is strictly liable under section 11 of the Securities Act for material misstatements and omissions in its registration statement, the instrument by which it must register new shares for sale with the SEC.123The other significant source of liability under the Securities Act, which is also more stringent than that of Rule 10b-5 is section 12(a)(2), which imposes liability for material misstatements and omissions in a prospectus. I do not focus on these claims because they are generally understood to require negligence, rather than strict liability, and because the content of a prospectus overlaps with that of a registration statement in most cases, they are frequently brought together. See Thomson Reuters, Practical Law Securities Litigation & White Collar Crime: Securities Act: Section 12(a)(2) Elements and Defenses (2022) (“[P]laintiffs often file claims under Section 12(a)(2) together with Section 11 claims.” (citations omitted)). The company’s underwriters, officers, and directors are liable as well; although they may benefit from a “due diligence defense,” in practice, this defense is available only under a very demanding standard.124See Escott v. BarChris Constr. Corp., 283 F. Supp. 643, 684–701 (S.D.N.Y. 1968); In Re Worldcom, Inc., 346 F. Supp. 2d 628, 648 (S.D.N.Y. 2004). Section 11 liability is virtually nonexistent in connection with SPACs. There are several reasons for this. First, the SPAC IPO occurs when the SPAC is a shell company and therefore has almost nothing in the way of operations or financials to disclose.125See Klausner et al., supra note 4, at 271. Second, SPACs could face section 11 liability in connection with the shares that they issue to target shareholders or PIPE investors in connection with the merger. However, once these shares mix in the market with shares issued in the IPO, it is very difficult to “trace” them to the registration statement of a particular offering. Such “tracing” is a requirement for the standing of a purported section 11 plaintiff, meaning that these cases are likely to be rare.126Id. Issuances in connection with the merger are not underwritten, meaning that underwriters escape liability for these issuances entirely. Id.

The second reason commonly cited for going public via SPAC rather than via IPO is the safe harbor for forward-looking statements. The PSLRA allows issuers to make projections and forward-looking statements without fear of liability for such statements under the securities laws, as long as the statements are accompanied by meaningful cautionary language.127See 15 U.S.C. §§ 77z-2, 78u-5. This safe harbor is unavailable in IPOs.128Id. § 77z-2 (a)(2)(d). The ability to make forward-looking statements may be particularly valuable to pre-revenue SPAC targets, who do not yet have historical earnings with which to entice investors, or to firms in high-tech industries which require extensive start-up capital.129Daniele D’Alvia & Milos Vulanovic, A Rethinking of U.S. Forward-Looking Statements in SPACs, Fordham J. of Corp. & Fin. L. (July 13, 2021), https://news.law.fordham.edu/jcfl/2021/07/13/a-rethinking-of-u-s-forward-looking-statements-in-spacs [https://perma.cc/WF9Y-K5Z5]. However, the demise of this advantage may be imminent;130The general rationale for eliminating the safe harbor for SPACs is to level the playing field between SPAC transactions and IPOs with respect to forward-looking statements. For a challenge to this rationale, see Amanda M. Rose, SPAC Mergers, IPOs, and the PSLRA’s Safe Harbor: Unpacking Claims of Regulatory Arbitrage (Oct. 19, 2021) (unpublished manuscript), https://ssrn.com/abstract=3945975 [https://perma.cc/GN92-E9B3]. regulators have hinted that the safe harbor may not apply to SPACs after all,131Press Release by John Coates, supra note 66 (“[T]he PSLRA safe harbor should not be available for any unknown private company introducing itself to the public markets . . . regardless of what structure or method it used to do so.”). For further discussion, see John C. Coates, supra note 3, at 6–7. and the House of Representatives has released draft legislation explicitly eliminating the safe harbor for SPACs.132Ran Ben-Tzur & Jay Pomerantz, House Releases Draft Legislation Eliminating SPAC Safe Harbor for Forward-Looking Statements, Harv. L. Sch. Forum on Corp. Governance (June 7,
2021), https://corpgov.law.harvard.edu/2021/06/07/house-releases-draft-legislation-eliminating-spac-safe-harbor-for-forward-looking-statements [https://perma.cc/J472-NQQ7].

B. Merger Liability for IPO Liability: The SPAC Lawsuit Swap

In going public via SPAC, firms essentially swap the IPO liability regime for the merger liability regime. What are the effects of this exchange? To examine how these lawsuits stack up against one another, I compare the merger objection lawsuits and unsued SPACs in my sample to a sample of IPOs from January 1, 2016, to July 31, 2021, matched with a sample of section 11 class actions from January 1, 2017, to June 31, 2021.133I build in a one-year lag between the IPO sample and the section 11 lawsuit sample because section 11 has a one-year statute of limitations. The timeframe for the lawsuits matches the one I use for SPAC-related litigation. I gather my sample of section 11 class actions from the Stanford Securities Class Action Clearinghouse and the sample of non-SPAC IPOs from Zephyr.

Descriptive statistics are tabulated in Table 11. My total sample consists of 701 IPOs, 60 of which draw section 11 claims. Although the rate varies year over year (it spikes to over 17% in 2019, and the data for 2021 covers only the first half of the year, coming to nearly 3%), the average percentage of IPOs across the sample that draw a section 11 claim is 8.56%. This is far lower than the percentage of SPACs in my sample that drew merger objection claims during the same period (either under state corporate law or Rule 14a-9), which is 51.89%. Moreover, the average across the sample is understated since there are very few merger objection claims in the first three years of my sample; the average percentage in 2020 and 2021, when these lawsuits began to appear in earnest, is 67.69%.

The consequences of these lawsuits also appear to be dramatically different. The average settlement for section 11 lawsuits in my sample is roughly $4.5 million. In calculating this average, I include cases that were dismissed (where the settlement amount equals zero) and cases that are still ongoing (where the settlement amount is currently zero but may be significant when the case is resolved). This accounts to some degree for low settlement values in the last few years of the sample. But the $4.5 million average, though likely substantially understated, is still much larger than the settlements that have so far been disclosed for merger objections in my sample, which average $200,000. Indeed, the likely reason that so few of such settlements are publicly available is that they are so small as to be immaterial to the issuers and therefore need not be disclosed.

There is also a divergence in the plaintiffs’ firms that bring these cases. Section 11 lawsuits, across my sample, involve a top-tier plaintiffs’ firm 63.3% of the time, and this number is relatively stable year over year (ranging from 60% to 69%). Top-tier plaintiffs’ firms account for only 7.27% of the merger objection claims in my SPAC sample. Finally, the most commonly sued industries in the IPO sample are software and programming (21.67% of the sample) and biotechnology and drugs (11.67%). These are among the industries found in other studies to be particularly vulnerable to securities litigation.134Skinner, supra note 56; Rogers & Stocken, supra note 56. By contrast, SPACs that draw lawsuits are generally not in these industries.

Table 11.  Section 11 Lawsuit Descriptive Statistics

Year

Section 11 Lawsuits

Mean Settlement*

Percent Top Plaintiffs’ Firms135Top plaintiff firms are those in my sample that appear in The Legal 500 list for securities plaintiff litigation. Legal 500, supra note 54. Those appearing in my sample are Berman Tombacco, Bernstein Liebhard, Bernstein Litowitz Berger & Grossman, Cohen Milstein Sellers & Toll, Grant & Eisenhofer, Kessler Topaz, Labaton Sucharow, Pomerantz, and Robbins Geller Rudman & Dowd.

Total Non-SPAC IPOs

Percent Non-SPAC IPOs Sued Under Section 11

2017

13

$15,600,000

69.23%

155

8.38%

2018

11

$6,445,455

63.64%

125

8.8%

2019

20

$813,750

60.0%

115

17.39%

2020

11

$0

63.64%

137

8.03%

2021

5 (as of June 31)

$0

60.0%

169 (as of July 31)

2.96%

Total

60

$4,571,841

63.3%

701

8.56%

Notes: *Mean settlement calculations include settlement values for lawsuits that were dismissed and lawsuits that are ongoing.

So, what can we make of all this? In terms of sheer litigation volume, SPACs appear to have fallen from the frying pan into the fire. Though designed in some respects specifically to avoid litigation under section 11, SPACs draw merger objection claims with dramatically greater frequency, and as previously discussed, the bulk of these claims are brought before the merger. However, insofar as information is available, these claims appear to settle for far lower amounts, meaning that for individual SPACs, the increased probability of a lawsuit may be outweighed by the apparently low probability of having to pay an expensive settlement. The majority of section 11 claims are brought by top-tier plaintiffs’ law firms, which may indicate that they are stronger claims than the merger objections, which are brought by a handful of quite small entrepreneurs.

For IPOs, the policy choice has been to wield the heavy cudgel of virtually strict liability against those perceived to have the best knowledge about the newly public firm. This choice was made on the basis that firms new to the market, about which little is known, without any public history and potentially few profits for investors to assess, could be prime candidates for fraud. Investors in such firms have no source of information other than the firm itself, and Congress created the rigid liability provisions of the Securities Act to make that information as accurate as possible.136This model is encountering increasing challenges as entrepreneurial firms without a history of profitability are commanding extraordinary valuations based on the possibility of future earnings. For a discussion of the role of investor protection in the face of these issues, see James J. Park, Investor Protection in an Age of Entrepreneurship, 13 Harv. Bus. L. Rev. 107, 146 (2022). These provisions create the specter of extraordinary costs for those who violate them137See Donald C. Langevoort, Deconstructing Section 11: Public Offering Liability in a Continuous Disclosure Environment, 63 L. & Contemp. Probs. 45, 45–47 (2000) (noting that “a sizable portion of the underwriters’ spread is a liability risk premium, and lawyer-disseminated fear of liability casts a harsh shadow over the due diligence process” (footnotes omitted)). and are thus commonly regarded as “well suited to deter misreporting.”138Volker Laux & Phillip C. Stocken, Managerial Reporting, Overoptimism, and Litigation Risk, 53 J. Acct. & Econ. 577, 579 (2012). Under the current legal framework, SPACs do not face this standard. Rule 10b-5 plaintiffs must plead specific allegations of, at the least, recklessness. Rule 14a-9 litigants must prove negligence to prevail,139Press Release No. 51283, U.S. Sec. & Exch. Comm’n, Report of Investigation Pursuant to Section 21(a) of the Securities and Exchange Act of 1934 and Commission Statement on Potential Exchange Act Section 10(b) and Section 14(a) Liability (Mar. 1, 2005), https://www.
sec.gov/litigation/investreport/34-51283.htm [https://perma.cc/8CY5-QXBT] (“Where the failure to make such disclosure is negligent, an issuer would violate Section 14(a) of the Exchange Act and Rule 14a-9 thereunder . . . .”).
which may be a more demanding task than it appears.140See Roger A. Cooper, James E. Langston, Mark E. McDonald & Charity E. Lee, Rare Federal Court Decision Casts Doubt on Merger Disclosure Claims, but Will It Change Anything?, Cleary M&A & Corp. Governance Watch (June 25, 2020), https://www.clearymawatch.com/2020/06/rare-federal-court-decision-casts-doubt-on-merger-disclosure-claims-but-will-it-change-anything [https://perma.cc/
9XYB-AH9G] (“Unless a plaintiff can show that the proxy statement omitted a fact required to be disclosed by SEC regulations (which is often a tall task), the plaintiff must plead . . . with particularity, not merely with conclusory allegations—how the allegedly omitted fact renders the proxy statement disclosures materially misleading. But without knowing the facts that have been omitted—and because of the discovery stay imposed by the Private Securities Litigation Reform Act (“PSLRA”)—plaintiffs will have difficulty obtaining such facts at the pleading stage . . . .”).
Under Delaware law, acquirers’ challenges to mergers under state corporate law are usually reviewed under the deferential business judgment rule. Although there are good arguments that SPAC mergers, in view of potential management conflicts, should be reviewed under the more demanding “entire fairness” standard,141See Klausner & Ohlrogge, supra note 39, at 12–13; see also In re MultiPlan Corp. S’holders Litig., 268 A.3d 784, 809 (Del. Ch. 2022); AP Servs. LLP v. Lobell, No. 651653/2012, 2015 WL 3858818, at *39–40 (N.Y. 2015) (declining to dismiss entire fairness claims against a SPAC under Delaware law where such a conflict was alleged); Ann Lipton, Another SPAC Legal Development, Bus. L. Prof Blog (Aug. 7, 2021), https://lawprofessors.typepad.com/business_law/2021/08/another-spac-legal-development.html [https://perma.cc/73GT-H4N2]. any of these standards is harder work for plaintiffs than the virtually strict liability of section 11.142It may be worth considering due diligence defenses for certain parties such as those available under section 11.

None of these concerns motivating the rigidity of section 11 liability are any less salient in SPACs, and given the conflicts commonly cited with respect to the SPAC form, they are arguably more intense. Yet, the merger litigation that replaces the Securities Act liability in the SPAC context appears to be significantly less costly to issuers—and indeed, that is probably one reason that the SPAC form is popular. It may be that merger litigation simply does not cost enough to induce SPACs to make the disclosures that would be optimal for investors in newly public firms.

But although most SPAC merger litigation may be insufficiently costly compared to Securities Act litigation, it is simultaneously too costly in that it is indiscriminate. Section 11 plaintiffs are constrained by unyielding standing rules143Plaintiffs must be able to “trace” their securities to the registration statement at issue, a task that is virtually impossible once the securities enter the secondary market. See, e.g., In re Ariad Pharms Inc. Sec. Litig., 842 F.3d 744, 755 (1st Cir. 2016); Shapiro v. UJB Fin. Corp., 964 F.2d 272, 286 (3d Cir. 1992); In re WRT Energy Sec. Litig., No. 96 Civ. 3610, 1997 WL 576023, at *21 (S.D.N.Y. 1997); Gould v. Harris, 929 F. Supp. 353, 359 (C.D. Cal. 1996); In re AES Corp. Sec. Litig., 825 F. Supp. 578, 593 (S.D.N.Y. 1993). and generally have a fleeting one-year limitations period in which to make their claims.14415 U.S.C. § 77m. These restrictions manifest in the relatively low number of IPOs that draw section 11 lawsuits—in my sample, less than 9%.145I note that the dramatically smaller percentage of IPOs drawing section 11 claims may also be a result of the higher scrutiny registration statements faced by issuers and underwriters because of the threat of strict liability. Although the limitations periods for much merger litigation are not much longer (often two or three years), standing rules are broader.146Private plaintiffs are presumed to have standing under Rule 14a-9 if they are injured in connection with a proxy solicitation (no purchase or sale of securities is required). See Thomas Lee Hazen, Treatise on the Law of Securities Regulation § 10:69 (2022). Shareholders have standing to challenge a merger if they owned stock at the time of the challenged transaction and throughout the litigation. If the lawsuit is derivative, they must either have demanded that the board initiate a lawsuit or adequately plead that the demand was futile. Harvey, supra note 113, at 1194. More importantly, the current regime for merger litigation generally allows for quick settlement for small fees without judicial oversight, meaning that no party involved has incentives to put the brakes on these lawsuits; plaintiffs may bring them irrespective of the merit of the case, defendants find it less expensive to pay than to challenge, and courts are not in a position to curtail the fees that are unwarranted or heighten those that are not.147See Cain et al., Mootness Fees, supra note 11, at 1783.

A high percentage of SPACs in the most recent years have drawn merger objection lawsuits. If these SPACs are fraudulent, $200,000 in fees is likely too low to create effective deterrence. If they are not, then many SPACs may be paying a “deal tax” that others have argued, in the standard merger context, is inappropriate.148Id. at 1777.

VI. POLICY IMPLICATIONS

Proposals for improving SPACs abound; academics,149See, e.g., Klausner et al., supra note 4; Rodrigues & Stegemoller, supra note 4. regulatory agencies,150Dave Michaels, SEC Weighs New Investor Protections for SPACs, Wall St. J. (May 26, 2021), https://www.wsj.com/articles/sec-weighs-new-investor-protections-for-spacs-11622052408 [https://
perma.cc/88CM-PEC9].
and even members of Congress151Amrith Ramkumar, Elizabeth Warren, Other Top Democrats Raise Concern About SPAC Incentives, Wall St. J. (Sept. 22, 2021), https://www.wsj.com/articles/elizabeth-warren-other-top-democrats-raise-concerns-about-spac-incentives-11632339583 [https://perma.cc/67B7-HPQW]. are busily gathering information and proposing reforms. Many of these proposals involve heightened disclosures of the various conflicts and relationships inherent in the structure of SPACs. These include, to name a few, sponsor compensation and the incentives sponsors have to close even a subpar deal, the stake that sponsors will have in the merged company, information about the diligence and negotiations that occurred during the merger process, and the probability and extent of dilution for shareholders who continue to hold their shares after the merger.

But perhaps these potential hazards are features, rather than bugs, of the SPAC structure. SPACs, say their proponents, “offer investors and targets a new set of financing opportunities that compete with later-stage venture capital, private equity, direct listings, and the traditional IPO process. They provide an infusion of capital to a broader universe of start-ups and other companies, fueling innovation and growth.”152Bazerman & Patel, supra note 1. Of course, this breadth comes at a cost. SPACs are common for “firms [that] are speculative, have enormous capital requirements, and can provide only limited assurances on near-term revenue and viability.”153Id. at 105. Such investments are obviously risky. But one might assert that investors in SPACs understand this, or if they do not, they should. On this view, investors get to participate in these ostensibly high-reward deals in no small part because they lack the protections of a traditional IPO, and this bargain is a reasonable one that should be left intact.

The success of SPAC-related litigation depends on the camp into which one falls. Much of the SPAC litigation deluge is in fact composed of pre-closing merger challenges that probably generate little but low fees for plaintiffs’ counsel. But if SPAC investors accept less information as part of a deliberate bargain for the opportunity to invest in ventures not normally available to them, then perhaps this litigation is functioning more or less as intended, and pre-closing merger challenges represent a minor deal tax154See, e.g., Frankel, supra note 7. that all parties involved are willing to pay.

If, on the other hand, SPACs are not functioning as intended—if the informational goals of SPACs are analogous to those of standard IPOs, and shareholders have not knowingly bargained for them to be otherwise—then it appears that much SPAC-related litigation is not serving this goal. Rather, the merger challenges brought before the lawsuit are not promoting meaningful disclosure, and even a percentage of the smaller proportion of lawsuits brought after the merger may target transactions that can produce the biggest payout for plaintiffs’ counsel, rather than targeting true misconduct. A natural conclusion of this line of thinking is that the system of private enforcement is currently ill-suited to correct the deficiencies of SPACs and that greater public intervention, such as government enforcement actions and more stringent regulation, may be necessary.

But this early sample of SPAC-related lawsuits has broader implications for the utility of shareholder litigation as a policing mechanism. The consolidation of the U.S. public equity markets has been the source of commentary for some time.155See generally Elisabeth de Fontenay, The Deregulation of Private Capital and the Decline of the Public Company, 68 Hastings L.J. 445 (2017); Andrew Ross Sorkin, C.E.O.s Meet in Secret over the Sorry State of Public Companies, N.Y. Times (July 21, 2016), https://www.nytimes.com/2016/07/
21/business/dealbook/ceos-meet-in-secret-over-sorry-state-of-public-companies.html [https://perma.cc/
52B4-86FV] (“[M]uch of the smart money in the United States is going—and staying—private . . . Publicly listed companies in the United States have become something of a dying breed.”).
The traditional IPO process has become the province of “large, late-stage companies.”156Spencer Israel, The Number of Publicly Traded Companies in the US Is Shrinking—or Is It?, MarketWatch (Oct. 30, 2020), https://www.marketwatch.com/story/the-number-of-companies-publicly-traded-in-the-us-is-shrinkingor-is-it-2020-10-30 [https://perma.cc/8XTT-FPP5]. Concurrently, small- and medium-cap companies—which have “[t]raditionally . . . delivered both higher risk and higher returns”157Frank Partnoy, The Death of the IPO, Atlantic (Nov. 2018), https://www.theatlantic.com/
magazine/archive/2018/11/private-inequity/570808 [https://perma.cc/M4MT-BY9R].
—have become far less accessible to the average investor.158Id. The SPAC boom may be in part interpreted as a testament to retail investor appetite for such high-risk, high-return opportunities; retail investors eying the profits of venture capital and private equity moguls when the markets are rallying and money is flowing freely may well think, “Why not me?” Inaccessibility is a two-way street—smaller, growing firms that cannot access the liquidity of public markets could be limited in their opportunities for “economic growth, hiring, and wealth creation.”159Id. (citation omitted).

While Congress and the SEC have undertaken efforts to make the traditional IPO process more inclusive,160See, e.g., Statement, U.S. Sec. & Exch. Comm’n, Spotlight on Jumpstart Our Business Startups (JOBS) Act (Apr. 5, 2012), https://www.sec.gov/spotlight/jobs-act.shtml [https://perma.cc/SV5D-LUGW]. the trend toward fewer, larger IPOs has proven persistent. It is thus not surprising—and is perhaps commendable—that private actors have experimented with alternative paths to the public markets. SPACs may not be the optimal result of these experiments. But in the absence of regulation that actually results in IPOs for more emerging, high-growth companies, such experimentation is likely to continue. Strong policing mechanisms for these experiments are necessary. Ex ante regulation, by its nature, is one or several steps behind such experiments, as SPACs aptly demonstrate. Regulation by agency enforcement is controversial, and even if it were not, agencies are resource-constrained and cannot catch everything. Shareholder litigation under broad, existing causes of action, such as Rule 10b-5, proxy fraud, and breach of fiduciary duty, constitutes a critical backup protection for investors in the face of private ordering experiments.

That some experiments, like SPACs, are designed to avoid certain types of liability (such as section 11) makes the efficacy of the remaining causes of action even more important. But if the SPAC-related litigation so far is anything to go by, it is unclear whether shareholder lawsuits are currently up to the job. Based on my sample, a very small subset of these lawsuits—post-closing merger challenges in the Multiplan mold—have resulted in standards that could improve a substantial shortcoming of the SPAC structure. The results of another 30% of these lawsuits—Rule 10b-5 claims—are as yet undetermined, but there are both meritorious and opportunistic explanations for these lawsuits, and the end results may lie somewhere in the middle. By far the largest bloc, comprising roughly half the claims in my sample, consists of pre-closing merger objections that are likely to perform little, if any, effective policing function.

This is most unfortunate, because in theory, these lawsuits could be among the most immediate and effective checks available against predatory structures innovated by private markets. The main advantage to merger objection claims is that they can be brought before a transaction closes, potentially saving shareholders from the consummation of a damaging deal. The supplemental disclosures issued in response to many merger objection claims could remedy prior misstatements or omissions and might force managers to do more comprehensive diligence or find better deals. The key advantage of a merger objection lawsuit is its immediacy; unlike most lawsuits, which can only be brought after the fact, merger challenges can directly affect the challenged transaction.161This advantage is both more and less salient in the SPAC context. It is less salient in that where there is no fraud, it makes little sense for shareholders to sue when, if they dislike the transaction for any reason, they can redeem their shares. For a parallel discussion of the interaction of redemption rights with plaintiff’s lawyer incentives in the mutual fund context, see John Morely & Quinn Curtis, Taking Exit Rights Seriously: Why Governance and Fee Litigation Don’t Work in Mutual Funds, 120 Yale L. J. 84, 142 (2010). This remedy is more broadly available than the roughly analogous appraisal remedy available to target shareholders in other mergers under Delaware law, which bars appraisal for certain transactions and in general, for shareholders of corporations whose stock is listed on an exchange or held of record by more than two thousand holders. R. Franklin Balotti, Jesse A. Finkelstein, John Mark Zeberkiewicz & Blake Rohrbacher, 1 Balotti & Finkelstein’s Delaware Law of Corporations & Business Organizations § 9.68 (4th ed. 2022). Shareholders electing to redeem their shares know exactly how much they will get, unlike shareholders seeking appraisal, who must rely in a judge’s valuation. Id. Moreover, exercising the redemption right does not require the filing of a lawsuit. Favoring the redemption right, which is structurally similar to appraisal (but more shareholder-friendly in virtually every way) over such fiduciary class actions seems like common sense. See Charles Korsmo & Minor Myers, Reforming Modern Appraisal Litigation, 41 Del. J. Corp. L. 279, 313 (2017) (finding that that appraisal actions are relatively rare and “associated with merits-related factors,” and hence, “stand[] as the polar opposite of the system of fiduciary class actions”). If there is fraud, however, pre-closing merger lawsuits may actually be more valuable to SPAC investors than to investors in ordinary mergers because they could enable shareholders to exercise their redemption rights. It is difficult to dispute that in any merger, it is desirable to prevent shareholder votes for a transaction that are based on fraud. But in a standard merger challenge, even if some disclosure was fraudulent and even if the defendant firm issues a corrective disclosure before the merger, shareholders dissatisfied with the content of that disclosure will be stuck with the transaction if the majority votes for it. Not so for SPACs. The revelations induced by a pre-closing lawsuit may be more consequential to individual SPAC shareholders than to shareholders in ordinary mergers because SPAC shareholders are not captives to the board and the majority; if they do not like the information revealed by the lawsuit, they can get out if they want to. Accordingly, pre-closing merger lawsuits may be valuable where SPAC managers engage in fraud, potentially even more so to individual SPAC investors than to investors in more traditional transactions.

But based on my early sample of SPAC-related litigation, it is not at all clear that pre-closing merger objections are having this effect. They are so numerous and inexpensive that shareholders could easily be forgiven for not taking them seriously as a signal of real wrongdoing by the management. The disclosures they provide do not appear to induce shareholders to redeem. They do not appear to be related to the returns on the final deal. And the monetary penalties they exact are often too small even to be disclosed and likely would not deter unscrupulous SPAC managers from continuing to lie. All this aligns with the conclusions that have been swirling among scholars and judges in recent years; that “these suits are not being filed with the expectation of obtaining a meaningful recovery for the plaintiff class but rather in order to obtain a quick disclosure and mootness fee.”162Cain et al., Mootness Fees, supra note 11, at 1783.

This is a problem bigger than SPACs. In general, the time frames in which an M&A transaction must close usually discourage defendants from attempting to defeat pre-merger litigation on the merits, even when that litigation is abusive. And, even if settlement costs are minimal in comparison to the size of the M&A transaction, transaction costs associated with litigation end up being visited on shareholders, for no or little appreciable benefit.163Rosenfeld v. Time Inc., No. 17-CV-9886, 2018 WL 4177938, at *4 (S.D.N.Y. Aug. 30, 2018).

Correcting the broader environment that has allowed pre-closing merger litigation to become so dysfunctional is beyond the scope of this Article.164A reasonable starting point, however, might be to consider extending the Trulia rule to other jurisdictions. It is worth observing, however, that even if pre-closing merger objections have different significance in the SPAC context than in standard mergers, and even if SPACs are a phenomenon of the moment, pre-closing merger challenges are worth saving, if not for SPACs, then for future transactions and innovations. The first best solution for SPAC-related litigation, and indeed, for many other transactions, would be to save them.

CONCLUSION

The litigation produced by the recent SPAC boom to date is intuitively odd: the likelihood of being sued is greater for SPACs whose shareholders elect to retain their shares, which suggests that they have greater confidence in the success of the merger. The likelihood that a de-SPAC transaction will generate a lawsuit does not, however, have to be related to the size of the deal, the experience of its managers, or virtually any other proxy for quality. This could mean that these lawsuits are, in fact, targeting fraudulent transactions, and the low redemption rates of these transactions reflect the fact that the shareholders have been lied to. Alternatively, it could mean that these lawsuits target low-redemption (and presumably higher quality) deals for some other reason, which may be opportunistic and unrelated to merit. Though these competing explanations are difficult to disentangle for the post-closing claims in my sample, I argue that the pre-closing merger cases in my sample do not appear to be meritorious, irrespective of whether they target fraudulent or non-fraudulent transactions. They do not appear to be an adequate substitute for section 11 liability, and if SPACs are in need of reform, private litigation may not be the optimal solution.

 

 

96 S. Cal. L. Rev. 553

Download

J.D. Duke Law Lecturing Fellow. For helpful comments, I am grateful to David Berger, John Coates, Patrick Corrigan, Elisabeth de Fontenay, Deborah DeMott, Jessica Erickson, Jill Fisch, Gina-Gail Fletcher, Zohar Goshen, Joe Grundfest, Sharon Hannes, Kobi Kastiel, Michael Klausner, Minor Myers, Michael Ohlrogge, Jim Park, Alex Platt, Adam Pritchard, Barak Richman, Tony Rickey, Amanda Rose, Michael Simkovic, Steven Davidoff Solomon, Andrew Tuch, Andrew Verstein, and participants in the Conference for Empirical Legal Studies, American Law & Economic Association Annual Meeting, Berkeley Law, Accounting & Business Workshop, BYU Deals Conference, and National Conference of Business Law Scholars. Errors are my own.

Co-Creating Equality

When a creative work has many co-creators, not all of whom contributed equally, how should they split ownership? In the absence of a contract, copyright law has long adopted an all-or-nothing answer to this question: if you are deemed to be a “co-author” you get an equal split; otherwise, you get nothing. Because the privileges of co-authorship are so great, courts have erected an onerous barrier to qualifying as a “co-author”: you must have had “control” over the whole collaborative work. This barrier has been criticized both for being arbitrary and for unfairly resulting in lesser contributors going unrecognized and uncompensated. But removing this barrier—in the context of the longstanding rule granting co-authors an automatic equal split—risks unfairly diluting majority contributors. So, in deciding whether to remove the barrier, we have to balance the perceived unfairness of minority contributors going uncompensated against the perceived unfairness of majority contributors being diluted. In this Article I will show that this question of perceived fairness can be answered empirically.

To determine creators’ revealed preferences for how to treat lesser contributing collaborators, I assemble two datasets. Both datasets are in the core copyright domain where the default co-authorship rules are most relevant: co-songwriting. First, I construct a dataset of the songwriting contribution levels, writing credits, and copyright registrations of every band that has a certified Gold Record and writes its own songs. This is over one thousand music groups. Second, by cross-correlating data released by the principal performance rights organizations in response to recent antitrust probes, I estimate royalty splits between the co-writers of over 1.2 million songs. The studies in this Article are the largest and most comprehensive investigation into joint authorship to date that accounts for parties’ contributions.

My primary finding is that the typical behavior of creators is to credit everyone involved in writing as a co-author, even the lesser contributors. A secondary finding is that co-authors who were lesser contributors typically share equally in songwriting royalties. Main contributors thus choose to share much more with lesser contributors than they would be compelled to under current law in the absence of a contract. This revealed preference suggests that we can adopt a more inclusive legal criterion for co-authorship—and in particular remove the arbitrary and exclusionary barrier that to be a co-author one must have control over the entire work—and that we can do so without violating creators’ own sense of fairness. Beyond copyright law and the music industry, these findings have implications for the design of incentives in intellectual property law and creative collaboration more broadly.

INTRODUCTION

Over the last few decades, people have been collaborating to produce creative works more than ever before, from songs to software. But in the absence of an explicit contract, the law of co-authorship now recognizes as “co-authors” only those contributors who have control over the whole collaborative work.1See infra Section I.C. This control doctrine2This is the theory of authorship advanced in Aalmuhammed v. Lee, 202 F.3d 1227, 1234 (9th Cir. 2000) and Erickson v. Trinity Theater, Inc., 13 F.3d 1061, 1068 (7th Cir. 1994), present in Thomson v. Larson, 147 F.3d 195, 202–03 (2d Cir. 1998) as the “decisionmaking authority” factor, and followed in those and other circuits. See infra Sections I.B–C. has been broadly criticized: the doctrine seems arbitrary; it may leave lesser contributors entirely uncompensated; and it may obscure the identities of the lesser co-creators.

Under current law, the question of who counts as a co-author has a lot riding on it. This is because there is a longstanding rule granting co-authors an automatic equal split of royalties.3Statutory law is silent on ownership shares, stating only that the “authors of a joint work are coowners of copyright in the work.” 17 U.S.C. § 201(a). The equal split default was developed as a common law rule. The contemporary common law of joint authorship was codified as part of the Copyright Act of 1976. See infra Sections I.A–B. Unless they have a contract that says otherwise, the default “equal split” rule is that joint authors share license proceeds equally.4While the ability to license the work on a nonexclusive basis is only one of a bundle of rights possessed by the copyright holder—the right to transfer the work on an exclusive basis most notable among them—it is the most economically significant right in most contexts, and therefore the focus of this Article.

Thus under current law, outcomes tend to be binary. Either you have control over the entire work, in which case you count as a co-author and by default get an equal split; or you do not, in which case no matter how much you contributed to the final product you are not a co-author and by default receive nothing.

There are three paths forward. One is to maintain the status quo, leaving the control doctrine in place, at the risk of unfairly leaving minor contributors entirely uncompensated5When efforts are not rewarded, rational workers scale back efforts to conserve their resources. Charles G. McClintock, Roderick M. Kramer & Linda J. Keil, Equity and Social Exchange in Human Relationships, 17 Advances in Experimental Soc. Psych. 183, 195 (1984). (among other problems). A second choice is to remove the control doctrine for co-authorship, while also removing the (more long-standing) “equal split” rule, and make the royalty share proportional to the level of contribution. A third option is to remove the control doctrine but leave the equal split rule in place, at the risk of unfairly diluting the shares of the main authors.

If the wrong choice is made, we risk disincentivizing creative collaboration, either by giving rise to credit allocations that creators view as unfair and demotivating, or at any rate by imposing the transaction costs of having to contract out of them.6See Dan L. Burk & Mark A. Lemley, Policy Levers in Patent Law, 89 Va. L. Rev. 1575, 1639 (2003) (“Rules are cheap to administer because they are simple and straightforward, but due to their inflexibility they may lead to costly outcomes if they fit a given situation poorly.”). The important question is then what is fair as judged by the creators7This appears to be the overriding concern of the courts, although couched in utilitarian language. It may be that courts intuitively recognize the (empirically sound) notion that creators’ perception of fairness can motivate them to create (or demotivate them when perceived fairness is lacking). See Stephanie Plamondon Bair, Rational Faith: The Utility of Fairness in Copyright, 97 B.U. L. Rev. 1487, 1502–06 (2017). My argument is rooted in the idea that the utilitarian ends of copyright may, in a joint authorship context, be served by attention to the fairness concerns of co-authors. For a discussion and further empirical support, see Sarah Polcz, Loyalties v. Royalties, Hastings L.J. (forthcoming 2023). to whom the default rules apply, not as judged by the general public or by legal scholars. Joint authorship default rules are uniquely relevant for co-songwriting; they are preempted by work for hire or other practices in most other contexts. For this reason, in this Article I will attempt to answer the following empirical question: as revealed by their actions, what do the majority of co-songwriters view as a sensible way to grant co-authorship and split songwriting license proceeds? In particular, how do they treat lesser contributing collaborators?

To pursue these questions, I assemble two datasets. First, I construct a dataset of the songwriting contribution levels, writing credits, and copyright registrations of over one thousand music groups—every band with a certified Gold Record that primarily releases its own songs, since certifications began to the present (1959–2021). I explore how co-author-crediting decisions are associated with features of the creative context including members’ levels of songwriting contributions, number of collaborators, geography, genre, and era. Second, I extend beyond music groups and explore royalty splits between co-writers for over 1.2 million songs by leveraging the data sharing undertaken by the American Society of Composers, Authors and Publishers (“ASCAP”)8ASCAP, along with Broadcast Music, Inc. (“BMI”), are the two largest organizations that collect public performance royalties on behalf of songwriters and music publishers (performance rights organizations, or “PROs”) in the United States. Combined, ASCAP and BMI control approximately ninety percent of the public performance licensing market. Anousha Sakoui, Justice Department Leaves Decades-Old Music Industry Decrees Unchanged, L.A. Times (Jan. 15, 2021), https://www.latimes.com/
entertainment-arts/business/story/2021-01-15/justice-dept-consent-decrees-music-industry-ascap [https://
perma.cc/5Z8F-7MXW].
in response to its ongoing antitrust action. I find that lesser contributors are most often credited as co-authors, and as co-authors they most often share equally in songwriting royalties. The results support the position that calls for rethinking the control doctrine while retaining joint authorship’s equal split default rule.

Part I reviews the history of the equal split rule, the phases of the development of the control doctrine in joint authorship law, and the competing proposals for its revision. Part II makes the case that empirical data can guide our way forward. Parts III and IV present the methodologies and results of two studies. Part V discusses the legal implications of the results.

I. CONTEXT

A co-author who licenses a joint work—for example, allowing a song to be used in a television show—must give their co-authors a share of that money.9H.R. Rep. No. 94-1476, at 121 (1976); Paul Goldstein, Goldstein on Copyright § 4.2.2.1, at 4:30 (3d ed. 2020 & Supp. 2020). If the co-authors have a contract between them, they receive the agreed amount. Otherwise, each co-author is entitled to an equal share,10Goldstein, supra note 9, § 4.2.2, at 4:29 (“Each co-owner’s share of license proceeds will be measured according to a principle of strict equality, and will not be proportioned to the quantity or quality of each co-owner’s contributions to the joint work.”). even if the parties have indisputably made unequal contributions to the joint work.11See, e.g., Sweet Music, Inc. v. Melrose Music Corp., 189 F. Supp. 655, 659 (S.D. Cal. 1960).

A. The Equal Split and the “Intent to Merge”

This was not always the case.12For a brief history of accounting in Anglo-American copyright law prior to 1874, see Accountability Among Co-Owners of Statutory Copyright, 72 Harv. L. Rev. 1550, 1553–55 (1959). For the first century after copyright laws appeared in the United States, there was no default rule dictating how to split co-authorship profits, or concerning co-authorship generally.13See Subcomm. on Patents, Trademarks & Copyrights of Sen. Comm. on the Judiciary, 86th Cong., 2d Sess., Copyright Law Revision: Studies Nos. 11–13, at 89 (Comm. Print 1960) (authored by George D. Cary) (“[N]either in the hearings nor in the report accompanying the bill that became the copyright law of 1909, does one find a reference to the problems of joint authorship or joint ownership.” (footnotes omitted)); Accountability Among Co-Owners of Statutory Copyright, supra note 12, at 150 (pointing out that no judicial statement on co-authorship had been made in Anglo-American law prior to 1871). Copyright registration was relatively unimportant for the first century of the United States under the Constitution, for reasons both philosophical14Copyright, and intellectual property protection generally, were regarded as species of monopoly, which could lead to political and social corruption. See 1 William F. Patry, Patry on Copyright § 1:18, Westlaw (database updated Sept. 2022). While these protections were regarded as too useful to be abolished, even early supporters like James Madison thought they should be treated with caution. See James Madison, Detached Memoranda, ca. 31 January 1820, Nat’l Archives, https://founders.archives.gov/documents/Madison/04-01-02-0549 [https://perma.cc/VV9L-5QYX]. and economic.15A calculation by Professor William Patry, placing copyright registrations between 1790 and 1800 at approximately five percent of domestically published books, suggested the local nature of publication, the relative lack of piracy, burdensome formalities of copyright, and the preponderance of British-authored books—for which no protection was available under the Copyright Act of 1790—as prevailing reasons for the lack of registration. Patry, supra note 14, § 1:19. The expansion of the domestic publishing industry, without a concomitant expansion of domestic authorship, would stymie copyright reform throughout the nineteenth century: piracy of foreign (mainly British) authored works was regarded as a boon to the free spread of ideas throughout the democratic polity. Peter Baldwin, The Copyright Wars 114 (2014). Economically, the primary focus was on the protection of the American publisher—often of unauthorized foreign reprints—rather than the American author. See id. at 118. But in the wake of the transatlantic success of American authors, most significantly Harriet Beecher Stowe’s Uncle Tom’s Cabin, Congress fully extended copyright protection to foreign authors in 1891. See id. at 121–22. The modern period of American copyright could be said to have begun, and the modern period of American joint authorship law would soon follow. When co-authorship did arise, co-authors settled among themselves whether or not they would share licensing income. The first judicial decision on accounting between co-authors was not until 1874. The court in Carter v. Bailey held that a co-author could license their joint work without their co-authors’ consent16Carter v. Bailey, 64 Me. 458, 463 (1874) (“In the absence of any contract modifying their relations, they are simply owners in common . . . each owning a distinct but undivided part which or any part of which alone he can sell, as in the case of personal chattels.”). and without having to redistribute that licensing income among any co-authors of the work not party to the licensing transaction.17Id. at 463–64. The rationale behind this was that the left-out co-authors were equally able to do the same if they put forth the effort, because a license to one person does not use up intangible property.18See id. at 462. Each co-author owned an undivided interest in the entire work, and so had the full right to license that property so long as they did not interfere with their co-authors’ rights to do the same.19See id. at 464–65. This mirrored the situation with patent co-owners.20Id. at 464; see also Theodore R. Kupferman, Copyright—Co-Owners, 19 St. John’s L. Rev. 95, 103–04 (1945); Accountability Among Co-Owners of Statutory Copyright, supra note 12, at 1555.

In the decades that immediately followed Carter, the copyright landscape in the U.S. was transformed by a boom of popular culture and means for its dissemination. By the early twentieth century, entertainment industries were expanding, changing, or being freshly created.21In the United States, record sales increased from approximately 3 million units in 1900 to 30 million in the early 1910s to 140 million in 1921. Pekka Gronow, The Record Industry: The Growth of a Mass Medium, 3 Popular Music 53, 59 (1983). Radio broadcasting stations increased from five in 1921 to over five hundred in 1924. Christopher H. Sterling & John Michael Kittross, Stay Tuned: A History of American Broadcasting 827 (3d ed. 2002). By 1930, 40.3% of American households owned radio sets; by 1940 the figure was 82.8%, and 95% saturation was reached by 1950. Steve Craig, How America Adopted Radio, 48 J. Broad. & Elec. Media 179, 182 (2004). For a history of the similar growth of the American film industry during this time period, see generally Eileen Bowser, The Transformation of Cinema 1907–1915 (1990) and Richard Koszarski, An Evening’s Entertainment: The Age of the Silent Feature Picture 1915–1928 (1990). Co-authorship grew from the minority case to a common way of producing creative works in important domains. Moreover, dissemination of works was so improved that the circumstances that justified the holding in Carter—that there was no duty to account because one co-author’s licensing of the joint work would not impair the excluded authors’ ability to license it—quite clearly no longer applied.22See Accountability Among Co-Owners of Statutory Copyright, supra note 12, at 1556 (contrasting the reach of publication in the time of Carter with the ability to reach “virtually the entire potential audience” with then-contemporary publication). Technologies like radio and film could reach the entire market for a joint work,23A similar logic was applied in Crosney v. Edward Small Prods., Inc., 52 F. Supp. 559 (S.D.N.Y. 1942) (accounting appropriate between co-owners of motion picture rights where licensing destroys the value of the res). leaving nothing for excluded authors and thereby effectively destroying the value of the work’s copyright for them.24See Shapiro, Bernstein & Co. v. Jerry Vogel Music Co., 73 F. Supp. 165, 168 (S.D.N.Y. 1947). Most later cases recognize—or at least recite—an equitable “constructive trust” basis for recovery, rather than a tenancy in common theory analogizing ouster or destruction of the res. See Maurel v. Smith, 220 F. 195, 201 (S.D.N.Y. 1915) (“[W]hen the Smiths took out these statutory copyrights the literary property, which by publication they used and destroyed . . . .”). Modern cases often leave the legal basis indeterminate. See, e.g., Oddo v. Ries, 743 F.2d 630, 633 (9th Cir. 1984) (describing duty to account as derived from “equitable doctrines relating to unjust enrichment and general principles of law governing the rights of co-owners” (quoting Harrington v. Mure, 186 F. Supp. 655, 657–58 (S.D.N.Y. 1960))).

This rule first shifted in 1915 with Maurel v. Smith.25Maurel, 220 F. at 201. In Maurel, three parties agreed to jointly author an operatic work. The plaintiff wrote the plot (the “scenario”), while the first defendant H. Smith translated the plot into script and claimed to have made substantial modifications. The second defendant added lyrics for musical numbers without reference to the plot.26The composer was not a party to the suit. The defendants then registered the copyright for the work without including the plaintiff, blocking her from exploiting her property, as under the 1909 Copyright Act an author’s rights followed from registration.27Copyright Act of 1909, Pub. L. No. 60-349, § 9, 35 Stat. 1075, 1077. The court held that the license proceeds were held in a constructive trust for the excluded co-author,28Maurel, 220 F. at 201. in effect creating a duty to account, because there had been no other way for the excluded co-author to obtain license revenue. Cases following Maurel, however, interpreted it as establishing a general duty for co-authors to account to each other for profits.29See, e.g., Jerry Vogel Music Co. v. Miller Music, Inc., 74 N.Y.S.2d 425, 428 (N.Y. App. Div. 1947), aff’d, 87 N.E.2d 681 (1949) (confirming the accounting rule in co-authorship as “promot[ing] sound and orderly marketing of a work and a fair division of profits on the basis of mutual interest”). The case itself concerned, as early joint authorship cases often did, the words and music of a song. Moreover, they were to share equally in those proceeds, at least in the absence of a contract stating otherwise. This position has been treated as settled law since it was established more than one hundred years ago.30See 1 Melville B. Nimmer & David Nimmer, Nimmer on Copyright § 6.08 (2022). Courts have not given consideration to uneven shares.31See Justin Hughes, Actors as Authors in American Copyright Law, 51 Conn. L. Rev. 1, 66 (2019) (noting that Sweet Music, Inc. v. Melrose Music Corp., 189 F. Supp. 655 (S.D. Cal. 1960) is the “rare, possibly lone case directly deciding this issue”), a finding with which my research agrees. In Thomson v. Larson, 147 F.3d 195, 205 (2d Cir. 1998), plaintiff Lynn Thomson proposed an award of “24 per cent of two-thirds of the authors’ share attributable to the work as a whole, or in other words, 16 per cent,” on the theory that she had co-authored two-thirds (book and lyrics, but not music) of a revised version of Rent with Jonathan Larson, 48% of which consisted of new material. Brief of Plaintiff-Appellant at 50, Thomson v. Larson, 147 F.3d 195 (2d Cir. 1998) (No. 97-9085). The court appeared to give no consideration to this suggestion.

Importantly, Maurel established that contributions to the joint work do not need to be balanced; co-authorship would be found so long as the collaborators share a “common design.”32Maurel, 220 F. at 199. One defendant had argued that the plaintiff should not be considered his co-author because substantial changes had been made to the plot she contributed,33Id. in essence asserting that his contributions to the totality were significantly greater than the plaintiff’s. On this point, Judge Learned Hand adopted the view put forth in the English case Levy v. Rutley,34Levy v. Rutley, L.R. 6 C.P. 523 (1871). that as long as the parties had agreed upon a common scheme for the joint work there “can be no difficulty in saying that they are joint authors of the work, though one may do a larger share of it than the other.”35Id. at 530 (opinion of Montague Smith, J.). Having established that the plaintiff was indeed a co-author with the defendants, Judge Hand found that the copyright was the resulting res of their three contributions, “and by every equitable rule the defendants hold any legal rights they have upon trust in the same proportion.”36Maurel, 220 F. at 201. In so declaring, Judge Hand applied the common law of tenancy in common to the parties’ relationship, one of equal ownership by default.37See Avner D. Sofer, Joint Authorship: An Uncomfortable Fit with Tenancy in Common, 19 Loy. L.A. Ent. L.J. 1, 7–8 (1998).

But tenants in common can refute their equal undivided shares by showing evidence of unequal financial contributions to the purchase of the common property.38This was established by the time Maurel was decided. See, e.g., In re McConnell, 197 F. 438, 441 (N.D.N.Y. 1912) (citing Bittle v. Clement, 54 A. 138 (N.J. Ch. 1903)). In the case of a joint work, the analog would be to adjust the co-authors’ shares to reflect some proportion of their inputs, whether the relative quantity of their creative contribution or value created. Judge Hand did not address this feature of tenancy in common explicitly, but effectively shut out its application in the case when he took up the claims of the second defendant. The second defendant, R. Smith (brother of defendant H. Smith), argued that he did not need to share with the plaintiff profits resulting from a separate publication of the song lyrics he had written for the opera. Judge Hand took the view that the lyricist could not claim the opera played no role in the later sales success of his lyrics. In a consequential declaration, Judge Hand found determining the contribution of the whole to the success of the part in this manner was not possible.39Maurel, 220 F. at 200 (“[I]t is impossible to say how much of their vogue was due to [the lyrics] alone, and how much to their presentation along with the opera as a whole. . . . I do not think that it is in the least possible to undertake a satisfactory analysis of the extent of the mutual influences between the parts of such a piece.”). For this reason, the lyricist would be required to split any profits from the separate sale of lyrics equally with his co-authors in the whole opera.

Judge Hand (and decisions accepting the logic of Maurel) treated that reasoning as sufficient to implicate the converse scenario as well: that the particular contribution of any co-author to the success of the whole could not be measured,40See Edward B. Marks Music Corp. v. Jerry Vogel Music Co., 140 F.2d 266, 267 (2d Cir. 1944) (Hand, J.) (“The popularity of a song turns upon both the words and the music; the share of each in its success cannot be appraised . . . .”). leading to the generalized pronouncement that when “several collaborators knowingly engage in the production of a piece which is to be presented originally as a whole only, they adopt that common design . . . and unless they undertake expressly to apportion their contributions, they must share alike.”41Maurel, 220 F. at 200. The underlying causal premises of Judge Hand’s reasoning is that when all authors’ contributions are necessary for a work’s value,42Professor Shyamkrishna Balganesh proposes the application of a Necessary Elements of a Sufficient Set (“NESS”) test to determine whether causation rises to the level of authorship. Shyamkrishna Balganesh, Causing Copyright, 117 Colum. L. Rev. 1, 57–61 (2017). the degree to which each comparatively adds to that value cannot be assessed. Each contribution has its effect at the level of whether or not it is made (that is, categorical) because the contributions are mutually contingent; in this sense all contributions are equally responsible for the work’s total value. This suggests an equal split because it cannot be said that one person’s contribution is “more” of a cause than another’s.

In Maurel, there was disagreement about the parties’ proportional contributions, as the defendants sought to justify excluding the plaintiff from the copyright registration by minimizing her contribution. But the decision suggests that Judge Hand considered the parties’ true contributions to be of comparable magnitude.43Such a perception would be supported by the plaintiff and first defendant having on several previous occasions negotiated an equal split of profits for a collaboration with the same division of labor as in the case at bar, although Judge Hand stated his decision was not based on this consideration. See Maurel, 220 F. at 198. It may be for this reason that, after rejecting the possibility of determining the responsibility of each party separately for the opera’s success, Judge Hand did not find it helpful to entertain the alternative of dividing royalty rights according to some measure of each author’s direct inputs. But whatever the reason, this position has been enforced even when parties did not dispute that the co-authors’ contributions to the joint work were not equal. In Sweet Music, Inc. v. Melrose Music Corp.,44Sweet Music, Inc. v. Melrose Music Corp.,189 F. Supp. 655 (S.D. Cal. 1960). the assignee of a co-author requested a three-quarter share in a song’s renewal copyright, on the basis that he had written “half the words and all the music.”45Id. at 659. In the absence of “evidence indicating that the ownership was intended as other than an undivided one-half interest for each of the co-authors,”46Id.; see also Eliscu v. T. B. Harms Co., 1966 WL 7662, at *2 (N.Y. Sup. Ct. Oct. 27, 1966) (“Plaintiff as a joint contributor to the composition is entitled to share equally with the other collaborators, absent any agreement to the contrary.” (citations omitted)). the court applied the default rule.

When the Copyright Act of 1976 was passed, it left unchanged court-made law on accounting responsibilities of joint authors to one another.47H.R. Rep. No. 94-1476, at 121 (1976). Courts had not yet addressed the question of how comparatively lesser a collaborator’s contribution could be while still being fairly entitled to the equal benefits of authorship. No court had given serious reconsideration to Maurel’s assertion that co-authors are entitled to equal shares of proceeds in the absence of a contract. The Copyright Act codified the criteria for joint authorship in Maurel,48Judge Hand further expanded upon Maurel’s analysis in Edward B. Marks Music Corp. v. Jerry Vogel Music Co., 140 F.2d 266 (2d Cir. 1944), in which joint authorship was used as a defense to an infringement action, and held that in a work originally intended to be joint, renewal of the copyright was to the whole work rather than its constituent elements—here the words and music of a song. The focus was on the nature of the work itself, rather than the mindset of the parties in relation to one another: because the words and music were intended to be performed as a single piece, joint authorship exists. Disregarding the relationship of the creating parties did, however, give rise to the notion that the parties themselves could be unknown to one another. Id. at 267. This is Judge Hand’s one innovation that was explicitly repudiated by the Copyright Act. See H.R. Rep. No. 94-1476, at 120 (1976) (emphasis added) (“The touchstone here is the intention, at the time the writing is done, that the parts be absorbed or combined into an integrated unit.”). requiring that collaborators have an intent to merge their contributions into a unitary whole.49See H.R. Rep. No. 94-1476, at 120 (1976) (“Under the definition of section 101, a work is ‘joint’ if the authors collaborated with each other, or if each of the authors prepared his or her contribution with the knowledge and intention that it would be merged with the contributions of other authors as ‘inseparable or interdependent parts of a unitary whole.’ ”).

B. Gatekeeping Against Lesser Contributors: From “Intent to Merge” to “Intent to be Co-Authors”

For the first decade after the Copyright Act of 1976 was passed, most courts followed a literal reading of the statute—and, per legislative history, the common law precedent—to decide joint authorship claims. The longstanding rule was that co-authors share equally in the benefits of co-authorship regardless of their relative contributions. Co-authorship rewards were potentially high, but the intent to merge standard for minting co-authors was low. Lesser contributors, with whom a work’s more significant authors may not have intended to collaborate,50For instance, if an author creates a work from one of their own previous joint works, does the “intent to merge” from the previous work carry over into the putative derivative work, joining the earlier co-author automatically? See Weissmann v. Freeman, 684 F. Supp. 1248, 1261 (S.D.N.Y. 1988), aff’d in part, rev’d in part, 868 F.2d 1313 (2d Cir. 1989) (finding a joint work in this fact pattern, which was overturned on appeal by a divided Second Circuit panel). or whose contributions were quantitatively small in comparison to their co-author’s,51See Fisher v. Klein, No. 86 CIV. 9522 (PNL), 1990 WL 10072477, at *1 (S.D.N.Y. June 26, 1990). The putative co-authors prevailed at the district court level in Fisher as in Weissmann. Id. at *19. While the Second Circuit’s doubts concerning Weissmann’s intent standard were addressed on appeal, Fisher was allowed to stand. Fisher, which had co-authorship turn on a “dominant author’s” intent to share authorship, was a major influence on the landmark Childress case, albeit to inhibit a finding of joint authorship rather than to support it. See Childress v. Taylor, 945 F.2d 500, 508 (2d Cir. 1991). were being granted co-authorship at the district court level, seeding frustration in the Second Circuit.

Beginning in the 1990s, courts heard a series of cases about creative works arising from joint efforts in which the disparities between the collaborators’ contributions were stark. Under the intent to merge statutory standard, they would nevertheless have been equal co-authors. Courts’ gut reaction to these cases was that equal co-authorship would be unfair.52Second Circuit courts had, on occasion, previously expressed concern with the relative contributions of the collaborators in joint authorship cases. See Kenbrooke Fabrics, Inc. v. Material Things, No. 82 CIV. 7187-CSH, 1984 U.S. Dist. LEXIS 15458, at *17–24 (S.D.N.Y. June 28, 1984); Picture Music, Inc. v. Bourne, Inc., 314 F. Supp. 640, 647 (S.D.N.Y. 1970). Primary creators, it was perceived, would not want to share equal proceeds with collaborators who had made lesser contributions to the work;53This is the implication behind the observation in Childress, not further explained by the court, that the “equal sharing of rights should be reserved for relationships in which all participants fully intend to be joint authors.” Childress, 945 F.2d at 509. When one author is a “dominant author,” it is “especially important.” Id. at 508. Why? The unspoken assumption—unspoken because it seems unquestionable—is that a majority contributor would naturally not want to share equally with someone who made a much smaller contribution. To overcome this “common sense” view requires a strong showing to the contrary. if forced by the law to do so, they would be disincentivized to collaborate out of fear of sharing authorship.54See Erickson v. Trinity Theatre, Inc., 13 F.3d 1061, 1069 (7th Cir. 1994).

But equal co-authorship was well entrenched in the law. Modifying collaborators’ co-authorship shares to reflect their relative contributions was not an option under consideration. Instead, courts granted a prerogative to greater contributors to share, or not share, co-authorship with a work’s lesser contributors. This was given effect by adding a mutual intent to be co-authors requirement to the statutory intent to merge and necessary independently copyrightable contributions.55Copyrightable works are “original works of authorship fixed in any tangible medium of expression.” 17 U.S.C. § 102(a) (2016). “[M]usical works, including any accompanying words,” are one category of works of authorship. Id. § 102(a)(2). A work may be fixed in a “copy or phonorecord.” Id. § 101. For a discussion of the originality requirement as it pertains to musical compositions, see infra notes 159–63. In joint works, an ongoing area of dispute is whether contributions need to satisfy a “non-de minimis” standard or should be independently copyrightable: these approaches are associated with Professors Melville and David Nimmer (the “Nimmer standard”), 1 Nimmer & Nimmer, supra note 30, § 6.07, at 6-20 to 6-21, and Paul Goldstein (the “Goldstein standard”), Paul Goldstein & P. Brent Hugenholtz, International Copyright 248 (2d ed. 2010), respectively. Most circuits follow the latter standard. These cases proposed that evidence of the parties’ subjective intentions to be co-authors could be inferred from, for instance, how the work was billed or credited.

C. Introduction of the Control Doctrine

While the intent to be co-authors test raised the bar for co-authorship, it did not foreclose it.56Furthermore, it wasn’t until Thomson v. Larson, 147 F.3d 195, 202 (2d Cir. 1998) that the Childress analysis was held to apply to fact patterns in which lesser contributions were “major,” or of a type that would be independently copyrightable. This led to the “conundrum” of Thomson having made independently copyrightable contributions on a non-work-made-for-hire basis to a work of which she was not an author. Id. at 205. The pressing question (if Thomson was not a co-author of the work, could she then enjoin the Larson heirs from producing Rent with the lines she contributed?) was avoided by the court on procedural grounds and formed the basis of subsequent litigation. See Jesse McKinley, Family of ‘Rent’ Creator Settles Suit Over Authorship, N.Y. Times (Sept. 10, 1998), https://www.nytimes.com/
1998/09/10/theater/family-of-rent-creator-settles-suit-over-authorship.html [https://perma.cc/MV99-HPTA]. This issue appears never to have been resolved in the circuit. See Kwan v. Schlein, No. 05 CIV. 0459 (SHS) (JCF), 2009 WL 10678967, at *5 (S.D.N.Y. Apr. 23, 2009) (“[W]hile it seems clear that Ms. Kwan is not a co-author, it is possible that, if her contributions were great enough, she might own a copyright as sole author in the portions she wrote.”).
A new requirement for co-authorship, a need to have control over the whole work, was invented. In the Seventh Circuit,57Erickson, 13 F.3d at 1064. it was framed as additional and necessary evidence of the mutual intent test for joint authorship. In the Ninth Circuit, it was framed as a new test of authorship, without which, as before, there could be no question of joint authorship—regardless of the extent of one’s copyrightable contribution.58A common approach in this line of cases was for the court to dismiss lesser contributions as “suggestions.” Erickson, 13 F.3d at 1072; Childress, 945 F.2d at 509; see also Thomson, 147 F.3d at 206 (defendant brief refers to plaintiff’s independently copyrightable contributions as “suggestions”). “Suggestions” says nothing as to the copyrightability of those contributions; it makes their relevance turn on their relationship to the control factor. Later cases would arguably turn on “control,” often to the near-complete exclusion of copyrightability considerations. In Aalmuhammed v. Lee,59Aalmuhammed v. Lee, 202 F.3d 1227 (9th Cir. 2000). appellant Jefri Aalmuhammed served (without a contract) as an Islamic consultant on the Warner Brothers film Malcolm X. In addition to these services, he made comparatively minor scriptwriting and directorial contributions that were included in the completed film. These contributions would have been independently copyrightable.60Id. at 1231. All creative contributors intended that their contributions were to be merged into the whole,61Id. satisfying the statutory intent test. The panel voiced concern that dominant authors would be deterred from beneficial collaboration if they had to share the benefits of authorship with a co-author whose contributions were substantially less,62Id. at 1235 (“Progress would be retarded rather than promoted, if an author could not consult with others and adopt their useful suggestions without sacrificing sole ownership of the work. Too open a definition of author would compel authors to insulate themselves and maintain ignorance of the contributions others might make.”). Referencing Childress’s description of the putative co-author’s contributions in that case as merely “some form of assistance,” id. (citing Childress, 945 F.2d at 504), the Aalmuhammed court envisioned a parade of horribles likely to follow if lesser contributors were granted co-authorship in joint works: “Claimjumping by research assistants, editors, and former spouses, lovers and friends would endanger authors who talked with people about what they were doing . . . .” Aalmuhammed, 202 F.3d at 1235–36. The work in the case at hand, Aalmuhammed’s work, fit into none of those suspect classifications. Here, the court is groping at a basis for a potential standard for when joint authorship is likely intended: note that the court appears to see the existence of a close relationship as indicative of a lack of co-authorship intent. strongly implying that on policy grounds they sought a construction of authorship that would exclude Aalmuhammed. As evidence against the existence of a mutual intent to be co-authors, the Ninth Circuit adopted the control concept introduced by the Seventh Circuit.63See id. at 1233 n.24. Focusing on Spike Lee’s control over including Aalmuhammed’s contributions in the film,64Id. at 1235 (“Aalmuhammed did not at any time have superintendence of the work. Warner Brothers and Spike Lee controlled it.” (citation omitted)). control was elevated as the most important factor needed to find there had been an intent to be co-authors.

Since Aalmuhammed, in the absence of a contract, lesser contributors’ joint authorship claims have turned on evidence establishing that they exercised control. As the joint authorship test for the Ninth Circuit,65Modern cases cite typically to Richlin v. Metro-Goldwyn-Mayer Pictures, Inc., 531 F.3d 962, 968 (9th Cir. 2008), which restated the Aalmuhammed factors as a concise test:

First, we determine whether the “putative co-authors ma[de] objective manifestations of a shared intent to be co-authors.” A contract evidencing intent to be or not to be coauthors is dispositive. Second, we determine whether the alleged author superintended the work by exercising control. Control will often be the most important factor. Third, we analyze whether “the audience appeal of the work” can be attributed to both authors, and whether “the share of each in its success cannot be appraised.”

Id. (citations omitted). In the absence of a contract, “control” or lack thereof is generally sufficient for determining co-authorship intent. To date, no case has turned on the “audience appeal” factor. For a comprehensive discussion of audience appeal’s role in joint authorship cases, see Timothy J. McFarlin, Shouting the People: Authorship and Audience in Copyright, 93 Tul. L. Rev. 443, 469–79 (2019). it has been applied in songwriting joint authorship cases,66In Ford v. Ray, 130 F. Supp. 3d 1358, 1363 (W.D. Wash. 2015), the putative co-author had allegedly contributed the beat that was the “basis for the song” and scratching for the chorus and solos. Applying Aalmuhammed, his claim was defeated because he lacked control over the whole composition, there were no objective manifestations of shared intent, and the court drew no conclusion on the audience appeal prong though he had allegedly contributed the beat that was the “basis for the song” and scratching for the chorus and solos. Id. at 1363–64. In addition to the plaintiff having waited too long to bring the claim, the court clearly did not countenance that a lesser contributor could fairly expect to be entitled to co-authorship status. Id. The court’s comments normatively take for granted that co-authorship for lesser co-authors is at the discretion of the dominant author, characterizing the plaintiff as “motivated by an unfair desire to cash in on the efforts of another.” Id. at 1364; see also Robertson v. Burdon, No. ED CV18-00397 JAK (SHKx), 2019 U.S. Dist. LEXIS 85468, at *19 (C.D. Cal. Apr. 3, 2019) (“The allegations . . . support the inference that [the plaintiff] and [the defendant] shared an intent that the songs would be written together.”). more often than not,67In a case involving three putative co-songwriters, Taylor v. Universal Music Corp., No. CV 13-06412 RGK (AJWx), 2014 U.S. Dist. LEXIS 195775, at *8–10 (C.D. Cal. Mar. 10, 2014), the district court appeared to require (at most) a lessened Aalmuhammed standard for a joint authorship claim to survive a motion to strike. (The standard was applied in full to the related sound recording.) The court referenced dicta in Aalmuhammed, similarly present in Childress, that “traditional” forms of joint authorship, for instance involving the music and lyrics of a song, might not require a full Aalmuhammed inquiry. See id. at *4 (citing Aalmuhammed, 202 F.3d at 1232); cf. Childress, 945 F.2d at 508 (“[Whether the putative joint authors regarded themselves as joint authors] requires less exacting consideration in the context of traditional forms of collaboration, such as between the creators of the words and music of a song.”). This is the only case I have found in which a court operating under the control standard was willing to consider the dicta that the test should perhaps be less stringent when involving traditional forms of co-authorship. although the former was discouraged by the Aalmuhammed court in dicta.68See Aalmuhammed, 202 F.3d at 1232 (“It is also easy to apply the word [“author”] to two people who work together in a fairly traditional pen-and-ink way, like, perhaps, Gilbert and Sullivan. . . . But as the number of contributors grows and the work itself becomes less the product of one or two individuals who create it without much help, the word is harder to apply.”). In practice, control has meant control over the whole work,69See id. at 1233 (“Burrow-Giles defines author as the person to whom the work owes its origin and who superintended the whole work, the ‘master mind.’ ” (citing Burrow-Giles Lithographic Co. v. Sarony, 111 U.S. 53 (1884))); see also Moi v. Chihuly Studio, Inc., No. C17-0853RSL, 2019 U.S. Dist. LEXIS 103576, at *9 (W.D. Wash. June 20, 2019); Beautiful Slides, Inc. v. Allen, No. 17-cv-01091-MMC, 2018 U.S. Dist. LEXIS 226907, at *8 (N.D. Cal. Sept. 7, 2018) (“[The defendant] contends she ‘had her roles for which she had nearly exclusive control’ . . . .”). though a minority position has found control over “separate and indispensable elements of the completed product” to meet the control requirement.70Reinsdorf v. Skechers U.S.A., 922 F. Supp. 2d 866, 872 (C.D. Cal. 2013) (quoting Morrill v. Smashing Pumpkins, 157 F. Supp. 2d 1120, 1124 (C.D. Cal. 2001)). But see Heger v. Kiki Tree Pictures, Inc., No. CV 17-03810 SJO (Ex), 2017 U.S. Dist. LEXIS 237195, at *13–16 (C.D. Cal. July 24, 2017) (explicitly repudiating this interpretation). More recently, there are also signs that the Aalmuhammed-like concept of control is resonating with other circuits and trumping creative contribution considerations.71In 16 Casa Duse, LLC v. Merkin, 791 F.3d 247, 252–53 (2d Cir. 2015), a director claimed joint authorship in a film in which the producer (as in Aalmuhammed) had failed to secure a work for hire agreement. The Second Circuit held that the “dispositive inquiry is which of the putative authors is the ‘dominant author,’ ” and cited the four Thomson factors—decisionmaking authority, billing or credit, agreements with third parties, and other evidence—in making the determination. Id. at 260. But whereas in Thomson the dominant author was found to be the one who contributed the significant majority of independently copyrightable material, Casa Duse’s authorship was predicated on an Aalmuhammed­like control standard, in which authorship requires no independently copyrightable creative contribution: “Casa Duse initiated the project; acquired the rights to the screenplay; selected the cast, crew and director; controlled the production schedule; and coordinated (or attempted to coordinate) the film’s publicity and release.” Id. The court held that these contributions represented greater control over the project than did the contributions of the director and therefore awarded sole authorship to the production company. Id. at 261; see also Anthony J. Casey & Andres Sawicki, The Problem of Creative Collaboration, 58 Wm. & Mary L. Rev. 1793, 1835 (2017) (“The court created the fiction of a dominant author and then that label was bestowed on the party exercising the fewest acts of creative authorship. It had to do this to consolidate formal ownership and authorship . . . .” (citation omitted)).

In Corwin v. Quinonez, 858 F. Supp. 2d 903, 912 (N.D. Ohio 2012), the plaintiff band member’s contributions to sound recordings were denied joint authorship status due to lack of mutual intent with the defendant band leader, principally under the control standard: the defendant did not “cede[] control of the recordings to Plaintiff” and “made the final decision of what [was] used for the song.” As in Aalmuhammed, the language used by the court here presupposes a single author despite the undeniably collaborative nature of the work. Corwin cites principally to Erickson v. Trinity Theatre, Inc., 13 F.3d 1061 (7th Cir. 1994), but also to Janky v. Lake Cnty. Convention & Visitors Bureau, 576 F.3d 356 (7th Cir. 2009). In Janky, the Seventh Circuit found joint authorship in a case in which the putative co-songwriter likely did not make an independently copyrightable contribution, see Janky, 576 F.3d at 364 (Ripple, J., dissenting), partly on the theory that the co-author “wielded considerable control over what the song finally looked like,” id. at 362. (Janky followed the earlier Gaiman v. McFarlane, 360 F.3d 644 (7th Cir. 2004) in repudiating, at least in certain circumstances, the Seventh Circuit’s longstanding adherence to the Goldstein standard.) Janky is, for now, the exception that tests the rule that the control standard is strictly a one-way ratchet for denying joint authorship claims. Its legacy across the circuits may be to serve as a means of granting authorship within the control framework to a party who has made no copyrightable contribution, and possibly no creative contribution at all. But the general consequence of the expanding influence of the control standard is that lesser contributors are blocked from the equal sharing of authorship, and lacking authorship, have no entitlements in the absence of a contract.72On very rare occasions, as in Aalmuhammed itself, unrewarded creative contributors have been allowed to pursue recovery under unjust enrichment or similar theories, although rarely with success. See, e.g., Ahn v. Midway Mfg. Co., 965 F. Supp. 1134, 1140 (N.D. Ill. 1997); Cabrera v. Teatro Del Sesenta, Inc., 914 F. Supp. 743, 769 (D.P.R. 1995). But see Lopez v. Musinorte Ent. Corp., 434 F. App’x 696, 699 (9th Cir. 2011) (upholding a jury award in which one member of a five-member band had received one-fifth of the band’s profits and future royalties). In the two circuits most consequential for the copyright industries,73As discussed supra, influential rulings in joint authorship issue primarily from the Ninth and Second Circuits. The Ninth (32.38%) and Second (16.71%) are also the two circuits that produce the largest volume of copyright litigation. Christopher A. Cotropia & James Gibson, Copyright’s Topography: An Empirical Study of Copyright Litigation, 92 Tex. L. Rev. 1981, 2000 (2014). The districts in which most of these cases originate (the Central District of California and the Southern District of New York) disproportionately find for the defendants in copyright cases. Id. at 2008. Three out of four plaintiffs in the Central District of California (77.46%) are “individuals or small firms.” Id. The most common type of copyright registration for individuals is “text and music” (that is, songs). Dotan Oliar, Nathaniel Patterson & K. Ross Powell, Copyright Registrations: Who, What, When, Where, and Why, 92 Tex. L. Rev. 2211, 2214 (2014). there is now a trend toward finding works to be single-authored.

The intent to be co-authors test, which replaced the intent to merge test, and the control doctrine, itself designed to further strengthen the intent to be co-authors test, are widely regarded by scholars as suboptimal. Courts aspired to recognize the interests of both lesser and greater contributors, but existing law was understood to force an all-or-nothing choice between the two groups.74That is, if the desired end was co-authorship of a joint work. Recovery less than “all” but more than “nothing” could potentially have been pursued through a claim of copyright infringement. Indeed, it is by analogy to recovery in infringement that scholars have advanced the possibility of proportional recovery in joint authorship. See Nimmer & Nimmer, supra note 30, § 6.08. Joint authorship typically arises in litigation as an affirmative defense to copyright infringement. In the leading cases, infringement claims are rarely advanced. This may be because, applying the analogy to infringement damages, recovery in most cases would not be worth the cost of litigation. But when it is undeniable that the creator contributed independently copyrightable material, a significant question is whether, absent an agreement to the contrary, they retain a separate copyright interest in that material. The retention of such a right in a commercial work would place significant pressure on the majority owner to settle to avoid hold-up costs. That is in fact how the Thomson case was ultimately resolved, and likely explains the ruling in Casa Duse that a director had no copyright interest in the film footage that he shot. The consequences of exclusion from co-authorship for lesser contributors include leaving them uncompensated for their work. Also of concern is the disordering effect of these standards on author identification, a central goal of copyright law. Goldstein criticizes the control standard as being “both overinclusive and under-inclusive.”75Goldstein, supra note 9, § 4.2.1.2, at 4:18.3. It allows contributors primarily of non-copyrightable expression, such as film producers, to be recognized as authors. At the same time, it complicates the identification of the authors of most other multi-authored works.

D. Scholarly Positions

Scholars generally agree that lesser contributors who make copyrightable contributions should be counted as co-authors.76In contrast to the courts, scholars often express concern over disincentivizing effects of joint authorship law on lesser contributors. See Abraham Bell & Gideon Parchomovsky, Copyright Trust, 100 Cornell L. Rev. 1015, 1020–21 (2015); Gregory N. Mandel, Left-Brain Versus Right-Brain: Competing Conceptions of Creativity in Intellectual Property Law, 44 U.C. Davis L. Rev. 283, 349–50 (2010); Jennifer Yamin, Note, Analyzing Aalmuhammed v. Lee in the Context of Entertainment Industry Employment, 8 NYU J. Intell. Prop. & Ent. L. 91, 114 (2018). Arguments in favor of authorship for lesser contributors are often subsumed within arguments in favor of proportionality, typically on fairness or efficiency grounds. See Mandel, supra, at 353 (“The outcomes would be more efficient because they would provide the proper incentives to potential collaborators . . . . The outcomes would be more equitable because each joint creator would be rewarded in appropriate proportion to his or her contribution.”). Other objections are purely legal: specifically, that nothing in the Copyright Act or the circumstances surrounding its enactment supports the idea that authorship requires equal contributions. See Mary LaFrance, Authorship, Dominance, and the Captive Collaborator: Preserving the Rights of Joint Authors, 50 Emory L.J. 193, 232 (2001). Beyond that, scholars are divided into two camps with respect to how the law ought to treat them vis-à-vis their majority contributing co-authors.

One camp would retain the equal split default while granting equal shares to lesser co-authors.77Professor Mary LaFrance offers this proposal: “Where [the] contribution is substantial as well as independently copyrightable, joint authorship should be presumed, and a party seeking to rebut that presumption would be required to show that the contribution in question was incorporated into the finished work under an express or implied derivative work license.” LaFrance, supra note 76, at 203. Goldstein would apply the plain language of the statute and allow equal ownership to all contributors of independently copyrightable material if they intended to merge their contributions in a unitary whole. If Warner Brothers does not want to share equal ownership in Malcolm X with Aalmuhammed, it should not fail to negotiate with him for the value of his services—and be more careful in the future.78Contra 1 Nimmer & Nimmer, supra note 30, § 6.08 (“If the only choice that the court faced was between making Aalmuhammed a half-owner or a non-owner of the resulting film, then that hard case would understandably force the bad law of the latter result.” (citation omitted)); Anthony J. Casey & Andres Sawicki, Copyright in Teams, 80 U. Chi. L. Rev. 1683, 1721 (2014). Goldstein also offers a less radical compromise, consistent with the logic of Childress, suggesting that courts could find an implied transfer of copyright ownership on the basis of the nature of the relationship between the collaborators. Goldstein, supra note 9, § 4.2.1.1. The example given is the editor and author relationship: editors rarely expect to share joint authorship in the work they edit. Similarly, given the prevalence of work for hire agreements in the film industry, it could fairly be said that Aalmuhammed did not expect to be a joint author of Malcolm X. However, Aalmuhammed testified that he approached an executive producer seeking credit as a screenwriter and was told “there is nothing I can do for you,” but that they would discuss the matter in the future. Aalmuhammed v. Lee, 202 F.3d 1227, 1231 (9th Cir. 2000). Goldstein levels similar criticisms of the extra-statutory introduction of the intent to be co-authors and control requirements to the Copyright Act’s intent language.79See Goldstein, supra note 9, § 4.2.1.1. While the statutory intent (intent to merge) approach “will sometimes give an economic interest to a contributor . . . who probably did not intend to receive it,” the Copyright Act should not be distorted to protect the economic interests of dominant contributors.80Id. A better outcome is for dominant contributors to bear the burden of adjusting shares via contract to avoid an undesired equal split.81Childress places that burden on the nondominant contributor. See Childress v. Taylor, 945 F.2d 500, 507 (2d Cir. 1991).

The other camp of scholars focuses on courts’ unease with non-equal contributors receiving equal authorship rights as the source of courts’ statute-distorting jurisprudence. This camp would endorse lesser co-authors receiving a lesser split.82See, e.g., Mandel, supra note 76, at 353–57. The shared assumption of this camp—that if lesser contributors reap financial rewards greater than their contributions seem to merit, majority contributors will be disincentivized and creative production will suffer as a result—is rarely questioned.83Paying high- and low-performing workers the same leads high performers to lower their efforts. Jason D. Shaw, Pay Dispersion, 1 Ann. Rev. Org. Psych. & Org. Behav. 521 (2014). The inefficiency of equal pay is widely accepted. There is an assumption that fairness has an important role to play in economic productivity. But there is more to the psychology of linking compensation to contributions than simply motivating cool-headed, rational workers. The sentiment held by many is that a contribution-based default should be established because fairness requires a correspondence84Perhaps intuitions are less strict than absolute proportionality, but they would at least require that the individual who contributed the most received the most; the individual who contributed the second most, the second most; and so on: that fairness requires a rank order between inputs and outputs. between inputs and outputs.85Aristotle, who noted that “in acts of justice what is equal in the primary sense is that which is in proportion to merit, while quantitative equality is secondary,” was an early proponent of this construction of fairness. Aristotle, The Nicomachean Ethics 151 (David Ross trans., Oxford World’s Classics ed. 2009). This dynamic between inputs and outputs is referred to as “equity” in justice studies; in sociology it has been called the “principle of differentiation”; in organizational behavior, “pay dispersion.” The boundaries of the concept shift only slightly across disciplines. See Morton Deutsch, Equity, Equality, and Need: What Determines Which Value Will Be Used as the Basis of Distributive Justice?, 31 J. Soc. Issues 137, 143 (1975); Jennifer L. Hochschild, What’s Fair: American Beliefs About Distributive Justice 111 (1986); Shaw, supra note 83.

There are two main approaches, which hold in common that it is possible to make adjustments to existing joint authorship law.86Other scholars propose alternate regimes that allow for proportional recovery. See Bell & Parchamovsky, supra note 76 (proposing a “copyright trust” that would allow for one controller of the work while contributors divided profits in proportion); Casey & Sawicki, supra note 78, at 1725 (separate authorship from ownership and grant ownership to the joint work’s “team manager”); Rochelle Cooper Dreyfuss, Commodifying Collaborative Research, in The Commodification of Information 397, 412 (Niva Elkin-Koren & Neil W. Netanel eds., 2002) (allow proportionality via “collaborative work[s]” that are not work for hire but fail the joint authorship test); Russ VerSteeg, Intent, Originality, Creativity and Joint Authorship, 68 Brook. L. Rev. 123, 179 (2002) (allow proportional recovery in quantum meruit if joint authorship is objectively unreasonable). The proposed implementation which has attracted the most scholarly support is a rebuttable presumption of equality.87See Brief of Professors Shyamkrishna Balganesh. Justin Hughes, Peter Menell, and David Nimmer as Amici Curiae in Support of Neither Party at 28–29, Garcia v. Google, Inc., 786 F.3d 733 (2015); Nimmer & Nimmer, supra note 30, § 6.08 (2019); Hughes, supra note 31, at 65–67 (2019); Roberta Rosenthal Kwall, “Author-Stories”: Narrative’s Implications for Moral Rights and Copyright’s Joint Authorship Doctrine, 75 S. Cal. L. Rev. 1, 58 (2001); Benjamin E. Jaffe, Rebutting the Equality Principle: Adapting the Co-Tenancy Law Model to Enhance the Remedies Available to Joint Copyright Owners, 32 Cardozo L. Rev. 1549 (2011). This may be because it receives textual support in the legislative history88“Under the bill, as under the present law, coowners of a copyright would be treated generally as tenants in common, with each coowner having an independent right to use or license the use of a work, subject to a duty of accounting to the other coowners for any profits.” H.R. Rep. No. 94-1476, at 121 (1976). and is drawn from the analogy of joint authorship with real property tenancy in common. In such cases, while undivided equal shares are the default, co-owners may rebut that presumption by showing that unequal contributions had been made to the purchase price. This would allow unequal shares to be awarded if there is evidence the co-authors’ contributions were unequal. Aalmuhammed’s contributions to Malcolm X could be determined by experts to have been responsible for some small fraction of the film’s success, and he could be compensated accordingly.89What is often left unexplained with these proposals is whether lesser contributors would only be entitled to royalties in proportion to their contribution, or whether they would have full authorial rights, such as the right to license the work on a nonexclusive basis. A smaller group of scholars, perhaps relying on the silence of the Copyright Act as to shares in the copyrighted work, argue that proportionality should be the default rule in all joint authorship cases. For these scholars, fairness, as they believe it to be perceived, is a paramount concern. If songwriters prefer to split equally even when a co-author makes a lesser contribution, it would be a challenge to this notion of fairness.

II. WHAT SHOULD BE DONE?

How should we choose whether to reject the intent to be co-authors and control doctrines and instead include lesser contributors as co-authors? And if we do decide to include lesser contributors as co-authors, what would be the most efficient rule for splitting revenue between joint authors: retaining the equal split, or revising it to be contributions-based? With respect to the first issue to be decided, it is possible, if unlikely, that most creators would prefer lesser creators to be excluded from co-authorship and simply paid for their services. Often in collaboration situations where the parties do not have a contract or the contract is silent about co-authorship shares, which is when the default joint authorship rules apply, a main creator will not have funds to pay lesser contributors in advance for services, and the unevenness of contributions may not be clear until the joint work is complete.

With respect to the second question, some would argue that well-established common law default rules are presumptively efficient90See Richard A. Hillman, The Richness of Contract Law 225 (1997) (noting the standard view that the rule most parties would want is synonymous with the efficient rule). because they have been accepted by parties across contexts over time.91See Alan Schwartz & Robert E. Scott, The Common Law of Contract and the Default Rule Project, 102 Va. L. Rev. 1523, 1585–86 (2016) (“[E]nduring common law rules have to be transcontextual; that is, they must be satisfactory to parties over broad sections of the economy. . . . [F]ew rules can satisfy the structural requirement that they are (almost) everywhere applicable just because commercial parties (almost) everywhere like them.”). The control doctrine did not arise to thwart the equal split rule for nearly a century.92The tenure of the equal split rule is comparably long to the cohort of common law contract default rules argued to have stood the test of time as trans-contextually acceptable to parties and therefore efficient. See id. at 1535. It has similarly satisfied the criteria of having been applied and accepted in different industry contexts. See, e.g., Greene v. Ablon, 794 F.3d 133 (1st Cir. 2015) (scholarship); Brownstein v. Lindsay, 742 F.3d 55 (3d Cir. 2014) (software); Berman v. Johnson, 518 F. Supp. 2d 791 (E.D. Va. 2007), aff’d, 315 F. App’x 461 (4th Cir. 2009) (film); Gordon v. Lee, No. 1:05-CV-2162-JFK, 2007 WL 1450403 (N.D. Ga. May 14, 2007) (architecture); Words & Data, Inc. v. GTE Commc’ns Servs., Inc., 765 F. Supp. 570 (W.D. Mo. 1991) (business); Strauss v. Hearst Corp., No. 85 CIV. 10017 (CSH), 1988 WL 18932 (S.D.N.Y. Feb. 19, 1988) (advertising); Fishing Concepts, Inc. v. Ross, 226 U.S.P.Q. 692, 696 (D. Minn. 1985) (advertising); Mister B Textiles, Inc. v. Woodcrest Fabrics, Inc., 523 F. Supp. 21 (S.D.N.Y. 1981) (textiles); Donna v. Dodd, Mead & Co., 374 F. Supp. 429, 430 (S.D.N.Y. 1974) (literature); Noble v. D. Van Nostrand Co., 164 A.2d 834 (N.J. Super. Ct. Ch. Div. 1960) (scholarship); G. Ricordi & Co. v. Columbia Graphophone Co., 258 F. 72 (S.D.N.Y. 1919) (music). The expansive treatment of work made for hire under the Copyright Act of 1909 may have forestalled potential joint authorship claims and thus challenges to the equal split. On the other hand, the equal split rule’s detractors regard as self-evident that it is unfair and unpopular. The suboptimality of the equal split default feels like a frictionless assumption. Proportional compensation is the norm in wage labor contexts, which are related but distinct.93Shaw notes: “Moreover, theories purportedly supporting the benefits of pay compression do not, in a general sense, advocate equal pay for unequal work. . . . [E]ven Pfeffer’s (1998) simplified practitioner-oriented treatment, which advocates pay compression as a best practice, also extols individual pay-for-performance as something organizations should universally adopt.” Shaw, supra note 83, at 534 (citing Jeffrey Pfeffer, Competitive Advantage Through People: Unleashing the Power of the Work Force (1994)). It is not clear what inputs are, but they are somehow quantitative and contextually determined. Often there is an assumption that focal inputs should be those antecedents with a more direct link to outcomes. See Robert Folger, Rethinking Equity Theory, in Justice in Social Relations 145 (Hans Werner Bierhoff, Ronald L. Cohen & Jerald Greenberg eds., 1986).

The traditional view is that an efficient default reflects the preferences of “most contracting parties—or perhaps most contracting parties in a given industry.”94Russell Korobkin, The Status Quo Bias and Contract Default Rules, 83 Cornell L. Rev. 608, 616 (1998). The universe of possible contracting parties in this case is those creative collaborators who would potentially contract with one another over the division of license proceeds. This universe consists principally of
co-songwriters. The core copyright industries include literature, music, theater, film, the media, photography, software, visual arts, and advertising.95The World Intellectual Property Organization (“WIPO”) identifies the core copyright industries as those “wholly engaged in the creation, production and manufacture, performance, broadcasting, communication and exhibition, or distribution and sale of work and other protected subject matter.” World Intell. Prop. Org., Guide on Surveying the Economic Contribution of the Copyright Industries 51 (2015), https://www.wipo.int/edocs/pubdocs/en/copyright/893/wipo_pub_
893.pdf [https://perma.cc/E8AD-9ZQ7]. These include literature, music, theatre, film, the media, photography, software, visual arts, advertising services, and collective management societies. Id. at 52–53.
Collaborative creative production has been on the rise across all of the core copyright industries, but it nevertheless accounts for a very small proportion of output (<0.5%) in most of the visual arts96Approximately 500,000 pieces of contemporary art—works of “painting, sculpture, drawing, photography, prints, installation & video” created by artists born after 1945—were sold at auction in the last decade. See The Contemporary Art Market Report 2018, Artprice, https://www.artprice.
com/artprice-reports/the-contemporary-art-market-report-2018 [https://perma.cc/4S6J-W2JS]. The Museum of Modern Art (“MoMA”) Collection dataset (“MoMA dataset”) contains records of nearly 200,000 artworks and their creators, including dates of birth and death, year of creation, artwork type, and date of creation. See The Museum of Modern Art (MoMA) Collection, Github, https://github.com/MuseumofModernArt/collection [https://perma.cc/Z2H8-CDD4]. Filtering the dataset for contemporary artists (artists born 1945 or later) returns 21,238 works, 20,952 of which were created by a single artist, or a co-authorship rate of 1.3%. Extrapolating based on the Artprice data, approximately 6,500 pieces of co-authored art have been auctioned over the last ten years.
(for example, fine art,9710/2358 (“Painting” Classification) in MoMA dataset. The Museum of Modern Art (MoMA) Collection, supra note 96. sculpture,9816/1725 (“Sculpture” Classification) in MoMA dataset. Id. and photography991449/31730 (“Photography” Classification) in MoMA dataset. Id.). The rate is higher, albeit still very low, in literature and theater. In music, however, the co-authorship rate of songs100Hereinafter, “songs” refers to songs listed in performance rights organization databases. is 37%.101See infra Section IV.B. Copyright works are much more likely to be produced through collaboration in the film, software,102See Patrick Cauldwell, Code Leader: Using People, Tools, and Processes to Build Successful Software xxi (2008) (noting that “almost all software projects” are written by teams of programmers). and music103See supra note 101 and accompanying text. industries, and in academia.104Approximately 3 million articles are published yearly in scholarly peer-reviewed English-language journals. Rob Johnson, Anthony Watkinson & Michael Mabe, The STM Report: An Overview of Scientific and Scholarly Publishing 5 (5th ed. 2018), https://www.stm-assoc.org/2018_10_04_STM_Report_2018.pdf. Outside of the humanities, the vast majority of scholarly articles are co-authored. See Smriti Mallapaty, Paper Authorship Goes Hyper, Nature Index (Jan. 30, 2018), https://www.natureindex.com/news-blog/paper-authorship-goes-hyper [https://perma.cc/27AH-P6P9]. However, in all but the music industry, the joint authorship default rules are for the most part inapplicable.

There are two reasons for this. First of all, only authors can be joint authors. This means that whenever collaborators’ works are officially “authored” by their employer,105As set forth by statute,

In the case of a work made for hire, the employer or other person for whom the work was prepared is considered the author for purposes of this title, and, unless the parties have expressly agreed otherwise in a written instrument signed by them, owns all of the rights comprised in the copyright.

17 U.S.C § 201(b) (2016). the default rules are not relevant.106The Copyright Act defines a work made for hire as,

a work prepared by an employee within the scope of his or her employment; or . . . a work specially ordered or commissioned for use as a contribution to a collective work, as a part of a motion picture or other audiovisual work, as a translation, as a supplementary work, as a compilation, as an instructional text, as a test, as answer material for a test, or as an atlas, if the parties expressly agree in a written instrument signed by them that the work shall be considered a work made for hire.

17 U.S.C. § 101 (2016). Work for hire arrangements are not generally practiced in the music industry,107Modern courts have not typically held musicians to be employees of their record labels. See Daniel Gould, Time’s Up: Copyright Termination, Work-for-Hire and the Recording Industry, 31 Colum. J.L. & Arts 91, 109 (2007). Sound recordings, mentioned elsewhere in the Act, are also notably absent from the list of works made for hire, a circumstance which has generated considerable scholarly comment. See, e.g., Nimmer & Nimmer, supra note 30, § 5.03; Gould, supra, at 108 (2007); Mary LaFrance, Authorship and Termination Rights in Sound Recordings, 75 S. Cal. L. Rev. 375, 379 (2002). Sound recordings have been held not to fall under the “audiovisual work” label. Lulirama Ltd. v. Axcess Broad. Servs., Inc., 128 F.3d 872, 878 (5th Cir. 1997). Works made for hire are otherwise confined strictly to the categories of work enumerated in section 101. See Cmty. for Creative Non-Violence v. Reid, 490 U.S. 730, 748 (1989). Even when music is composed as a work made for hire, royalties are distributed to the actual author per ASCAP rules. Robert Brauneis, Copyright and the World’s Most Popular Song, 56 J. Copyright Soc’y U.S.A. 335, 411 (2009). but they are the norm in film108See Nimmer & Nimmer, supra note 30, § 6.05 n.19 (“The reality is that contracts and the work-made-for-hire doctrine govern much of the big-budget Hollywood performance and production world.” (quoting Garcia v. Google, Inc., 786 F.3d 733, 743 (9th Cir. 2015))). and software. (The stakes for Warner Brothers in Aalmuhammed illustrate why.) Second, in academia, journal articles, which comprise the bulk of academic publishing,109Academic journal revenue is roughly three times larger than that of academic book publication. See Johnson et al., supra note 104, at 22. Over 3 million science, technology, and medicine (“STM”) scholarly articles are published per year. Id. at 5. as a matter of course reassign royalty streams from creators to publishers.110See Ann Okerson, With Feathers: Effects of Copyright and Ownership on Scholarly Publishing, 52 Coll. & Rsch. Libr. 425, 427–28 (1991). The open access movement opposes assigning copyrights to paywalling publishers, but even under an open access model the publishing royalties would still be inconsequential for authors, albeit for a different reason.111While authors typically retain their copyrights in the work under an open access regime, the works are often published under royalty-free licenses. See Giancarlo Frosio, Open Access Publishing: A Literature Review 98 (Ctr. For Copyright & New Bus. Models in the Creative Econ., Working Paper 2014/1), https://papers.ssrn.com/sol3/papers.cfm?abstract_id=2697412 [https://perma.cc/UJ6L-7W4X]. These practices—work for hire and publisher assignments—cover two paths to corporate ownership of copyrightable work and most contexts of creation in film, the media, software, and advertising.112See Nimmer & Nimmer, supra note 30, § 6.05 n.19 (“The reality is that contracts and the work-made-for-hire doctrine govern much of the big-budget Hollywood performance and production world.” (quoting Garcia v. Google, Inc., 786 F.3d 733, 743 (9th Cir. 2015))).

A. The Unique Relevance of Co-Songwriting

Songwriting is not included in the categories of creative work covered by work for hire.113See 17 U.S.C. § 101 (2016). Typically, musicians who are signed by record labels will be paid an advance against future royalties.114These are often structured as funds, which combine the recording budget with advances against royalties in a lump sum, but traditional advances are still utilized. See Donald S. Passman, All You Need to Know About the Music Business 111 (9th ed. 2015). In exchange, most artists transfer the copyrights to their recordings (“the masters”) to the label. See id. at 211. However, in those cases, this is not an assignment of ownership of their songwriting copyrights to the labels.115Songwriters often sign away a portion of their songwriting copyrights to publishers through co-publishing agreements. See id. at 235; Jill A. Michael, Music Copublishing and the Mysterious ‘Writer’s Share,’ 20 Ent. & Sports L. 13, 14 (2002). Record labels may also take a portion of the songwriting copyright, or the proceeds thereof, as part of a “360 deal.” See Edward Pierson, Negotiating a 360 Deal: Considerations on the Promises and Perils of a New Music Business Model, 27 Ent. & Sports L. 1, 34 (2010). So co-songwriters, as a matter of course, will be either contracting out of, or relying on the default rules for, joint authorship to specify the allocation of songwriting royalties.

Songwriting is one of the copyright domains in which the parties are the least likely to be thinking in legalities at the time of creation, or even in terms of industry norms: songwriters have only limited knowledge of other songwriters’ split practices. Unlike scholarship, commercial filmmaking, or software development, songwriting can be (and often is) undertaken by a handful of teenagers in a garage band for whom default rules in the absence of contract are particularly relevant.116Theater is probably the closest analogue. Like songwriting, it has been the domain of several landmark joint authorship cases, discussed supra. See Thomson v. Larson, 147 F.3d 195 (2d Cir. 1998); Erickson v. Trinity Theater, Inc., 13 F.3d 1061 (7th Cir. 1994); Childress v. Taylor, 945 F.2d 500 (2d Cir. 1991). Moreover, 68% of music groups are primarily composed of friends or family members.117Polcz, supra note 7, at 38. Other domains where joint authorship rules tend to apply do not approach the volume of creation—and therefore individual works which are potentially subject to contracting about royalty splits—of co-songwriting. There are over 750,000 people who have co-written a song in the ASCAP repertory alone.118ACE Repertory Search, ASCAP, https://www.ascap.com/repertory [https://perma.cc/8A8F-QB3W] [hereinafter ASCAP Repertory]. The downloadable version in CSV format (current as of Mar. 26, 2020) was used.

Finally, joint authorship rules are financially consequential for co-songwriters.119See Justin Hughes & Robert P. Merges, Copyright and Distributive Justice, 92 Notre Dame L. Rev. 513, 532 (2016) (“[E]ven in the music industry—and with access to very little empirical data—we can see the powerful role that copyright plays in securing incomes for creative individuals.”) Justin Hughes and Robert Merges also note the importance of the copyright system in advancing African-American prosperity at a time when the failure of other systems to do so is being increasingly, if belatedly, recognized. See id. at 551–55. The economic importance of songwriting royalties120See id. at 532–33 (making as a “low-ball estimate,” PROs collected at least $4.1 billion in public performance royalties for songwriters in the 2010 to 2014 period). is not an argument in favor of focusing on co-songwriters to set the default joint authorship rules. But it is an additional reason why we should be interested in efficiently setting the joint authorship rules more generally. In the history of popular music, the importance of songwriting royalties to musicians has waxed and waned.121In 1978, the Copyright Act revised the mechanical royalty rate, which had remained unchanged since the Copyright Act of 1909 set it at 2 cents per song. Royalties started at 2.75 cents per song, and have increased to 9.1 cents per song today. U.S. Copyright Off., Mechanical License Royalty Rates, https://www.copyright.gov/licensing/m200a.pdf [https://perma.cc/PNY6-7S5N]. Songwriting royalties became meaningful for musicians after the 1960s transition to solo artists and bands who wrote their own songs122American popular music of the pre-rock era was largely written by professional songwriters, rather than songwriter-performers. See generally Russell Sanjek, American Popular Music and Its Business: The First Four Hundred Years (1988). and who, by the 1970s, had the bargaining power to control their own publishing revenues.123See id. at 537–39. Beginning in the band era of the 1960s, songwriting royalties for the most successful songs were worth millions.124See id. at 473. The revenue for artists derived from songwriting has declined again with the advent of streaming.125Royalties generally, and songwriting royalties in particular, took a significant hit from the rise of streaming music, legal and otherwise. Streaming as a whole has undercut physical music sales. See John Seabrook, The Song Machine: Inside the Hit Factory 187 (2015). Even legal streaming services like Spotify can offer little benefit to songwriters. Id. at 188 (“On most streaming services . . . the owners of the recording get most of the performance royalty money, while the owners of the publishing get only a fraction of it.”); Jason B. Bazinet, Mark May, Kota Ezawa, Thomas A. Singlehurst, Jim Suva, Alicia Yap, Jennifer Breithaupt, Kevin Brown & Bjorn Niclas, Putting the Band Back Together: Remastering the World of Music 74 (2018), https://www.citivelocity.com/
citigps/music-industry/ [https://perma.cc/HAG2-5TR9] (“[I]f you are a fully independent artist, you are likely to earn around $15,000-$20,000 per million plays on a streaming service and that gets split between the writers . . . . [I]f you are on an ‘old industry’ label, you can expect to only get $1,700 per million plays, because the label is taking the lion’s share . . . .”) Labels feeling the pinch from declining physical sales have also increasingly turned to “360 deals,” whereby artists are obliged to surrender percentages of other revenue streams (like songwriting) in order to land a recording contract. See Passman, supra note 114, at 102–03; Lee Marshall, The 360 Deal and the New Music Industry, 16 Eur. J. Cultural Stud. 77 (2012). Increasing royalties from digital streaming may indicate that the importance of this revenue stream will only increase in the future. See Ed Christman, NMPA Claims Victory: CRB Raises Payout Rate from Music Subscription Services, Billboard (Jan. 27, 2018), https://www.billboard.
com/articles/news/8096590/copyright-royalty-board-crb-nmpa-spotify-apple-music-streaming-services [https://perma.cc/YC99-TWH5]; U.S. Sales Database, RIAA, https://www.riaa.com/u-s-sales-database/ [https://perma.cc/GA2H-EZMJ] (showing a reversal in recorded music revenue decline due to the growth of monetized streaming).
Nevertheless, songwriting royalties are a significant portion of total income for many successful musicians. This became particularly clear during the COVID pandemic which eliminated touring income for over a year for many artists. A survey prior to the COVID pandemic of working musicians found that, in the most recent decade, indie rock bands have earned about 21% of their gross income from songwriting royalties and advances on those royalties.126See Future of Music Coa., Artist Revenue Streams: Case Study: Indie Rock Composer-Performer 9 (Mar. 15, 2012), http://money.futureofmusic.org/wordpress/wp-content/uploads/2012/03/ARScasestudyA.pdf [https://perma.cc/6UUY-MLPN]. This information was gathered by the Future of Music project, which sought to compile information on musicians’ and composers’ revenue streams via surveys and interviews. See Artist Revenue Streams, Future of Music Coa., http://futureofmusic.org/article/research/artist-revenue-streams [https://perma.cc/XMS3-VHNK]. For a discussion of the survey’s results, see Peter DiCola, Money From Music: Survey Evidence on Musicians’ Revenue and Lessons About Copyright Incentives, 55 Ariz. L. Rev. 301 (2013). Across other genres this figure was approximately 8% of income,127The survey found that “[m]usicians in rock, pop, country, folk and all other genres [other than classical or jazz] earn 8% of their revenue from compositions.” DiCola, supra note 126, at 329. rising to 39% for people who identified as composers, whether performing or not.128Self-identified composers, including both performers and non-performers, earned 39%. Id. And songwriting royalties often provide a measure of financial security for musicians. Unlike money from tours, for instance, the checks keep coming in once a musician’s most active career years are over.129See, e.g., Dean Goodman, Songwriter “Dirty Dancing” All The Way to the Bank, Reuters (Nov. 10, 2010), https://www.reuters.com/article/uk-dirtydancing/songwriter-dirty-dancing-all-the-way-to-the-bank-idUKTRE6A84IJ20101110 [https://perma.cc/T3B8-MUP5] (“Previte estimates that he gets quarterly checks of $10,000 to $30,000 for radio airplay, additional quarterly checks of $50,000 to $100,000 from the hit stage adaptation, and annual checks of $100,000-$125,000 when [1987’s “(I’ve Had) The Time of My Life”] is used in commercials.”); J.J. Cale: A Veteran Songwriter’s ‘Old Man’ Music, NPR (Feb. 25, 2009), https://www.npr.org/2009/02/25/101148876/a-veteran-songwriters-old-man-music [https://perma.cc/S9HM-STKD] (“Those royalty checks keep coming in, so Cale doesn’t have to tour or record much.”). Songwriters can also receive substantial payouts for licensing their compositions for film, TV, commercials, or video games.130Synchronization fees vary based on the media, how the song is used, how much of the song is used, and the status of the songwriter. Passman, supra note 114, at 265–66. Typical fees for using songs in television shows can range from $10,000 to $50,000 or more, depending on the previous popularity of the song and how prominently it features. Id. at 269. These fees will scale if, for example, the song is licensed for less or more than a year. If the master is being used in addition to the song, a master use license is also required; this is typically equal in cost to the synchronization fee. Id. at 265. Fees for licensing a song for television play can be as low as $1,000131For instance, if an artist accepts a low-ball offer in exchange for audience exposure. Id. at 269. per year to more than $100,000132Id. at 270. per year for well-known hits.

If you combine the considerations of (1) collaborations not covered by employment, work for hire, and where there is no standard practice of assignment of copyright, with (2) the volume of collaborations and works produced, there is a strong case to be made that default joint authorship rules are primarily relevant for co-songwriting. It may be that by volume, default joint authorship rules that are efficient for co-songwriting are efficient for most co-authorship cases to which those rules apply in general.

If fairness in labor contexts means applying the proportionality principle, we would expect musicians to find the equal split default rule unfair. But there is no evidence suggesting that songwriters tend to take issue with Judge Hand’s view that they must “share alike.” There has been no movement against it. Proposed amendments to the Copyright Act have never sought to revise the equal split rule, nor have music industry representatives testifying before Congress problematized it. This suggests that the equal split rule has persisted not merely because it is precedent, but because either very little is at stake, or because it leads to what collaborators consider a fair result.

B. Identifying Efficient Joint Authorship Rules Is an Empirical Task

If one agrees with the prevailing view that an efficient default reflects the preferences of most contracting parties,133There is also the view that the most efficient default rule for a given fact pattern may not be a “majoritarian” default, but rather a “penalty” default: a default that most contracting parties would not prefer. See, e.g., Ian Ayres & Robert Gertner, Filling Gaps in Incomplete Contracts: An Economic Theory of Default Rules, 99 Yale L.J. 87, 91 (1989) (explaining that penalty defaults are intended to serve a twofold information-forcing purpose: to force one party to reveal to another information that, if concealed, could increase their private gain on the contract at the expense of the total gain; and to force parties to reveal information to courts when it would be less efficient for courts themselves to discover it.) The equal split default was clearly not intended as a penalty default: it was derived from the equal split rule in tenancy in common, which was itself not intended as a penalty default. Judge Hand’s (and subsequent courts’) reasoning does not support such a reading. Judge Hand disagreed with courts dividing joint authorship shares other than equally, not because he thought it would impose a fact-finding burden on the court that would be better placed on the parties, or because he identified an asymmetry of information or market power between the parties, but because he regarded it (correctly or not) as philosophically impossible. Maurel v. Smith, 220 F. 195, 200 (S.D.N.Y. 1915). Could the equal split default function as a penalty default? Assuming majority or plurality contributors do not want to split equally, it would serve as a penalty default to them. But majoritarian defaults are penalty defaults to those not in the numerical majority; in most arrangements, the lesser contributors would be the more numerous beneficiaries. Furthermore, the only relevant information that might be withheld is the existence of the default itself; there is no reason to suppose that majority contributors, as a rule, would be the more legally well-informed parties. (Well-informed lesser contributors would be incentivized to strategically withhold information.) Also, while a proportional default would indeed impose additional fact-finding on courts, creative labor is particularly ill-suited to ex ante bargaining. Any efficiency gain likely to arise from compelling authors to quantify their contributions in advance would therefore be small, and might actually hinder creative production. This consideration may inform, at least in part, the lack of equal split defaults in non-American jurisdictions. See infra notes 142–44. then determining the most appropriate default relies on identifying the preferences of those parties. This can be achieved by looking at what contracting terms those individuals actually agree to, an empirical question.134See, e.g., Richard A. Epstein, In Defense of the Contract at Will, 51 U. Chi. L. Rev. 947, 951 (1984) (selecting a rule “because it reflects the dominant practice in a given class of cases and because that practice is itself regarded as making good sense for the standard transactions it governs”); Stewart J. Schwab, Collective Bargaining and the Coase Theorem, 72 Cornell L. Rev. 245, 286 (1987) (looking to “actual contracts” to determine “which party values the entitlement most highly”); J. Hoult Verkerke, An Empirical Perspective on Indefinite Term Employment Contracts: Resolving the Just Cause Debate, 1995 Wis. L. Rev. 837, 842 (1995) (using contract data to find support for a default rule in “[t]he revealed preferences of market participants”). An objection to modeling default rules on actual contracting behaviors is that, where a well-developed default rule already exists, the universe of contracting decisions is distorted by the existence of the default rule. Cf. Jacob Goldin & Daniel Reck, Revealed-Preference Analysis with Framing Effects, 128 J. Pol. Econ. 2759, 2760 (2020) (describing default rules as a type of framing effect); Cass R. Sunstein, Switching the Default Rule, 77 N.Y.U. L. Rev. 106 (2002) (discussing default rules and the endowment effect). Parties who find the default objectionable will explicitly contract out of it, while those who find it acceptable will leave a gap. See Ayres & Gertner, supra note 133, at 115–16. However, even among its detractors, this market-mimicking approach has been considered desirable in circumstances similar to those surrounding artistic creation in general and songwriting co-authorship in particular—for example, where parties are unaware of the default or lack a contract entirely, as the distorting effect would not be present. See id. at 115 n.122. Additionally, a large majority (75%) of bands maintain their split preferences over the lifetime of the band. See infra Section III.B.1. Moreover, in the case of co-songwriting, this is answerable.135The royalty splitting directions that co-songwriters provide to performance rights organizations (“PROs”) may or may not be backed up by a written contract. PRO registration is not a contract between co-authors, and only needs to be signed by one of a song’s co-authors or their legal agents. I argue that looking at actual allocating behavior is an effective proxy for looking at actual contracting behavior. For this reason, while presumptively all split directives will have incurred negotiating transaction costs, they may or may not also have incurred contracting transaction costs. Unlike many questions governed by default terms in contract law which may have a remote chance of becoming operational, if a song earns any songwriting royalties that the co-songwriters want to collect, they will have to face the question of with whom and how to divide them up.

Co-songwriters create a record of who shares co-authorship of their joint work when they register a song with a performance rights organization (“PRO”).136Registration forms must indicate the identities of the writers, their publishers and their respective royalty shares. See Work Registration Form, Broadcast Music, Inc., https://www.bmi.com/pdfs/work-reg-e.pdf [https://perma.cc/J6H9-CTST]. As registrations must be signed only “by an affiliated writer or an authorized representative of the submitting publisher,” they do not function as contracts between putative co-authors. Id. Similarly, split sheets—internal documents indicating the relative ownership shares of songwriting contributors—have been held not to provide “conclusive evidence of copyright ownership or authorship,” although they have evidentiary value in determining the validity of such a claim. See Montalvo v. LT’s Benjamin Records, Inc., No. CV 12-1568 (GAG), 2015 WL 13815393, at *5 (D.P.R. May 8, 2015). The individuals completing a PRO’s registration form must specify how royalties are to be split between those credited as writers: there is no default split as far as the PROs are concerned. PROs then channel the royalties they collect to a song’s listed co-writers in accordance with that information. The major PRO repertories are public and include nearly the full set of co-authorship crediting decisions made by co-songwriters in the United States.

I am interested in the co-authorship crediting decisions and royalty split choices of collaborations where contributions are uneven. Ideally, I could identify all songs resulting from uneven contributions and see if all contributors, including lesser contributors, are typically credited as co-authors. That information is not attainable. However, in this Article I construct a unique database including songwriting contribution levels for over 1,000 music groups with certified Gold Records—every band with a Gold Record that primarily writes its own songs, from the first certifications to the time of writing (1959–2021) (the “Gold Record bands”). The songwriting process of each band in the Gold Record database is coded for the contribution levels of its band members based on publicly available information.

Because the membership of these most popular bands is well-known, we can determine by looking at their PRO-registered songwriting credits whether lesser contributors are most often counted as co-authors. PRO credits were obtained for bands in the Gold Record database from the song repertories of the American Society of Composers, Authors and Publishers (“ASCAP”) and Broadcast Music, Inc. (“BMI”), which together represent 90% of the U.S. market in public performance rights (Study 1).

If the answer to whether lesser contributors are counted as co-authors is “no,” then the consequence of the control doctrine that lesser contributors are excluded from co-authorship status will have been shown to align with creator preferences in this case. This would weaken at least one key objection to the control doctrine (see Figure 1).

If the answer is “yes,” for which I make the case, the next question is whether, as co-authors, lesser contributors typically share equally in the benefits of co-authorship.137The economic benefits of co-authorship involve the authors’ ability to profit from the copyright on either an exclusive (sale) or nonexclusive (license) basis. Exclusive transfers require the consent of all owners of the copyright; nonexclusive transfers can be executed by any single owner. While the practice of songwriters selling their catalogue is not unknown, the primary means by which they derive economic benefits from their work is through licensing. For this Article, I conducted two studies to investigate this question (Study 2a and Study 2b). In Study 2a, I estimated the royalty splits of a third of the uneven Gold Record bands which include all members as co-authors. First, I compiled statements by the bands themselves or those close to them (managers, for example) disclosing the bands’ royalty split practices. The second method used the repertories of ASCAP and BMI to infer the splits of Gold Record bands. In Study 2b, I used this same methodology to infer the splits of 1.2 million co-written songs.

If the results of Study 2 support that “yes,” typically lesser contributing co-authors do receive an equal split of royalties, then the existing equal split rule is presumptively the most efficient default.

If the analysis in Study 2 suggests that “no,” lesser contributing co-authors typically do not receive an equal split of royalties, then the proposals of the second camp of scholars for revisiting the equal split default ought to be debated further.

Figure 1.  What Is Creators’ Preferred Treatment of Lesser Contributors?

C. Relevance of Empirical Data

Can the transaction cost implications of these studies help guide us between an equal split and proportional split rule? Even if the results of Study 2 suggest that we should retain the equal split default, some might question if there is any added value to empirical preference data over and above existing transaction cost arguments in favor of the equal split. Even if preference data were to show that creators prefer a proportional split, the argument might go: an equal split default would still be most efficient because the transaction costs of a proportional split default are too high. I argue that it seems unlikely that the transaction costs of implementing a proportional default split are as high as some contend. The thought is that a contribution-based rule would entail a considerable fact-finding burden and require jurists to assign percentages based on their subjective appraisals of each co-author’s contributions.138See LaFrance, supra note 76, at 257; Timothy J. McFarlin, An Idea of Authorship: Orson Welles, The War of the Worlds Copyright, and Why We Should Recognize Idea-Contributors as Joint Authors, 66 Case W. Res. L. Rev. 701, 753 n.190 (2016). Unlike in copyright infringement cases, where courts already make such appraisals,139Under one approach to the prevailing substantial similarity standard, finders of fact must “break[] the works ‘down into their constituent elements, and compa[re] those elements for proof of copying.’ ” Swirsky v. Carey, 376 F.3d 841, 845 (9th Cir. 2004) (quoting Rice v. Fox Broad. Co., 148 F.Supp.2d 1029, 1051 (C.D. Cal. 2001)) (describing the extrinsic test), and determine “whether the ordinary reasonable person would find ‘the total concept and feel of the works’ to be substantially similar,” Pasillas v. McDonald’s Corp., 927 F.2d 440, 442 (9th Cir. 1991) (quoting Data East USA, Inc. v. Epyx, Inc., 862 F.2d 204, 208 (9th Cir. 1988)) (describing the intrinsic test). Therefore, in an infringement action, the finder of fact continually makes percentage assignments both formal and informal. See, e.g., Copeland v. Bieber, 789 F.3d 484, 494 (4th Cir. 2015) (discussing the relative importance of the chorus in pop); Newton v. Diamond, 388 F.3d 1189, 1196 (9th Cir. 2004) (no infringement where the sampled portion “is roughly two percent of the four-and-a-half-minute ‘Choir’ sound recording”); Three Boys Music Corp. v. Bolton, 212 F.3d 477, 487 (9th Cir. 2000) (“The jury found that 28% of the album’s profits derived from the song, and that 66% of the song’s profits resulted from infringing elements.”), overruled by Skidmore v. Led Zeppelin, 952 F.3d 1051 (9th Cir. 2020); cf. 17 U.S.C. § 107(3) (fair use determined in part by “the amount and substantiality of the portion used in relation to the copyrighted work as a whole.”). in joint authorship disputes the value of a work is not necessarily known.140See McFarlin, supra note 65, at 490–91. This position rests on evidentiary assumptions that are not compelling. Assigning joint authors uneven shares in a song involves approximate and imprecise appraisals.

Yet those aspects of making proportional attributions would not be unique. Courts are accustomed to carving up responsibility in contexts of nebulous causality to arrive at liability determinations in common law torts.141See Restatement (Third) of Torts: Apportionment Liab. § 8 (Am. L. Inst. 2000). English courts have awarded proportional shares in cases where authorship was found in the absence of contract and contributions were unequal,142See Fisher v. Brooker, [2006] EWHC (Ch) 3239 (Eng.) (awarding a 40% share to a co-author of a copyrighted song, although Fisher had argued for a 50% share); Bamgboye v. Reed [2002] EWHC (QB) 2922 (Eng.) (awarding a one-third share). and ownership is proportional by default in Germany.143Gesetz über das Urheberrecht und verwandte Schutzrechte (Urheberrechtsgesetz) [UrhG] [Act on Copyright and Related Rights], Sep. 9, 1965, as amended, Art. 8(3) (Ger.), https://www.gesetze-im-internet.de/englisch_urhg/englisch_urhg.html [https://perma.cc/M33C-5Q42] (“Proceeds derived from the use of the work are due to the joint authors in accordance with the extent of their involvement in the creation of the work, unless otherwise agreed between the joint authors.”). If there are reasons to think American jurists would face unique obstacles to determining proportional ownership shares, they have yet to be raised. In general, while degrees of interdependence vary across collaborative contexts, almost as a rule people succeed in translating comparative contributions into pay differences within a tolerated margin of error. If most collaborators must contract out of the default regime because an equal split does not match their royalty distribution preferences, then on the whole a proportional split default rule could be more efficient. The pre-existing arguments against the proportional split and in favor of the equal split are weak.144Another efficiency-based argument in favor of the existing equal split rule is that a proportional split rule would increase how often joint authorship is litigated, because co-authors would self-servingly bring claims arguing they are entitled to larger shares than their co-authors will acknowledge. However, this seems unlikely. See Mandel, supra note 76, at 356. (“Equitable apportionment would reduce the stakes of expected outcomes from litigation, which would be expected both to reduce litigation and to increase the rate of settlement of any litigation that is initiated.”). These arguments do not fare well if you believe the central preference contention of the advocates of a contributions-based split. The aggregate transaction costs of contracting out of the default rule are the most significant potential drag on efficiency. For this reason, it is important to know both if creators—in this article particularly songwriters—think lesser contributors deserve to be co-authors, and as co-authors, how much they think they deserve.

III.  WHO IS A CO-AUTHOR?

A. Study 1: Are Lesser Contributors Considered Co-authors?

1. Methodology

When songwriters collaborate to write a song, but one of them contributes more than the other, will they all still be credited as co-authors? To answer this question, I focused on a large group of songwriting collaborators: music groups that write their own songs and have one or more Gold Records. As popular music groups, they are often on record about which of their members contribute to writing their songs and how much of a contribution145Importantly, they are a non-arbitrarily defined set of music groups. Descriptions of the co-songwriting processes for songs by Gold Record bands are broadly similar to those of co-songwriting by less successful bands and in non-band collaborations; on this basis, comparable shares of songs written through even and uneven contributions are expected between the Gold Record bands and co-songwriting more generally. Sufficient information was found to code 96.16% of all Gold Record bands that primarily release songs written by band members. The percentage of results excluded for missing data is 3.84%, because either there was insufficient information available for songwriting process coding or writing credit information was absent or uninterpretable. This level of missing data has been characterized as inconsequential. See Yiran Dong & Chao-Ying Joanne Peng, Principled Missing Data Methods for Researchers, 2 SpringerPlus 222, 223 (2013) (citing Joseph L. Schafer, Multiple Imputation: A Primer, 8 Stat. Methods Med. Res. 3, 7 (1999)). those members make to writing their songs.146See supra text accompanying note 55 for general copyrightability considerations. See infra notes 159–64 and accompanying text for the analysis applied to data in this study. The 1,003 bands in this study include Gold Record awardees from across the full sixty years (1959–2021) during which Gold Records have been awarded147Gold & Platinum, RIAA, https://www.riaa.com/gold-platinum/ [https://perma.cc/8SHS-2YRF]. by the Recording Industry Association of America (“RIAA”).148The search of the RIAA Gold & Platinum database for Group and Duo artists resulted in an initial list of 1,669 group and duo performances, many of which were not by bands. The study excluded 666 search results. Search results were excluded for non-band group performances as well as bands mostly performing songs not written by band members (19.47% of results; for example, Mormon Tabernacle Choir or folk groups performing songs in the public domain), backing bands (4.55% of results; for example, Dave Matthews Band), and bands for which insufficient information was available concerning songwriting process or song credits (3.84% of results). The RIAA certifies albums149RIAA and GR&F Certification Audit Requirements: RIAA Album Award, RIAA (Mar. 2021), https://
http://www.riaa.com/wp-content/uploads/2021/03/ALBUM-AWARD-RIAA-AND-GRF-CERTIFICATION-AUDIT-REQUIREMENTS.pdf [https://perma.cc/R3LS-VZVM]. Gold certification indicates sales of 500,000 and Platinum of 1,000,000 units. A unit is defined as a physical or digital album sale, 10 permanent track downloads, 1,500 on-demand streams, or some combination of the above. Prior to 1975, Gold certification required $1 million in wholesale sales, with no unit sales requirement. See Adam White, The Billboard Book of Gold & Platinum Records viii (1990).
and singles150RIAA and GR&F Certification Audit Requirements: RIAA Digital Single Award, RIAA
(Feb. 2016), https://www.riaa.com/wp-content/uploads/2016/02/DIGITAL-SINGLE-AWARD-RIAA-AND-GRF-CERTIFICATION-AUDIT-REQUIREMENTS.pdf [https://perma.cc/M7RQ-ZXM9]. Gold certification indicates sales of 500,000 and Platinum of 1,000,000 units. A unit is defined as a permanent digital download, 150 on-demand streams, or some combination of the two. (Physical singles sales are now largely nonexistent.) Between 1976 and 1989, Platinum certification indicated sales of 2 million units. See White, supra note 149, at viii.
based on sales.

The Gold Record bands have previously discussed their songwriting processes in numerous interviews.151The sources consulted were primarily interviews, biographies and feature articles covering a band’s songwriting process and the degree of band members’ involvement. Such interviews were the principal basis for identifying bands in which all the members contribute to songwriting but do not contribute evenly (uneven contributions bands).152To classify bands, I hired outside coders to compile and review publicly available sources of information on the songwriting processes of the Gold Record bands and to code them according to the songwriting contributions protocol. The reliability of source material for each band was classified by a coder as very strong, strong, satisfactory, or insufficient information (these bands were excluded). Source reliability was very strong when the code was based primarily on unambiguous direct interview quotes from the band members. Source reliability was strong when third-party quotes were drawn from mainstream or music-focused publications and clearly delineated the songwriting process. Satisfactory reliability was given to codes based primarily on tertiary or amateur sources. Overall, source reliability was very strong or strong for 78% of the bands and 22% satisfactory. Multiple sources support the coding of 87% of bands, and source reliability was very strong or strong a majority of the time (79%) when coding was based on a single source. An independent coder coded an overlap of 10% of the bands (100). These bands occupied a middle ground between, on the one hand, groups in which all members made more or less even contributions to songwriting (even contributions bands), and, on the other hand, bands in which some members did not contribute to songwriting at all (some members do not contribute bands).153Band members’ contributions to songwriting were assessed on the basis of the writing process for individual songs, rather than the band’s overall song output. This means bands such as Queen, in which all band members contributed solo written songs to the group’s output more or less evenly, were coded as some members do not contribute, rather than as even contributions. While this approach has the potential to cloud the interpretation of co-authorship crediting, in practice the codes assigned at the per song level and overall output level converged 99% of the time. The research and coding processes were highly labor intensive, taking several hundred hours over which thousands of sources were screened and compiled.154First, the coders collected and coded data on a subset of the Gold Record bands. I fine-tuned the coding protocol, then all bands were re-coded. The design of the coding process incorporated guidelines, derived from Klaus Krippendorff and Kimberly Neuendorf, presented in Mark A. Hall & Ronald F. Wright, Systematic Content Analysis of Judicial Opinions, 96 Cal. L. Rev. 63, 107–17 (2008). Interrater reliability was measured as 84.5% using Krippendorff’s alpha; percent agreement was 91%. Krippendorff’s alpha is a standard measure of agreement between multiple coders. See Klaus Krippendorff, Content Analysis: An Introduction to Its Methodology 221–22 (2d ed. 2004).

In addition, unless all of a band’s members made independently copyrightable contributions to songwriting, the band was classified as one in which some members do not contribute.155The footnoted sentences in this paragraph which follow are reproduced from Sarah Polcz, supra note 7, which relies upon the same dataset. The copyrightability of songwriting contributions was assumed when quotes labeled band members as songwriters or confirmed members’ involvement in songwriting in general terms.156Uncontradicted assumptions were informed by genre norms; for example, rappers were taken to be delivering their own verses, and members of electronic dance music (“EDM”), rap and hip-hop groups described as “producers,” “programmers,” or “beat makers” were assumed to be making copyrightable musical contributions. See Tonya M. Evans, Sampling, Looping, and Mashing . . . Oh My!: How Hip Hop Music Is Scratching More than the Surface of Copyright Law, 21 Fordham Intell. Prop. Media & Ent. L.J. 843, 852–53 (2011); Chris Robley, Should My Producer Get Publishing and Songwriting Credit?, DIY Musician (July 11, 2018), https://diymusician.cdbaby.com/music-rights/does-my-producer-deserve-publishing-and-songwriting-credit [https://perma.cc/MF8P-DLS3]. If any band members were described as making only contributions to songs that are not legally considered songwriting—such as arrangement, suggestions or feedback—then their bands were coded as some members do not contribute. To distinguish between uneven and even contributions by band members, industry norms, where existing, supplied assumptions; for example, lyrics were weighted as comprising half of the song. See Daniel Abowd, FRE-Bird: An Evidentiary Tale of Two Colliding Copyrights, 30 Fordham Intell. Prop. Media & Ent. L.J. 1311, 1329 (2020). Since compositions may be the product of jam sessions or studio experimentation, a sound recording may represent the fixed form of the composition. See Bridgeport Music, Inc. v. UMG Recordings, Inc., 585 F.3d, 267, 276 (6th Cir. 2009); Robert Brauneis, Musical Work Copyright for the Era of Digital Sound Technology: Looking Beyond Composition and Performance, 17 Tul. J. Tech. & Intell. Prop. 1, 28 (2014) (“By 2012, 77% of musical work registrations were accompanied by phonorecord deposits and only 17% by deposits of musical notation . . . .”). With this in mind, coders were instructed to regard contributions as “arrangements”—contributions to the sound recording rather than the music composition—only when band members clearly described them as such, with the understanding that the interviewee was aware of the distinction. These represent the categories of contribution ruled not to be protectible under the Childress standard. See Erickson v. Trinity Theatre, Inc., 13 F.3d 1061, 1068 (7th Cir. 1994); Childress v. Taylor, 945 F.2d 500, 509 (2d Cir. 1991); BTE v. Bonnecaze, 43 F. Supp. 2d 619, 623 (E.D. La. 1999) (holding no joint authorship when a musician contributes unfixed “ideas and helpful insights”). Interviewees discussed a variety of contributions, some copyrightable (whether to the composition or to the sound recording) and others likely not. When members’ particular contributions were described,157See, e.g., Tim Louie, An Interview with Sixx:A.M.: Returning with Their Own Prayers for the Damned, Aquarian (May 18, 2016), https://www.theaquarian.com/2016/05/18/an-interview-with-sixxa-m-returning-with-their-own-prayers-for-the-damned/ [https://perma.cc/2L9S-U4WU] (“It’s the three of us getting together in a room picking up instruments and talking. We talk a lot before we even start writing, discussing subject matters, and working through melody ideas, working through riff ideas and we all bring in ideas.”). case law (interpreting the Copyright Act) was the primary basis for assessing their copyrightability.158Additionally, the United States Copyright Office (“USCO”) was a source for the concept that a musical work consists of four copyrightable elements: melody, rhythm, harmony, and lyrics. U.S. Copyright Off., Compendium of U.S. Copyright Office Practices § 802.3 (3d ed. 2021). Band member contributions which included the elements of a musical work—lyrics,159Individual words and short phrases are typically denied copyright protection. Nimmer & Nimmer, supra note 30, § 2.01[B][3]. However, this general rule may not be applicable in a songwriting context. See Goldstein, supra note 9, § 2.8, at 2:102–2:102.1 (“[T]he Act’s inclusion of ‘accompanying words’ in its reference to musical works means that musical and lyrical elements that by themselves would not be sufficiently original and expressive to qualify for copyright may combine with each other to produce a copyrightable work.”) Courts have been willing to consider the copyrightability of lyrics that would not reach the originality threshold if published as a literary work. See, e.g., May v. Sony Music Ent., 399 F. Supp. 3d 169 (S.D.N.Y. 2019) (refusing to dismiss an infringement claim based on the lyric “We run things. Things no run we.”). It is unlikely that band members would be described as lyricists, lyric writers or lyrical contributors if their only contributions failed to meet this threshold of originality. melody,160Goldstein wrote,

Melody in a musical composition consists of a succession of notes, as well as the long and short durations of individual notes, organized around the composition’s rhythm. Because melody is so salient, and is relatively unconstrained by musical convention, it is typically the principal vessel of originality in musical compositions.

Goldstein, supra note 9, § 2.8, at 2:102.1–2. harmony,161“Harmony gives depth to a musical composition. It might consist of two or more voices, separated by a constant span of notes, simultaneously singing the melody, or it might consist of chords—the simultaneous sounding of individual notes—harmoniously connected to each other and to the composition’s melody.” Goldstein, supra note 9, § 2.8, at 2:102.2. See also Williams v. Gaye, 895 F.3d 1106 (9th Cir. 2018) (Blurred Lines Case) (finding no reversible error in an infringement verdict based substantially on rhythmic and harmonic elements). Harmonic elements appeared in the coding in the form of chords and chord progressions. and rhythm162“Rhythm is the physical element of music, the steady beat that sets a listener’s fingers tapping. Although rhythm can be varied, the dictates of musical convention will typically constrain variety. As a result, courts rarely find originality in rhythm alone.” Goldstein, supra note 9, § 2.8, at 2:102.1–2; see also Bridgeport Music, Inc. v. UMG Recordings, Inc., 585 F.3d 267 (6th Cir. 2009) (rhythmic elements copyrightable); New Old Music Grp., Inc. v. Gottwald, 122 F. Supp. 3d 78 (S.D.N.Y. 2015) (drum part copyrightable); BMS Ent./Heat Music LLC v. Bridges, No. 04 CIV. 2584 (PKC), 2005 WL 1593013, at *1 (S.D.N.Y. July 7, 2005) (rhythmic elements copyrightable); Santrayll v. Burrell, No. 91 CIV. 3166 (PKL), 1996 WL 134803, at *1 (S.D.N.Y. Mar. 25, 1996) (rhythmic elements copyrightable). Rhythmic songwriting elements often appeared in the coding in the form of drum parts, basslines, and beats.—were assumed to be sufficiently original163The originality (and thus copyrightability) of the type of contribution is discussed supra. The minimum quantity of contribution also required consideration. A recent case offers the guideline (in dicta) that this is certainly more than three or four notes, but perhaps as few as seven. See Skidmore v. Zeppelin, 952 F.3d 1051, 1071 (9th Cir.) (en banc). In practice, sources did not reach this degree of specificity. See also U.S. Copyright Off., supra note 158, § 802.5(B) (“There is no predetermined number of notes, measures, or words that automatically constitutes de minimis authorship or automatically qualifies a work for copyright registration.”). and treated as copyrightable.164The copyrightable expression in a musical composition is typically found in its melody, harmony, rhythm or some combination of the three. See Goldstein, supra note 9, § 2.8, at 2:102.1–102.3.103; Nimmer & Nimmer, supra note 30, § 2.05[D]; see also 2 William F. Patry, Patry On Copyright, § 3:93 (“Originality in a musical composition consists not just of melody or harmony, but also in the combination of these two in addition to any other elements, such as rhythm or orchestration.”). While melody was long privileged as the sole source of copyrightable expression in musical compositions, courts have sometimes—and perhaps increasingly—been willing to find other aspects of the work copyrightable. See Joseph P. Fishman, Music as a Matter of Law, 131 Harv. L. Rev. 1861, 1870–73 (2018). Joint authorship cases concerning songwriting are typically decided on intent and rarely reach the question of copyrightability. Most discussion of the copyrightability of song elements has therefore arisen out of an infringement context. Infringement cases in music, involving highly fact-specific determinations, have understandably not produced a list of copyrightable and uncopyrightable elements that can be applied mechanically: the most that can be said is that certain elements may (or may not) be copyrightable. Furthermore, the infringement analysis does not itself determine copyrightability. In some instances, infringement has been found on the basis of elements that may not themselves be independently copyrightable. See Swirsky v. Carey, 376 F.3d 841, 848 (9th Cir. 2004) (“[T]o disregard chord progression, key, tempo, rhythm, and genre is to ignore the fact that a substantial similarity can be found in a combination of elements, even if those elements are individually unprotected.”); Three Boys Music Corp. v. Bolton, 212 F.3d 477, 485–86 (9th Cir. 2000). Most bands were classified as some members do not contribute(555/1,003, or 55%), followed by uneven (258/1,003, or 26%); it was least common for all members to contribute evenly to songwriting (190/1,003, or 19%).

Next, I assessed whether uneven bands credited all members as co-authors of their songs. A song’s writers are listed in several places: liner notes, on PRO registrations, and in United States Copyright Office (“USCO”) registrations. I consulted PRO registrations and validated that they correspond to USCO registrations.165Courts have not recognized PRO registrations as evidence of authorship, but USCO registrations constitute “prima facie evidence of the validity of the copyright and the facts stated in the certificate of registration.” U.S. Copyright Off., supra note 158, § 202. Therefore, USCO registrations were searched and compiled for ten songs by all bands with uneven contributions that include all members as co-authors. The credited writers in the PRO registrations match listed co-authors in USCO registrations 99% of the time. Also, 37% of the songs registered with PROs were not registered in the USCO database. See Zvi S. Rosen & Richard Schwinn, An Empirical Study of 225 Years of Copyright Registrations, 94 Tul. L. Rev. 1003, 1030 (2020) (noting that, over the course of the past thirty years, music registrations with USCO “f[e]ll off a cliff to levels not seen since the 1930s”). A band was classified as one in which all members are co-authors (true/false)166This relied on assembling, per song, the number of band members and the number of co-authors credited. if, for a majority of its songs, all the members of the band in the year the song was released were credited as writers.167The number of members with writing credit was compared to the number of members the band had in the year each song was released. If the number of member co-authors was less, the song was coded as false (per song; variable used only in computing per band level all members are co-authors), otherwise as true (per song). The total number of true songs was counted and compared to the total number of the band’s songs. When more than 50% of a band’s songs were credited to all members of the band in the year the song was released, the band was coded as true for the variable all members are co-authors. To determine the number of members in each band at the time their songs were released, discographies including year of release information were obtained from AllMusic, https://www.allmusic.com/ [https://perma.cc/4GVX-XYGQ]. Discography data was web-scraped. When AllMusic did not provide details on the year in which a song was released, a web search was conducted. After this, several sources were reviewed to find the number of band members in each year songs were released: band members were listed on AllMusic, Wikipedia, band websites, in liner notes, and often named in interviews. Touring and session musicians were not counted as band members. The names of the writers credited with the songs of bands in the Gold Record database were obtained from the online repertories of ASCAP and BMI. For 38% of the 1,003 bands in the study, it was true that all members are co-authors.

Other factors beyond writing contribution may influence whether or not lesser contributors receive co-authorship credit. For this reason, I collected data on a number of factors. I designated the lowest number of members the band had during its active years as a representative band size168The size of a band’s membership could influence how willing members are to include all members as co-authors. Particularly for uneven contributions to songwriting bands, as group size grows, so does the possible economic penalty for including all members as co-authors. To investigate whether the number of members in a band predicts the inclusion of lesser contributors as co-authors, representative band size was generated for each band (Two (21%); Three (22%); Four (35%); Five (16%); Six+ (6.1%)). For bands with variable numbers of members, the lowest number of members during the band’s active years was used. More than 90% of music groups had five or fewer members. The most common band size was four members (349/1,003, or 35%). and identified each band’s genre169Genre data was obtained from AllMusic’s “genres” listing for each band. Music Genres, AllMusic, https://www.allmusic.com/genres [https://perma.cc/KN52-AVL9]. AllMusic’s proliferation of subgenres (over 120) is highly useful for capturing subtle commonalities across the site’s more than 30 million tracks. However, this subgenre classification scheme is too granular for the size of this study’s dataset. At the same time, AllMusic’s twenty-one higher level genre classifications potentially collapse meaningful differences within the study sample of Gold Record bands (for instance, by combining Pop and Rock into a single genre). For this reason, I decided to group together the bands’ AllMusic subgenres into the following nine genre categories: Rock (48%); Latin (2.9%); Country (4.4%); Metal (6.5%); Punk (2.1%); Pop (15%); Reggae (0.8%); Hip Hop, R&B, Gospel, Jazz (19%); and Electronic (1.7%). AllMusic frequently associates artists with multiple subgenres. In the event of a band’s multiple subgenre classifications corresponding to more than one of the study genre groups, the band was assigned to the study genre group with fewer observations. of music, region of origin,170Geographic regions are sometimes thought to vary in terms of attitudes that could relate to decisions about including lesser contributors as co-authors (for example, Southern communalism or coastal capitalism). For this reason, the geographic regions of bands from the United States and its territories were coded according to the location where the band was started. Location data was obtained from Wikipedia, which was then classified into regions using the boundaries of the U.S. Divisions and Regions of the U.S. Census Bureau, widely used regional divisions for statistics and data collection: Northeast (17%); Midwest (8.4%); West (26%); and South (20%). U.S. Census Bureau, Geography Div., Census Regions and Divisions of the United States, https://www2.census.gov/
geo/pdfs/maps-data/maps/reference/us_regdiv.pdf [https://perma.cc/FD3M-3LUY]. As the U.S. Census Bureau does not include Puerto Rico in any census region, bands from Puerto Rico were classified as South. All bands originating outside the United States and its territories were classified as Non-USA (28%).
and decade171In keeping with the common practice of organizing discussions of the history of popular music around particular decades, bands were assigned to a period spanning ten years according to the year in which their first album was released. The years in which albums were released was obtained from each band’s profile on AllMusic. The decade classifications used are 1960s and earlier (8.9%); 1970s (13%); 1980s (23%); 1990s (29%); 2000s (18%); and 2010s and later (8.6%). In general, no region or decade dominated group genesis, though there were comparatively fewer music groups prior to 1980. of formation (see Table 1). I also evaluated whether co-author inclusion changed over time.172This was analyzed as a true/false variable. It relied on assembling a year-specific version for all members are co-authors variable for the first and last year of a band’s existence. Bands might initially decide to include all members as co-authors, or not, but change their co-author inclusion practices in subsequent years. To track the potential for this occurrence, an all members are co-authors by year (true/false) variable was produced for each year the band has released songs. Bands were then categorized as initial year co-author inclusion (true/false), based on the value of all members are co-authors by year, in the year of their first release. The initial year co-author inclusion code for each band was compared against the all members are co-authors by year codes for each release year. If there were any occurrences differing from the bands’ initial year co-author inclusion code, the band was code as true for co-author inclusion changed, and otherwise as false. Further, 75% of bands were false for co-author inclusion changed.

Table 1.  Descriptive Statistics for Gold Record Bands 1959–2021

Variable

N = 1003

All members are co-authors

379 (38%)

Songwriting

 

Even

190 (19%)

Uneven

258 (26%)

Some members do not contribute

555 (55%)

Decade

 

1960s and earlier

89 (8.9%)

1970s

129 (13%)

1980s

234 (23%)

1990s

288 (29%)

2000s

177 (18%)

2010s and later

86 (8.6%)

Members

 

2

215 (21%)

3

220 (22%)

4

349 (35%)

5

158 (16%)

6+

61 (6.1%)

Genre

 

Rock

478 (48%)

Hip Hop, R&B, Gospel, Jazz

190 (19%)

Pop

151 (15%)

Metal

65 (6.5%)

Country

44 (4.4%)

Latin

29 (2.9%)

Punk

21 (2.1%)

Electronic

17 (1.7%)

Reggae

8 (0.8%)

 

Region

 

Northeast

170 (17%)

Midwest

84 (8.4%)

West

265 (26%)

South

204 (20%)

Non-USA

280 (28%)

2. Results

From 1959 to 2021, uneven bands typically included all members as co-authors (140/258 or 54%). This tendency strengthened over time, specifically throughout each of the last three decades, even while accounting for genre, region of origin, and representative band size (p < 0.01).173First, for all Gold Record bands with any level of band member songwriting contributions, I used logistic regression analysis to investigate whether all members are co-authors was associated with songwriting contributions, genre, region, decade, or representative band size. The reference group for decade was “1980s.” Songwriting contributions were the most important predictors of all members are co-authors. Compared to bands in which some members do not contribute to songwriting, bands with even (or 2.9, p<0.001) and uneven (or 1.9, p<0.001) were significantly more likely to include all members as song co-authors. Significant associations were also found with representative band size, decade, and genre, but not region (p<0.001). Representative band size: Compared to four-member groups, two-member groups were significantly more likely to include all members as co-authors (p<0.001); five-member groups (p<0.05) and groups with six or more members (p<0.005) were significantly less likely to do so. For “decade,” compared to the 1980s, groups formed in the 1960s or earlier were significantly less likely (p<0.05) to include all members as co-authors, whereas this was less common with bands formed in the 1990s (p<0.05), 2000s (p<0.005), and 2010s and later (p<0.01). Genre: Compared to Pop groups, Latin (p<0.05) and Hip Hop, R&B, Gospel, and Jazz (p<0.05) groups were significantly less likely to include all members as co-authors. For the subgroup of uneven songwriting contributions bands, to a lesser extent, a band size of two members was also associated with all members included as co-authors (p<0.05). Most uneven bands with fewer than five members include all members as co-authors.174Z-test of one proportion, p<0.05. Among bands with five or more members, a considerable majority (66%) have some members who do not contribute to songwriting, and there were too few even or uneven contribution bands of that size to investigate co-author inclusion rates and whether copyrightable contributions by all members suffice for co-authorship credit or whether some other factor, perhaps control, is required.175Such analysis would be underpowered.

To pursue the trend of increasing co-author inclusion over time, I focused on uneven bands formed after 1990 (n=175). A significant majority of these groups credit lesser contributors as co-authors (111/175, or 63%).176Z-test of one proportion, p<0.001, 95% CI 56%–71%. This preference was less common with larger bands, but otherwise was not associated with other factors.177I used logistic regression to investigate associations with representative band size, genre, and region. The only significant association was with larger band sizes, which were negatively associated with all members are co-authors (five members p<0.05, or -1.1; six or more members p=0.054, or -1.8). Overall, bands with five or more members were significantly more likely to be coded as some members do not contribute (145/219, or 66%) than bands with four or fewer members (410/784, or 52%) (z-test of two proportions p<0.001).

Table 2.  Logistic Regression for Co-Author Inclusion: UnevenContributions Bands 1959–2021

Variable

log(OR)

95% CI

p-value

Decade (ref. cat.: 1980s)

     

1960s and earlier

-0.30

-1.6, 0.92

0.6

1970s

-0.12

-1.3, 1.0

0.8

1990s

1.2

0.35, 2.1

0.007*

2000s

1.2

0.34, 2.1

0.007*

2010s and later

1.5

0.39, 2.6

0.009*

Members (ref. cat.: 4)

     

2

1.2

0.21, 2.3

0.022*

3

-0.04

-0.81, 0.73

>0.9

5

-0.73

-1.5, 0.05

0.067

6+

-1.4

-3.0, 0.05

0.075

Genre (ref. cat.: Pop)

     

Rock

0.61

-0.25, 1.5

0.2

Hip Hop, R&B, Gospel, Jazz

-0.80

-1.9, 0.27

0.15

Country

-1.7

-3.6, 0.01

0.060

Metal

-0.09

-1.3, 1.1

0.9

Punk

-0.47

-2.7, 1.7

0.7

Electronic

-1.4

-3.4, 0.57

0.2

Reggae

13

-167, NA

>0.9

 

Region (ref. cat.: Northeast)

     

Midwest

-0.51

-1.9, 0.82

0.5

West

-0.07

-0.91, 0.77

0.9

South

0.36

-0.57, 1.3

0.5

Non-USA

0.11

-0.80, 1.0

0.8

Notes: *p < 0.05; **p < 0.005.

Table 3.  Logistic Regression for Co-author Inclusion: UnevenContributions Bands 1990–2021

Variable

log(OR)

95% CI

p-value

Songwriting (ref. cat.: Some members do not contribute)

     

Even

2.9

2.4, 3.5

<0.001**

Uneven

1.8

1.4, 2.1

<0.001**

Decade (ref. cat.: 1980s)

     

1960s and earlier

-0.94

-1.7, -0.20

0.016*

1970s

-0.33

-1.0, 0.27

0.3

1990s

0.50

0.06, 1.0

0.026*

2000s

0.74

0.23, 1.2

0.004**

2010s and later

0.90

0.23, 1.6

0.009*

Members (ref. cat.: 4)

     

2

0.91

0.43, 1.4

<0.001**

3

0.02

-0.42, 0.45

>0.9

5

-0.61

-1.1, -0.11

0.017*

6+

-1.5

-2.6, -0.56

0.003**

Genre (ref. cat.: Pop)

     

Rock

0.37

-0.14, 0.89

0.2

Hip Hop, R&B, Gospel, Jazz

-0.79

-1.4, -0.16

0.014*

Country

-1.0

-2.0, -0.09

0.034*

Metal

0.27

-0.52, 1.0

0.5

Latin

-1.5

-3.0, -0.18

0.032*

Punk

0.87

-0.29, 2.0

0.13

Electronic

-0.50

-1.8, 0.90

0.5

Reggae

1.0

-1.6, 4.1

0.5

Region (ref. cat.: Northeast)

     

Midwest

-0.68

-1.4, 0.04

0.069*

West

0.01

-0.49, 0.51

>0.9

South

0.11

-0.43, 0.65

0.7

Non-USA

0.26

-0.26, 0.78

0.3

Notes: *p < 0.05; **p < 0.005.

             

A significant majority of bands (75%) did not deviate from the practice of crediting or not crediting all members as co-authors which they had established in the year of their first release.178Z-test of one proportion, p < 0.001. Of those 25% of bands that switched credit practices, most of those that changed did so to credit all members as co-authors rather than vice versa.179Test of two proportions, p < 0.001. To note, 20% of bands which began by including all members as co-authors at some point switched to not including all members as co-authors; 36% of bands which began by not including all members as co-authors at some point switched to including all members as co-authors.

3. Open Questions

Study 1 investigated co-author inclusion practices, but the data did not contain details on how songwriting royalties are allocated between the band members who are credited as co-authors.

If creators do consider it unfair for lesser contributors to receive equal co-authorship benefits, then we might expect that even though bands’ lesser contributors are most often treated as co-authors, they receive a smaller share of songwriting royalties than their co-authors who made the most significant contributions. This would be the practice predicted by the scholars advocating for a proportional split default. Study 2 was designed to test this hypothesis.

IV. DO LESSER CO-AUTHORS RECEIVE LESS ROYALTIES?

The scholars arguing for a proportional default split would predict that when collaborators’ contributions to a joint work are unequal, they prefer an unequal royalty split. Study 2 tests this hypothesis not only for music groups, but for co-written songs in general. Study 2a investigates the songwriting royalty percentage splits of the uneven contributions bands in the Gold Record database using two different approaches. Study 2b adapts one of these approaches to estimate the proportion of uneven co-authored songs in general, using co-written songs in the ASCAP repertory.

A. Study 2A: Gold Record Bands Royalty Splits

Study 2a compiles uneven Gold Record bands’ and their managers’ disclosures about their royalty splits, along with statements about their splits from other reliable sources. If the proportional preference hypothesis is correct, the results should indicate that when co-authors’ contributions are uneven, they prefer to split songwriting royalties unevenly as well.

1. Methodology

First, a coder180Despite the success of machine learning techniques for classifying image data, machine learning techniques have been less successful with music genre classification and human coding continues to generate more satisfactory results. Email from Joel Shor, Senior Software Engineer, Google Research, to Sarah Polcz (May 29, 2019) (on file with author). searched for information on the songwriting royalty split practices of every uneven band in the Gold Record database that includes all members as co-authors (n=140). Split information was found for 21% of these bands that met a standard of “reasonably certain” source reliability (29/140); on this basis their splits were classified as equal or unequal.181An equal split variable was created for which bands were coded as true/false. Data was obtained through the online searches only for a subset of uneven bands, and it is not clear whether the bands whose splits are not public is data that is missing at random.

Second, to gain further insight into how uneven bands split songwriting royalties, I consulted the ASCAP repertory. In 2015, ASCAP began to disclose the percentages of song royalties under the control of its repertory,182This terminology is used by ASCAP to refer to the aggregate percentage of a song’s royalties administered by ASCAP on behalf of its members and/or members of affiliated non-U.S. performance rights organizations. See ASCAP Reveals Its Percentage Share of Over 10M Songs Online, Music Bus. Worldwide (Nov. 12, 2015), https://www.musicbusinessworldwide.com/ascap-reveals-its-percentage-share-of-over-10m-songs-online/ [https://perma.cc/7PNF-SU5Z]. likely in preparation for an antitrust review by the U.S. Department of Justice.183See U.S. Dept. Just., Statement of the Department of Justice on the Closing of the Antitrust Division’s Review of the ASCAP and BMI Consent Decrees 2 (Aug. 4, 2016), https://www.justice.gov/atr/file/882101/download [https://perma.cc/7XW6-CE7D]. (The review recommended, in part, that ASCAP and BMI should be required to license songs on a “100%” basis, rather than only the percentage under their control. Strongly opposed by the PROs, this proposal was ultimately defeated.) See Songwriters Win in Second Circuit Ruling on 100% Licensing Decision, ASCAP (Dec. 19, 2017), https://www.ascap.com/news-events/articles/2017/12/songwriters-win-in-second-circuit-ruling [https://perma.cc/7RT6-QQCF]. ASCAP is the only PRO that currently discloses the writers’ share of royalties it controls.184Broad. Music, Inc. (BMI), https://bmi.com [https://perma.cc/HB3A-AF8T] and Glob. Music Rts. (GMR), https://globalmusicrights.com [https://perma.cc/HGR6-HEVQ], list each PRO’s percentage of control over a work’s total public performance right, without specifying the extent to which this percentage is derived from the writer’s and/or publisher’s share. SESAC, https://www.sesac.com/ [https://perma.cc/FQ4W-F2DZ], does not include any control data in its publicly available repertory. SESAC Repertory, Sesac (2022), https://www.sesac.com/documents/SESAC_REPERTORY.pdf [https://perma.cc/B3DP-S8TN]. I used ASCAP’s disclosures on the proportion of royalties under its control to code each song by the uneven bands as either split equally, or not.185Songs were coded as true/false for the equal split variable using this information. A coder searched the ASCAP repertory for songs by the uneven bands that include all members as co-authors (n=140) and with at least one member on whose behalf ASCAP does not collect royalties.186The data obtained included song titles, credited writers, writers’ PRO memberships, and the proportion of each song’s royalties controlled by ASCAP. The coder then determined whether or not each credited writer was a member of the band and referenced the number of members in each band in the year each song was released. For each song the total number of band members credited as writers was tallied. Twenty-two bands (22/140, or 16%) were identified for which split data could be inferred.

The basis for inferring how a song’s royalties are split involved dividing the total writer’s share by the number of credited writers for the song, then evaluating whether the percentage due to each writer under an equal split arrangement was a factor of the percent controlled by ASCAP. If so, the song was classified as split equally; otherwise, the song was classified as not split equally.187The range of comparison in non-integer cases was expanded to allow for share allocations that round up or down by one. For example, for a song with two writers, if ASCAP controls 50% of the writer’s share of royalties, it follows that the non-ASCAP writer is receiving the remaining 50%.188Traditionally, income from music publishing is evenly divided between the songwriter or songwriters (“writer’s share”) and the publisher or publishers (“publisher’s share”). PROs forward 50% of the publishing income from a song, less fees, to the writers and the other 50% to the publishers, representing 100% of the income earned. The stated purpose of conveying the writer’s share directly to the writers, rather than having it distributed to the writers by the publishers, is to prevent unscrupulous behavior on the part of the publishers. See Passman, supra note 114, at 242. ASCAP lists 100% of the writer’s share of public performance royalty as 50%, as it is half (along with the 50% publisher’s share) of the whole public performance right. On the other hand, if ASCAP controls more or less than 50%, the writers are splitting unequally.189If a song has three writers, one of whom is a member of ASCAP, which controls 33.3% of the writer’s share of royalties, then the song was classified as equally split, since under an equal split arrangement between the three writers, each writer would be entitled to a 33.3% share.

2. Results

The two approaches for identifying royalty splits used in Study 2a—compiling the splits of bands that have publicly disclosed their arrangements and inferring splits using the ASCAP repertory—converge in suggesting that approximately 80% split equally. Both methods suggest a majority190Z-test of one proportion (> 50%). of uneven bands nevertheless split royalties equally: the results of the online searches found that 83% (24/29) of disclosed splits were equal among band members; the PRO data suggests that 77% (17/22) of the bands split equally.

Using these two strategies, Study 2a was able to account for 33% of the uneven Gold Record bands that include all members as co-authors (46/140). The two methodologies produced results for different bands, with the exception of six bands for which results were found both ways. For these six bands, the different methodologies led to the same assigned split. These results indicate that most lesser contributors receive an equal split in the context of music groups. Study 2b investigates whether this is also true of co-written songs more generally.

B. Study 2b: All Co-authored Songs

To go beyond how bands split royalties and to infer whether lesser contributors for all co-authored songs tend to be rewarded equally, I applied the strategy of inferring splits based on the proportion of royalties controlled by ASCAP to all eligible co-authored songs in the ASCAP repertory.

1. Methodology

The ASCAP repertory contains nearly 7 million songs.1916,985,181 songs. ASCAP allows anyone to request its repertory. See SESAC Repertory, supra note 184. More than 2.5 million of these songs are credited to two or more authors.1922,612,687 songs. See id. However, royalty splits can only be inferred in cases where a song’s royalties are controlled by both ASCAP and another PRO. There were a few steps involved in identifying those songs,193The repertory as provided by ASCAP includes the total share of songwriting royalties controlled by ASCAP (writers’ shares and publishers’ shares), and the names of writers of each song. Details on the writer’s share controlled by ASCAP are, however, available in ASCAP’s online repertory. The list of co-authored songs was filtered to include only those songs for which ASCAP controlled less than 100% and more than 0% of the total songwriting royalties. The total share controlled by ASCAP was a useful coarse filter for reducing the list of songs for which ASCAP-controlled writer’s share data was then compiled by web scraping. and ultimately royalty split data was obtained for 1,237,764 works; this covers 92% of the songs ASCAP only partially controls, amounting to 48% of all co-authored songs in the ASCAP repertory.194To note, 8% of the songs on the reduced list of co-authored songs did not have entries in the online repertory and consequently no data was available for those titles.

Each song’s royalties were classified as split equally or split unequally, by the same methodology adopted in Study 2a. To account for potential misclassifications of unequally split songs as equally split songs, I estimated that a reasonable lower bound for the total proportion of equal split songs should include an 8% downward adjustment on the result of this classification method.195When songs were credited to three or four writers, additional steps were taken to reduce the potential for uncertainty in these split estimates. To adjust for the possibility of three-writer splits where, for example, ASCAP controls 33% but two writers are members of ASCAP and the split is therefore unequal (Writers 1 and 2 share 33%, Writer 3 gets 66%), further information was sought. For both three-writer, and four-writer songs, a randomly generated sample of one hundred songs was checked against ASCAP song registrations to obtain a measure of the frequency of unequal splits of the structure described. The frequency of such “hidden inequality” was 5% for three-writer songs and 11% for four-writer songs; taking the average suggests an adjustment of 8%. While it is possible that those splits could still be unequal (for example, Writer 1 gets 33% and instead of splitting the remaining 66% as 33% to each of Writer 2 and Writer 3, the split is Writer 2 gets 40% and Writer 3 gets 26%), this seems unlikely enough as to be insignificant, particularly as the first order adjustments are small. The most frequently observed number of co-authors in both the Gold Record songs list (38%) and the ASCAP co-authored songs list (62%) is two writers. A similarly small proportion of songs are credited to six or more writers in both the ASCAP (1%) and Gold Record song lists (4%). In both the ASCAP co-authored songs data and the Gold Record bands data, recurrent collaborations are the norm: the nature of a band is that collaborative writer-name combinations occur for most of the band’s songs. In the ASCAP co-authored songs data, the median number of repeated collaborations was five, though most collaborative writer-name combinations occur only once. This gave an estimate of the overall proportion of equally split songs in the all co-authored songs ASCAP data.

The next step was to estimate the proportion of those equally split songs that were produced through uneven contributions by collaborators. As a guide, I referenced the proportions of bands including all members as co-authors for Gold Record bands whose songs were composed by members’ contributions which were even, uneven, or without contributions by all members. These two datasets are similar at a high level based on the available data.196The ASCAP repertory does not include information on the covariates of the Gold Record database (songwriting contributions, genre, region, decade, or representative band size). The Gold Record bands’ songs list contains 1% of the songs in the ASCAP co-authored songs data. This reflects the small number of music groups earning Gold Record certifications.

2. Results

A significant majority (63%) of the co-authored songs split royalties equally.197p<0.001. To answer the research question, I needed to identify the subset of the ASCAP songs that were written with uneven contributions; however, the full dataset also includes songs written with even contributions, gift credits,198A “gift credit” is a writing credit naming someone who did not contribute copyrightable expression to the work. Gift credits are common. Telephone interview with Scott Jungmichel, Senior Vice President of Royalty Distrib. & Royalty Rsch. Servs., SESAC (May 1, 2017). and sampling. As a guide for inferring the size of the subset of interest, I refer to Study 1 for the relative shares of the three categories of songwriting contributions of the Gold Record bands including all members as co-authors: even (30%), uneven (44%), and some members do not contribute (26%). Relying on these proportions, I first assume that of the 63% of equal split ASCAP songs, 30% were written with even contributions. The remaining 33% equal split ASCAP songs I assume to have been written with uneven contributions. I assume that 26% of the unequal ASCAP songs credit as writers individuals who did not make a copyrightable contribution (based on the proportion of some members do not contribute bands which did the same). This leaves 11% of the unequal split ASCAP songs assumed to be uneven contributions songs (37% unequal minus 26%), and suggests that the proportion of unevencontributions songs that is equally split is 75%. A further adjustment should be made to account for the possibility of “hidden inequality” described above,199See supra Section IV.B.1. which adjusts the estimated proportion downward by 8% to 67%.

This number is lower than the approximately 80% in Study 2a. One reason for this is that I have not taken into account sampling. Sampling inflates the number of total co-written songs, and specifically the number of unequal songs.200An estimated 15–25% of songs use samples of other songs. Tracklib Presents State of Sampling 2019, Tracklib, https://www.tracklib.com/blog/tracklib-presents-state-of-sampling-2019/ [https://
perma.cc/A4V8-WALF]. Sampling requires licensing both the sound recording and the song itself. This normally involves an advance payment plus sharing a portion of the copyright income of the song in which the sample is used. On the publishing side, the typical range is 10–30% of royalties, Passman, supra note 114, at 250–51, indicating a likelihood of an unequal split.

There is no difference in equal splitting between name combinations that only appear once (66%) versus at least ten times (65%). Number of collaborations for a set of writers was negatively associated with equal splitting,201p < 0.001. but nevertheless majority preference for equal splitting remained significant even with collaborative groupings with over one hundred credited works together (57%).202p < 0.001. A higher number of credited writers was negatively associated with equal splitting.203p < 0.001. The majority preference for equal splitting flips with five or more writers (27%).204p < 0.001. Higher numbers of co-writers also increase the chances of unequal splitting attributable to sampling. Because songs with more credited writers are not necessarily more complex or longer than songs with fewer co-writers, higher numbers of writers also reduce the likelihood that all credited writers have made copyrightable contributions, increasing the probability that the writing credits include gift credits, which may also contribute to rates of unequal splitting for songs with many co-authors.

V. GENERAL DISCUSSION

In this final Part, I review some limitations of the studies and how the findings answer the research questions, along with caveats on the support for the equal split rule. I discuss the legal implications and contributions of this research and preview other work focusing on the mechanisms that potentially drive these co-author inclusion and equal split practices.

A. Limitations

There are several limitations to the data and results of these studies; here I consider some of the most noteworthy. First, whereas a comprehensive database could be constructed to investigate the question of co-author inclusion (Study 1), this was not possible for exploring how royalties are split (Study 2). Instead, there are the convergent lines of Studies 2a and 2b. Each study involved making assumptions during data collection or analysis, which were necessary on account of data or resource limitations. To the extent these assumptions are not supported, the studies’ results and implications may be affected.

The results of the Gold Record database analysis may reflect survivor bias. Only a small fraction of all music groups earns Gold Records; it may be that less successful music groups are less likely to include minor contributors as co-authors. Although it cannot be ruled out, this seems unlikely because bands’ co-author inclusion choices are early business decisions in the groups’ lives, made before the arrival of great success, and are most often stable.

Songs are short by convention, which precluded investigating preferences when disparities in contributions are of the magnitude present in Aalmuhammed.205                                           Aalmuhammed v. Lee, 202 F.3d 1227, 1233 (9th Cir. 2000). Still, as discussed, the control doctrine has been applied more often than not regardless of the magnitude of the parties’ contributions, and, as argued, co-songwriting is the most important domain where the default rules are likely to apply in practice.

Figure 2.  Studies 1 & 2: Research Questions and Results

B. Summary of Findings

The control doctrine has resulted in the exclusion of lesser contributors from joint authorship when there was no contract between the collaborators. This, along with other problematic effects, has caused some scholars to propose returning to the pre-control legal regime and retaining the equal split rule. Others propose putting in place a proportional split default rule to better address the concerns with the control doctrine, and implicitly, align with creators’ assumed allocation preferences.

To identify the best choices, Studies 1 and 2 sought to uncover the actual preferences of co-songwriters. I have argued that the best option for co-songwriters is presumptively the best rule on the whole for those collaborating to produce joint works.

1. Are Lesser Contributors Counted as Co-authors?

First, I explored whether or not lesser contributors in co-songwriting are typically rewarded with co-authorship (Study 1). To observe the leap from contributor to co-author, I constructed the Gold Record database of 1,003 bands—every band that both has a Gold Record and writes its own songs. I found that the inclusion of lesser contributors as co-authors has steadily increased for the past six decades, becoming the typical practice (63%) beginning in the 1990s. I also found that 75% of bands never changed their initial practice of including or not including all members as co-authors. This is consistent with a stable level of songwriting contributions. When bands did shift from their initial co-author crediting choices, it tended to be in the direction of more, rather than less, co-author inclusion. This could be explained by an increase in members’ songwriting contributions, but it also raises questions for future investigation about a possible ratcheting effect, whereby it is hard to renegotiate a generous initial co-author inclusion practice.

The strength of the co-author inclusion result is notable given the heterogeneity of the data, which includes bands from subgenres as diverse as Norteño, Metalcore, Eurodance, Contemporary Christian, Children’s Music, and Punk Rock. The music created by these groups appeals to a wide spectrum of people’s tastes.

This practice runs counter to the de facto exclusion of lesser contributors under the control doctrine. Additionally, to the extent that control is incompatible with recognizing more than a few authors because it is understood as a right to override others’ creative choices,206See id. at 1233 (“[Artistic control] would generally limit authorship to someone at the top of the screen credits . . . .”); Goldstein, supra note 9, § 4.2.1.2. the results are not consistent with co-songwriters sharing that view: songs created through the uneven contributions of four collaborators typically credit all involved.207To the extent that “control” is understood as control over one’s own contributions rather than the entire work, it is compatible with the findings. However, that has been a minority interpretation of the control doctrine. See supra note 70 and accompanying text.

Although this preference data strengthens the case against the control doctrine, I find support for limits on grants of co-authorship even for contributors who have made copyrightable contributions. When group size exceeds four contributors, having made a copyrightable contribution—even an equal one—is no longer treated as sufficient for co-authorship in most cases. Thus, considerations beyond contributions do play a role in deciding who is granted co-authorship credit for a joint work, at least in collaborations at scale. This is consistent with the view in Aalmuhammed that “as the number of contributors grows and the work itself becomes less the product of one or two individuals, . . . the word [‘author’] is harder to apply.”208Aalmuhammed, 202 F.3d at 1232. Future research might investigate what these other considerations may be. Nevertheless, in the case of co-songwriting, the findings suggest large collaborations are uncommon. On the whole, the concepts and consequences of the control test are at odds with the revealed preferences of this significant creator population. Because it cuts against these revealed preferences, the transaction costs of the current legal regime are presumptively high to the extent that parties seek to contract out (though the data does not report contracting rates).

Future research may explore what accounts for this inclusivity trend. It may be due in part to an increase in the kinds of bands that tend to include all contributors as co-authors, such as smaller bands. There have also been changes in music industry standards and practices, which may have played a role. First, mechanical songwriting royalties doubled—and continued to increase over time—after the Copyright Act of 1976.209See Mechanical License Royalty Rates, U.S. Copyright Off., https://www.copyright.gov/
licensing/m200a.pdf [https://perma.cc/3XVH-3GFG].
As songwriting royalties increased in value, more band members may have sought a piece of the pie. Including all members as co-authors may have facilitated agreements to feature the strongest songs on albums without generating animosity.210See, e.g., Ryan Reed, Why Genesis Started Writing Shorter Songs: Exclusive Interview, UCR (Mar. 26, 2018), https://ultimateclassicrock.com/genesis-tony-banks-songwriting/ [https://perma.cc/
WR2C-R73Z] (interview with Tony Banks of Genesis); Richard Buskin, Classic Tracks: 10cc ‘I’m Not In Love,’ Sound On Sound (June 2005), https://www.soundonsound.com/techniques/classic-tracks-10cc-not-love [https://perma.cc/6TJL-FJUJ] (interview with Eric Stewart of 10cc). Both give this reason for equal royalty splits regardless of actual authorship.
Second, there were changes in what contributions were viewed as “songwriting.” In early Rock, the melody writer was considered the writer of the song. Early Rock was rooted in the Blues, and the rhythm portion of the song was often an uncopyrightable standard.211See Tom W. Bell, Intellectual Privilege: Copyright, Common Law, and the Common Good 21 (2014) (“[C]ourts have repeatedly protected stock scenarios and motifs . . . from suffering capture within copyright’s exclusive rights. The balcony scene, one-point perspective, blues chord progressions, and other creative building blocks thus remain free for all authors to use and reuse.” (footnote omitted)); Timothy J. McFarlin, Father(s?) of Rock & Roll: Why the Johnnie Johnson v. Chuck Berry Songwriting Suit Should Change the Way Copyright Law Determines Joint Authorship, 17 Vand. J. Ent. & Tech. L. 575, 643 (2015) (“Johnson may have been playing what was, by itself, a preexisting or otherwise uncopyrightable blues chord progression or riff.”); Michael J. Madison, Intellectual Property and Americana, or Why IP Gets the Blues, 18 Fordham Intell. Prop. Media & Ent. L.J. 677, 700–01 (2008) (“[T]he law sees in the blues not the concrete and protectable, but the general and the unprotectable. Individual notes and chords are ‘facts’; chord progressions are ‘ideas’ necessary to expression within the genre . . . . Repetition of rhythms, riffs, and even melodies does not constitute ‘originality’ or ‘authorship’ in a copyright sense.”). Third, musicians have arguably become savvier as Rock culture has developed. In early Rock, the only legally sophisticated party was often the producer, studio owner, label owner, or bandleader. It was in that person’s best interest to limit the number of writers on the song and, if possible, add themselves as writers.212Shourin Sen, The Denial of a General Performance Right in Sound Recordings: A Policy that Facilitates Our Democratic Civil Society?, 21 Harv. J. L. & Tech. 233, 241–42 (2007).

2. Do Lesser Contributing Co-authors Receive Less Royalties?

The converging lines of evidence from Studies 2a and 2b suggest that royalties are typically split equally, even with lesser contributing co-authors (over 70% of the time).213See, e.g., LaFrance, supra note 76. In Study 2a, the songwriting royalty splits of a third of the uneven contributions Gold Record bands were inferred based on information disclosed by the bands’ inner circles or ASCAP; in Study 2b, splits were inferred for co-songwriting more broadly, based on a sample of more than a quarter of co-authored songs registered with U.S. PROs.214I take this to be a conservative estimate. ASCAP and BMI have 90% market share; the co-authored songs are 48% of co-authored songs in ASCAP’s repertory, most commonly listed also in BMI’s repertoire. Perhaps surprisingly, the underlying preference assumptions of scholars who have argued the law should be revised to favor contribution-based splits were not supported. Instead, the results throw support behind the proponents of the existing equal split rule.

As with initial co-author inclusion choices, there was stability in initial split choices. Co-authors who wrote together only once were as likely to split royalties equally as co-authors who collaborated one hundred times. 

C. Legal Implications

Study 1 affirms the conventional wisdom, endangered by the control doctrine, that contributions are the most important consideration in granting co-authorship. Songwriting contributions overshadowed differences of genre, region, band size, and era. The copyrightability considerations standards have been on somewhat shaky ground as control has expanded to trump other factors in co-authorships tests. These results provide additional support for pushing back against that trend and reasserting a contributions-centered way of thinking about who is a co-author.

At the same time, the results imply that once co-authors are recognized, their comparative contributions are often disregarded when they allocate proceeds among themselves. Co-authorship practices appear as a step-function, rather than a matter of degree. Rather than provide a mandate for establishing a contributions-based default split rule, the findings suggest that the equal split rule is consistent with preferences and does not deter grants of co-authorship to lesser contributors. Taken together, Studies 1 and 2 suggest that equal co-authorship for lesser contributors is the practice of a plurality of the involved co-songwriters. On this basis, the studies lend support to the statutory intent to merge copyrightable contributions standards, or, at least, support the suggestion by Professor Mary LaFrance, and others, that a rebuttable presumption of intent is appropriate.

A return to the intent to merge co-authorship standard would predictably reach undesirable results in certain industries, most notably the film industry and editorial relationships. This section discusses two possible responses.215A third response could be industry-specific joint authorship rules. See, e.g., Kwall, supra note 87, at 62–63; McFarlin, supra note 211, at 660–61; Gregory S. Donat, Fixing Fixation: A Copyright with Teeth for Improvisational Performers, 97 Colum. L. Rev. 1363, 1403–04 (1997); Susan Keller, Collaboration in Theater: Problems and Copyright Solutions, 33 UCLA L. Rev. 891, 935 (1986). The industry-specific nature of patent law is well-attested. See Dan L. Burk & Mark A. Lemley, Is Patent Law Technology-Specific?, 17 Berkeley Tech. L.J. 1155, 1158–85 (2002). And industry-specific carve-outs are well-known in copyright law; for example, the blanket inclusion of all creative motion picture contributions as potential works made for hire. 17 U.S.C. § 101; see also Peter DiCola, Music Copyright, in 2 Research Handbook on the Economics of Intellectual Property: Analytical Methods 565–66 (Peter Menell & David Schwartz eds., 2019) (discussing ways in which copyright law is tailored to the music industry, notably in infringement); Joseph P. Liu, Regulatory Copyright, 83 N.C. L. Rev. 87 (2004). Yet this rule was the result of intense industry lobbying. Cf. Burk & Lemley, supra note 6, at 1637 (warning of “counterproductive special interest lobbying” in tailored patent legislation). Furthermore, creative industry standards, when they can be identified, may run contrary to black letter copyright law, with little room for modification except through legislation. See, e.g., Effects Assocs., Inc. v. Cohen, 908 F.2d 555, 556–57 (9th Cir. 1990) (discussing the apparent custom of unwritten exclusive transfers in the film industry). Or, consider an industry default based on the outcome of 16 Casa Duse, LLC v. Merkin, 791 F.3d 247, 265–65 (2d Cir. 2015): all film work would be work made for hire, even in the absence of a contract. An industry-based approach would need to take into account these and other challenges to further the utilitarian ends of copyright. See generally Polcz, supra note 7.

First, any default rule will not satisfy all situations. Parties can contract out of the default rule, and it is reasonable to expect that sophisticated parties will do so. Inevitably some may fail to put in place a contract through pure oversight. But it is inefficient to change the rule when such oversights are exceptional. In other contexts, when parties failed to contract around default rules, there have been costly settlements that have no doubt subsequently increased attention to such matters down the road.216See, e.g., Samantha Drake, Co-Founders Could Have Avoided Legal Drama on Eve of Self-Driving Car Deal, Forbes (July 14, 2016), https://www.forbes.com/sites/samanthadrake1/2016/
07/14/co-founders-could-have-avoided-legal-drama-on-eve-of-self-driving-car-deal [https://perma.cc/
82WU-NYW9] (“[T]he apparent failure to document the scope of an alleged co-founder’s contribution at the San Francisco-based Cruise came back to haunt it on the eve of the startup’s acquisition.”).

Second, future research might explore a parameter-based approach to co-authorship determinations. For example, the number of collaborations was associated with co-author inclusion and royalty split. There has been considerable theoretical debate over the extent to which defaults should be “tailored” (that is, modified) such that they lead to different results for different parties based on some relevant characteristic. This has been recognized to potentially lead to more efficient outcomes by better reflecting decisions that parties would actually make in a costless environment.217See Richard Craswell, Contract Law: General Theories, in 3 Encyclopedia of Law and Economics: The Regulation of Contracts 1, 5 (Boudewijn Bouckaert & Gerrit De Geest eds., 2000); Stewart J. Schwab, Life-Cycle Justice: Accommodating Just Cause and Employment at Will, 92 Mich. L. Rev. 8, 52 (1993). Default tailoring comes with costs: the cost to lawmakers of identifying and implementing the most efficient tailoring, and the cost to parties of interacting with a default more specific to their circumstances than a one-size-fits-all rule.218See Ayres & Gertner, supra note 133, at 117–18; Christopher R. Drahozal & Peter B. Rutledge, Contract and Procedure, 94 Marq. L. Rev. 1103, 1160 (2011) (stating that a complex rule may be a more efficient rule if efficiency gains outweigh increased information costs); Robert E. Scott, A Relational Theory of Default Rules for Commercial Contracts, 19 J. Legal Stud. 597, 598 (1990) (noting that rule complexity increases costs). Thus, the most efficiently tailored rule is likely one that reflects a discrete set of common practices, thereby reducing information costs. This may be why previous proposals to tailor the equal split default have focused on industry-based tailoring as a commonsense default unit. This Article has suggested that the number of collaborators is one potential parameter; if the size of collaborative groups predicts split preference across genres and regions, it is worth investigating whether it is a useful predictor across industries. Pursuing this and other leads219See George S. Geis, An Experiment in the Optimal Precision of Contract Default Rules, 80 Tul. L. Rev. 1109, 1129 (2006) (arguing that optimal tailoring must be based on empirical data). See generally Ariel Porat & Lior Jacob Strahilevitz, Personalizing Default Rules and Disclosure with Big Data, 112 Mich. L. Rev. 1417 (2014). Analysis of empirical data has increasingly been used to challenge conventional wisdom in a variety of copyright contexts. See, e.g., Christopher Buccafusco & Paul J. Heald, Do Bad Things Happen When Works Enter the Public Domain?: Empirical Tests of Copyright Term Extension, 28 Berkeley Tech. L.J. 1, 17–28 (2013) (using audiobook data to question prevailing rationales for copyright term extension); Kristelia García, James Hicks & Justin McCrary, Copyright and Economic Viability: Evidence from the Music Industry, 17 J. Empirical Legal Stud. 696 (2020) (finding that sound recordings largely exhaust their economic value long before the termination of the copyright term); Glynn S. Lunney, Jr., Copyright and the 1%, 23 Stan. Tech. L. Rev. 1 (2020) (supporting copyright reform proposals with video game usage data); see also DiCola, supra note 215, at 567 (discussing empirical data-gathering difficulties particular to the music industry). may bring about gains in efficient tailoring over and above what is possible with industry-based rules.220The goal is to find the tipping point at which the rule is tailored to the extent that such tailoring maximizes efficiency: where further tailoring would increase net transaction costs. Porat & Strahilevitz, supra note 219, at 1423.

D. Why Equality: Mechanisms

The data presented suggest that when it comes to relative rewards, co-songwriters, on balance, prefer equal over contributions-based allocations—even in unequal collaborations. What can account for this surprising preference for equality? A systematic investigation of the preference mechanisms at work is out of the scope of this Article. I conclude here by priming intuitions connected with the theoretical framework I use elsewhere for that purpose.

Context often determines which of a few interpersonal allocation rules we find to be most psychologically rewarding. There are many contexts in which we find equality—where everyone is rewarded in the same quantities—to be the most psychologically rewarding state of affairs. Under other circumstances, we may prefer contribution-based remuneration. A context can be defined, for example, by who is involved or what goals are involved. There are patterns to when we are likely to have these preferences, which have been studied under various research traditions in distributive justice.221See Robert J. MacCoun & Sarah Polcz, Distributive Justice Norms, Social Value Orientations, and Social Relations Models: An Integrative Account, in Soc. Psych. & Just. 93, 93 (E. Allan Lind ed., 2020). Earlier, I noted that the members of a large number of bands are friends or family members. Theories that can provide a way of thinking about the influence of parties’ close relationships on the allocations they prefer when distributing a resource—such as royalties—among themselves, are most conducive for understanding this equality preference.222See generally Alan Page Fiske, Structures of Social Life: The Four Elementary Forms of Human Relations 6063 (1991) (discussing four models for the distribution of resources).

CONCLUSION

The trends in co-songwriters’ preferences revealed by this research offer the first comprehensive look into decisions made every year, hundreds of thousands of times, offering a point of reference for creators. In songwriting, co-authorship matters. Not only does it confer status; it also confers money. For some musicians, co-authorship royalties are their retirement plan. As the lead singer in an all-female Punk Rock band explained songwriting credits, “we’re straight up talking business here.”223Telephone Interview with Julia Kugel-Montoya (Sept. 17, 2017).

And yet, in this Article I have looked at this “business” decision of how to share co-authorship and found that short-term financial self-interest is not the end of the story: main songwriters share songwriting credit, and songwriting royalties, much more generously than the standard by which lesser contributors would be rewarded under the control test. This revealed preference suggests that we can adopt a more inclusive legal criterion for co-authorship while retaining the equal split default and that we can do so without violating creators’ own sense of fairness.

APPENDIX

Table A1.  Logistic Regression for Gold Record Bands 1959–2021: Are All Members Included as Co-authors?

Variable

log(OR)

95% CI

p-value

Songwriting (ref. cat.: Some members do not contribute)

     

Even

2.9

2.4, 3.5

<0.001**

Uneven

1.8

1.4, 2.1

<0.001**

Decade (ref. cat.: 1980s)

     

1960s and earlier

-0.94

-1.7, -0.20

0.016*

1970s

-0.33

-1.0, 0.27

0.3

1990s

0.50

0.06, 1.0

0.026*

2000s

0.74

0.23, 1.2

0.004**

2010s and later

0.90

0.23, 1.2

0.004**

Members (ref. cat.: 4)

     

2

0.91

0.43, 1.4

<0.001**

3

0.02

-0.42, 0.45

>0.9

5

-0.61

-1.1, -0.11

0.017*

6+

-1.5

-2.6, -0.56

0.003**

Genre (ref. cat.: Pop)

     

Rock

0.37

-0.14, 0.89

0.2

Hip Hop, R&B, Gospel, Jazz

-0.79

-1.4, -0.16

0.014*

Country

-1.0

-2.0, -0.09

0.034*

Metal

0.27

-0.52, 1.0

0.5

Latin

-1.5

-3.0, -0.18

0.032*

Punk

0.87

-0.29, 2.0

0.13

Electronic

-0.50

-1.8, 0.90

0.5

Reggae

1.0

-1.6, 4.1

0.5

 

Region (ref. cat.: Northeast)

     

Midwest

-0.68

-1.4, 0.04

0.069

West

0.01

-0.49, 0.51

>0.9

South

0.11

-0.43, 0.65

0.7

Non-USA

0.26

-0.26, 0.78

0.3

Notes: *p < 0.05; **p < 0.005.

     

96 S. Cal. L. Rev. 607

Download

Fellow, Stanford Law School. For comments on versions of this Article, I thank Lisa Ouellette, Paul Goldstein, Robert MacCoun, Samantha Zyontz, Joel Shor, Anna Lewis, and Adam Brown. For helpful feedback, I also thank the participants of the 2022 Copyright Scholarship Roundtable at Columbia Law School, the Stanford Legal Research in Progress Workshop 2020, the Intellectual Property Scholars Conference 2020 at Stanford Law School, and the Works-in-Progress Intellectual Property Colloquium 2020 at Santa Clara University School of Law. Thank you to Matthew Kelleher and William Button for outstanding research support.

The Limitations of Applying the Stored Communications Act to Social Media

The advent of social media has increasingly affected how people live and communicate. Millions of Americans use social media every day, and the numbers continue to grow. The motivation to post on social media is multifactorial and includes a desire to stay connected, find others with shared interests, change opinions, and encourage action, but posting also serves to boost one’s self-esteem and self-worth. However, posting on social media creates a serious risk of self-disclosure, with people revealing more intimate details online than they would in more traditional settings without really appreciating the privacy issues and potential negative consequences related to such disclosures.

As social media use continues to grow, its use as a tool in police investigations has also increased. Both the content and metadata associated with social media posts now routinely aid law enforcement authorities in finding patterns and, importantly, in establishing timelines in criminal investigations. Thus, there is an urgent need to revise the existing laws governing stored communications—to better adapt them to these new, evolving technologies and improve the legal framework governing online privacy rights. This Note argues that various aspects of the Stored Communications Act (“SCA”) are outdated and that thirty-six years after it was enacted, it is time for an update that reflects the changing landscape of evolving technological advances.

The Note explores how the internet and social media use have evolved over the years and explains why the SCA no longer sufficiently protects consumers from government acquisition of their information. Particular emphasis is placed on the novelty of social media “Stories,” a technology unlike any that Congress could have imagined when it enacted the SCA in 1986. The Note examines the history of the SCA—with a focus on the Fourth Amendment, the Electronic Communications Privacy Act, and Supreme Court cases addressing the applicability of the Fourth Amendment to various forms of communication technology—before analyzing the SCA in detail, and looks at how law enforcement agencies can obtain these communications for use in criminal investigations. The Note concludes by arguing that the SCA needs to be revised to more adequately apply to today’s social media technologies since their content, and non-content, does not easily fit into the currently delineated categories. Revising the SCA would afford greater protection to consumer communication rights: not only would the SCA better apply to modern technology, but it would also be more readily applicable to future emerging media technologies.

INTRODUCTION

The rise of social media has significantly impacted the way people live and communicate, and the trend toward extensive social media use will likely only continue to grow. According to a Pew Research Center study, seven in ten Americans use social media.1Brooke Auxier & Monica Anderson, Social Media Use in 2021, Pew Rsch. Ctr. (Apr. 7, 2021), https://www.pewresearch.org/internet/2021/04/07/social-media-use-in-2021 [https://perma.cc/
DG7C-4FY3].
On average, people spend an estimated two and a half hours on social media platforms over the course of their day,2Global Social Media Statistics, DataReportal, https://datareportal.com/social-media-users [https://perma.cc/Y6JS-XZQF]. While this number might not seem large when compared to the twenty-four hours in the day, it is reported that, on average, Americans spend around five and a half hours a day on their phones, while globally, people average just over three hours of phone time per day. Damjan Jugović Spajić, How Much Time Does the Average Person Spend on Their Phone?, Kommando Tech (May 10, 2022), https://kommandotech.com/statistics/how-much-time-does-the-average-person-spend-on-their-phone [https://perma.cc/K5HZ-W9TF]. This means that of all the time people spend on their phones each day, about one half is spent exclusively on social media. and “[a] majority of Facebook, Snapchat and Instagram users say they visit these platforms on a daily basis.”3Auxier & Anderson, supra note 1. More specifically, 69% of Americans use Facebook, 40% of Americans use Instagram, and 25% of Americans use Snapchat.4Id. These percentages represent a significant number of people—approximately 230 million, 133 million, and 83 million, respectively.5These numbers were calculated based on the Census Bureau’s most recent estimate of the American population (332,403, 650). Derrick Moore, U.S. Population Estimated at 332,403,650 on Jan. 1, 2022, U.S. Census Bureau (Dec. 30, 2021), https://www.census.gov/library/stories/2021/12/happy-new-year-2022.html [https://perma.cc/3Z3P-3HVB]. Further, social media users make extensive use of the “Stories”6See infra Section I.B. feature, with one billion Facebook Stories being posted daily and five hundred million daily active users of Instagram Stories worldwide.7Jimit Bagadiya, 430+ Social Media Statistics You Must Know in 2022, SocialPilot, https://www.socialpilot.co/blog/social-media-statistics [https://perma.cc/D6DJ-SPU9]. The motivation to post on social media is multifactorial and includes a desire to stay connected, find others with shared interests, change opinions, and encourage action, but posting also serves to boost one’s self-esteem and self-worth.8Rosalyn Ransaw, The Psychology Behind Why We Share on Social Media, ShutterStock (Apr. 30, 2021), https://www.shutterstock.com/blog/the-psychology-behind-why-we-share-on-social-media [https://perma.cc/9B4D-72D7]. These desires create a serious risk of self-disclosure on social media, with people revealing more intimate details online than they would in more traditional settings without really appreciating the privacy issues and potential negative consequences related to such disclosures.

Just as social media has become popular with the American public, it is also becoming increasingly utilized as a tool in police investigations. A 2012 survey showed that four out of five law enforcement agents used social media to gather intelligence during investigations.9Heather Kelly, Police Embrace Social Media as Crime-Fighting Tool, CNN Business (Aug. 30, 2012, 5:23 PM), https://www.cnn.com/2012/08/30/tech/social-media/fighting-crime-social-media/
index.html [https://perma.cc/2EPT-56GK].
Not only do authorities look online for public information, but they also request access to private data directly from social media providers—which can help them build their criminal cases. For example, after finding photos and comments “glamorizing alcohol abuse” on a woman’s MySpace page, prosecutors were able to use them as evidence and advocate for a longer sentence for her vehicular manslaughter conviction.10Ian Urbina, Social Media, a Trove of Clues and Confessions, N.Y. Times (Feb. 15, 2014), https://www.nytimes.com/2014/02/16/sunday-review/social-media-a-trove-of-clues-and-confessions.html [https://perma.cc/9BRM-HGAD]. Since people are less inhibited when it comes to social media disclosures, they often share details of their lives and more controversial opinions than they may in other forums. After these once private thoughts are stored electronically, they become more easily accessible to investigators. Not only can the content of social media posts aid criminal investigations, but the related metadata11There are different kinds of metadata, but in the context of criminal investigations and social media, descriptive metadata—which includes the time and date the content was created and posted,
the creator of the data, and the location on the device where the data was created—can be
implicated. Metadata Forensics, When Files Can Speak and Reveal the Truth, Ironhack (June
24, 2021), https://www.ironhack.com/en/cybersecurity/metadata-forensics-when-files-can-speak-and-reveal-the-truth [https://perma.cc/8XWX-HEKD].
alone “can help law enforcement authorities to find patterns, establish timelines and point to gaps in the data.”12Adelle Geronimo, Beyond Data: The Value of Metadata in Criminal Investigations, ITP.net (Sept. 1, 2021), https://www.itp.net/security/99783-beyond-data-the-value-of-metadata-in-criminal-investigations [https://perma.cc/SVB6-QE42]. “[C]hanging technology has rendered metadata analysis more important.” Orin S. Kerr, The Next Generation Communications Privacy Act, 162 U. Pa. L. Rev. 373, 398 (2014). Therefore, social media metadata can be just as easily used to gather information on a suspect as the actual content of a post. Because the trend toward extensive social media use will likely endure, there is an urgent need to revise the laws governing stored communications—to better adapt them to these evolving technologies and improve the legal framework governing online privacy rights.

This Note argues that various aspects of the Stored Communications Act (“SCA”) are outdated and that thirty-six years after it was enacted, it is time for an update that reflects the changing landscape of evolving technological advances. Part I of this Note explores how the internet and social media have evolved throughout the years and explains why the SCA no longer affords sufficient protections against government acquisition of consumer information. It discusses the evolution and expansion of social media platforms. Particular emphasis is placed on the novelty of social media Stories, which are unlike any technology that Congress could have imagined when they enacted the SCA in 1986.

Next, Part II examines the history behind the SCA to explain why the law was initially passed by Congress, with a focus on the Fourth Amendment, the Electronic Communications Privacy Act (“ECPA”), and Supreme Court cases addressing the applicability of the Fourth Amendment to various forms of technology. Part III analyzes the SCA in detail, focusing on the distinctions made between the different types of internet service providers (“ISPs”) and the different aspects of communications (content versus non-content data). It looks at how the content and non-content information—for example, metadata including a user’s identity, location, and other data not part of the main substance of the communication—can be obtained by law enforcement in the course of a criminal investigation.

Part IV argues that the SCA cannot be easily applied to social media today because it does not fit within the categories delineated in the SCA. Most importantly, it highlights how (1) social media content does not easily fit into either of the SCA’s currently defined categories because Congress could not have anticipated the advances in the technologies that exist today; and (2) “non-content” is not fully defined in the statute, and therefore lends itself to being more easily obtained in some situations as opposed to others. Finally, Part V suggests ways in which the SCA can be revised to more adequately apply to social media today and ultimately protect the right to privacy guaranteed by the U.S. Constitution.

I. INTERNET PRIVACY AND EVOLVING TECHNOLOGY

Americans are entitled to their right to privacy, which on third-party ISPs such as Facebook and MySpace is protected by the SCA.13Stored Communications Act, 18 U.S.C. §§ 2701–2713. One problem with the SCA, however, is that it is dated. Although the internet was invented in the 1960s, it was not widely used until 1983, when computers on different networks were finally able to easily communicate with one another.14A Brief History of the Internet, Bd. of Regents of the Univ. Sys. of Ga., https://www.usg.
edu/galileo/skills/unit07/internet07_02.phtml [https://perma.cc/P72B-H2DS].
When the SCA was enacted in 1986—just three years later—Congress had only a limited experience with internet use and the potential privacy problems it could create, and had certainly not envisioned the extensive modern use of social media. This partially accounts for some of the weaknesses in this legislation and why the SCA is often difficult to apply to social media today.

A. Evolution of Social Media Platforms

Social media is defined as “forms of electronic communication . . . through which users create online communities to share information, ideas, personal messages, and other content.”15Social Media, Merriam-Webster, https://www.merriam-webster.com/dictionary/social%20
media [https://perma.cc/3PUC-PTPT].
This definition implies that social media could not exist without the internet, and that it depends on user-generated content.16See Matthew Jones, The Complete History of Social Media: A Timeline of the Invention of Online Networking, Hist. Coop. (June 16, 2015), https://historycooperative.org/the-history-of-social-media [https://perma.cc/WUZ9-JVWE]. While it can be said that social media began in 1971, when the first email was sent,17Rachel Swatman, 1971: First Ever Email, Guinness World Recs. (Aug. 19, 2015), https://
http://www.guinnessworldrecords.com/news/60at60/2015/8/1971-first-ever-email-392973 [https://perma.cc/
9CNE-U852].
for many people social media really began in the late 1990s or early 2000s—years after the SCA was enacted—with the advent of messaging services such as AOL and MSN Messenger.18See Caitlin Dewey, A Complete History of the Rise and Fall—and Reincarnation!—of the Beloved ‘90s Chatroom, Wash. Post (Oct. 30, 2014, 2:01 PM), https://www.washingtonpost.com/news/
the-intersect/wp/2014/10/30/a-complete-history-of-the-rise-and-fall-and-reincarnation-of-the-beloved-90s-chatroom [https://perma.cc/PB3Y-4T3B] (“Services like MSN and AOL . . . made the chat function available to millions of Americans . . . .”).
MySpace, arguably the “most popular and influential” of the early social media platforms, was later launched in August 2003,19Jones, supra note 16; Nicholas Jackson & Alexis C. Madrigal, The Rise and Fall of MySpace, Atlantic (Jan. 12, 2011), https://www.theatlantic.com/technology/archive/2011/01/the-rise-and-fall-of-myspace/69444 [https://perma.cc/ZYJ4-9NEJ]. Although MySpace was more popular, Six Degrees is “credited as being the ‘first online social media’ site” because it “allowed people to sign up with their email address, make individual profiles, and add friends to their personal network.” Jones, supra note 16. Six Degrees only lasted for four years, and it peaked at less than four million users, id., far less than the twenty-seven million users MySpace had just two years after its launch. Jackson & Madrigal, supra. and it allowed individuals to interact by commenting on each other’s profiles and sending private messages. It was the largest social media platform until Facebook, created in 2004, overtook it in 2008.20Jones, supra note 16. Facebook has now grown to be the largest social media platform in the world with almost three billion monthly active users.21Facebook Statistics and Trends, DataReportal, https://datareportal.com/essential-facebook-stats [https://perma.cc/BP76-FY42] (“Facebook had 2.934 billion monthly active users in July 2022 . . . .”).

The number and types of social media platforms have grown extensively. Today, other prominent social media platforms include Instagram and Snapchat. Instagram was launched in 2010 and is a platform focused on sharing photos and videos.22Jones, supra note 16. Snapchat was created in 2011 and gained its popularity from users’ ability to send each other pictures or videos (“Snaps”) that disappear shortly after being opened.23Id. These platforms allow users to share content with their friends, some of which they believe to be “private,” visible only to those friends they allow to see it. However, the widespread use of these platforms has created new issues with how the government can legally access and use these communications.

B. Emergence of Stories on Social Media Platforms

The continued evolution and development of new information sharing functions on social media platforms have created multiple issues concerning user privacy rights. For example, in 2013, Snapchat began to allow people to share “Stories” that are displayed for twenty-four hours before becoming inaccessible.24Emma Wiltshire, The Rise of the Story Format [Infographic], Social Media Today (Feb. 2, 2018), https://www.socialmediatoday.com/news/the-rise-of-the-story-format-infographic/516143 [https://
perma.cc/SWJ6-9MXN].
Stories are a collection of individual Snaps that are played in the order in which they were created and allow users to share their entire day in a narrative manner. Today, Stories are also available on a variety of other social media platforms, including Facebook and Instagram.25Snapchat was the first social media platform to utilize Stories, in October 2013, with Instagram following in August 2016 and Facebook in March 2017. Id. Other social media applications have also started utilizing Stories. Id. Part of the reason why Stories are so successful is because they are only available temporarily, so people can post small daily updates or silly images that they only want visible for a short period of time.26See Simon Batt, What Are “Stories” on Social Media?, Make Tech Easier (Jan. 3, 2019), https://www.maketecheasier.com/stories-on-social-media [https://perma.cc/AD66-7R7A] (noting the traits that make Stories useful). Therefore, users reasonably believe that their content will remain private and then disappear, becoming permanently inaccessible. Another reason for the success of Stories is that “social media [S]tories tend to be more spontaneous” than an individual’s carefully curated feed, making it feel more “casual.”27Chloe West, Social Media Stories: Your Guide to All Social Media Story Platforms, Sprout Soc. (June 30, 2021), https://sproutsocial.com/insights/social-media-stories [https://perma.cc/EV8W-C9LD]. As a result, these Stories can be extremely useful to law enforcement, as they can provide a less filtered view of an individual’s daily life and a timeline for the posted events. Thus, the challenge becomes balancing users’ right to privacy with the government’s need for access to information in order to investigate criminal offenses.

As it exists now, the SCA does not provide an adequate statutory framework for protecting communications on the various aforementioned social media platforms and, importantly, does not specifically address new advances in technology such as transient Snapchat and Instagram Stories. Since the SCA does not adequately protect individuals from unlawful searches of their private social media data, there is a need for Congress to reform the statute to accommodate evolving technology.

II. HISTORY OF THE STORED COMMUNICATIONS ACT

A. The Fourth Amendment

The Fourth Amendment to the Constitution protects “[t]he right of the people to be secure in their persons, houses, papers, and effects, against unreasonable searches and seizures.”28U.S. Const. amend. IV. While the meaning of “search” is not immediately defined by the Amendment, the Supreme Court has held that “[a] ‘search’ occurs when an expectation of privacy that society is prepared to consider reasonable is infringed”29United States v. Jacobsen, 466 U.S. 109, 113 (1984). and that “[i]f the inspection by police does not intrude upon a legitimate expectation of privacy, there is no ‘search.’ ”30Illinois v. Andreas, 463 U.S. 765, 771 (1983). Thus, when it comes to physical searches, the meaning of the Fourth Amendment is well understood,31Orin S. Kerr, Computer Crime Law 389 (4th ed. 2018). whereas what constitutes a search in the digital context is more uncertain.

In Olmstead v. United States, the Supreme Court held that wiretapping did not violate the Fourth Amendment because the lack of physical trespass and seizure of anything tangible meant there was no search or seizure.32Olmstead v. United States, 277 U.S. 438, 463–64 (1928), overruled by Katz v. United States, 389 U.S. 347 (1967), and Berger v. New York, 388 U.S. 41 (1967) (holding that the use of the wiretapped conversations of a suspected bootlegger as incriminating evidence did not violate his Fourth Amendment rights because wiretapping did not constitute a search or seizure under the meaning of the Fourth Amendment since there was no physical trespass). Because the Court refused to expand the Fourth Amendment to protect telephone communications,33Id. at 465. the government could legally intercept citizens’ communications as long as they did not physically enter their homes. Olmstead was later overruled by Katz v. United States,34Katz, 389 U.S. at 357–59 (holding that putting a recording device in a public phonebooth violated a gambling suspect’s Fourth Amendment rights because the Fourth Amendment applies to people, not places). indicating a change in ideology that afforded citizens protection of their privacy even without a physical search. Because Katz held that a physical intrusion was not necessary to invoke the Fourth Amendment, online searches—which lack physical intrusions—can still violate the Fourth Amendment.

B. The Electronic Communications Privacy Act

In light of these changing viewpoints on the applicability of Fourth Amendment protections, Congress enacted the ECPA35Electronic Communications Privacy Act of 1986, Pub. L. No. 99-508, 100 Stat. 1848. in 1986 in an effort to adapt the doctrines of the Fourth Amendment to the various emerging technologies. The SCA, which provides privacy protections to stored electronic and wire communications, is one part of the ECPA. The ECPA was created with the purpose of protecting American citizens from “the unauthorized interception of electronic communications.”36S. Rep. No. 99-541, at 1 (1986), as reprinted in 1986 U.S.C.C.A.N. 3555, 3555. Congress recognized a need to “update and clarify Federal privacy protections and standards in light of dramatic changes in new computer and telecommunications technologies.”37Id. At the time, advances in technology included “large-scale electronic mail operations, computer-to-computer data transmissions, cellular and cordless telephones, paging devices, and video teleconferencing.” Id. at 2. Rightly, Congress worried that due to these advances, personal communications could be intercepted by individuals who had no right to obtain them, and thus felt it was important to enact the ECPA.38Id. at 3. However, the scope of the ECPA did not fully anticipate the impact of the growth and extent of social media.

C. Supreme Court Cases Addressing the Fourth Amendment and Technology

More recently, the Supreme Court heard a series of cases that addressed the applicability of the Fourth Amendment to newer technologies. In each of these cases, the Supreme Court Justices grappled with applying the existing legal framework, indicating that it is time for a change. In Justice Sotomayor’s concurring opinion in United States v. Jones,39United States v. Jones, 565 U.S. 400 (2012) (holding that using a GPS device without a warrant to track an individual’s car through public streets was a violation of his Fourth Amendment rights). she emphasized that in the absence of a physical trespass, a Fourth Amendment search occurs “when the government violates a subjective expectation of privacy that society recognizes as reasonable.”40Id. at 414 (Sotomayor, J., concurring) (quoting Kyllo v. United States, 553 U.S. 27, 31–33 (2001)). She also argued that “it may be necessary to reconsider the premise that an individual has no reasonable expectation of privacy in information voluntarily disclosed to third parties” because “[t]his approach is ill suited to the digital age.”41Id. at 417. Justice Sotomayor’s statements highlight the need to reevaluate the applicability of the current legal framework to new technologies.

Two years later, in Riley v. California,42Riley v. California, 573 U.S. 373, 401 (2014) (holding that a warrantless search of a cell phone conducted incident to arrest violated the Fourth Amendment because “a warrant is generally required before such a search, even when a cell phone is seized incident to arrest”). Justice Roberts acknowledged that because technology enables modern cell phones to contain and potentially reveal a wealth of private information, cell phones require greater privacy protections than would be necessary for a traditional search.43Id. at 403. Four years after Riley, the Court once again addressed warrantless searches in Carpenter v. United States, this time through the collection of cell phone records from a third party.44Carpenter v. United States, 138 S. Ct. 2206, 2211–12 (2018). Again, Justice Roberts recognized the need for stronger privacy protections, stating that “a warrant is required in the rare case where the suspect has a legitimate privacy interest in records held by a third party,” such as the cell site records indicating the defendant’s location and movements.45Id. at 2222. The government had acquired this information pursuant to a court order issued under the SCA, which was obtained based on evidence that the information might be relevant to the ongoing investigation.46Id. at 2221. Finding this burden of proof—requiring only that the information might be relevant, which is lower than the probable cause required to obtain a warrant—to be unacceptable, the Court held that to access these cell site records, a warrant was required.47Id. The differing standards of proof required to obtain warrants and court orders to access records from these new technologies illustrate that sometimes the SCA troublingly affords lesser protections to individuals’ private information.

III.  THE STORED COMMUNICATIONS ACT

The SCA was enacted to regulate electronic and wire communications that are stored on third-party servers48Privacy Rights in the Digital Age 564 (Jane E. Kirtley & Michael Shally-Jensen, eds., 2nd ed. 2019). and therefore governs the interaction between government investigators and administrators of third-party service providers.49Kerr, supra note 31, at 675. It was meant to expand the privacy protections afforded by the Fourth Amendment to digital content, clarifying its applicability. However, the SCA regulates retrospective communications, meaning it only applies when the government seeks to obtain information already in a provider’s possession.50Id. at 675–76. Additionally, the SCA only applies to two types of ISPs: providers of electronic communication service (“ECS”) and providers of remote computing service (“RCS”).51Privacy Rights in the Digital Age, supra note 48, at 565. An ECS is defined as “any service which provides . . . the ability to send or receive wire or electronic communications;”5218 U.S.C. § 2510(15). email and cell phone service providers would therefore be examples of ECS providers. An RCS, on the other hand, is defined as any service that provides to the public “computer storage or processing services by means of an electronic communications system.”53Id. § 2711(2). Thus, once an email has been received but not deleted or a voicemail has been left in storage for later review, email and cell phone services are treated as RCS providers. Because ECS and RCS providers are afforded different levels of protection, it is important to be able to appropriately categorize modern ISPs to determine how much protection users’ communications will be given.

While transmitting communications and storing communications are different functions, this distinction matters less today, as many modern ISPs provide both services. In 1986, however, Congress was concerned about businesses such as hospitals and banks using remote computing services to store records and process data.54S. Rep. No. 99-541, at 3 (1986), as reprinted in 1986 U.S.C.C.A.N. 3555, 3557. Thus, they felt the need to create the RCS category to address this concern.55It is unclear, however, why Congress felt that ECS and RCS communications should be afforded differing levels of protection. Generally, the SCA prohibits disclosure of both content and non-content56Non-content data is information the service provider collects about the subscriber of the service, such as their name and address. data of customer communications, but the SCA provides exceptions to this rule.57Stored Communications Act, 18 U.S.C. §§ 2701–2713. These exceptions, which are discussed below, are divided between § 2702, which regulates voluntary disclosure, and § 2703, which regulates required disclosure.

A. Disclosure of the Contents of Social Media Posts

1. Voluntary Disclosure of Customer Communications

Section 2702(b) details the nine circumstances in which a provider may voluntarily disclose the contents of a customer’s communications.58Id. § 2702(b)(1)–(9). These exceptions include allowing the contents to be disclosed “to an addressee or intended recipient of such communication” and “with the lawful consent of the originator or an addressee or intended recipient of such communication.”59Id. § 2702(b)(1), (3). The other seven instances in which a provider may also divulge the contents of a customer communication are as follows:

as otherwise authorized in section 2517, 2511(2)(a), or 2703 of [Title 18]; . . . to a person employed or authorized or whose facilities are used to forward such communication to its destination; . . . as may be necessarily incident to the rendition of the service or to the protection of the rights or property of the provider of that service; “to the National Center for Missing and Exploited Children, in connection with a report submitted thereto under section 2258A; . . . to a law enforcement agency . . . if the contents . . . were inadvertently obtained by the service provider; and . . . appear to pertain to the commission of a crime; . . . to a governmental entity, if the provider, in good faith, believes that an emergency involving danger of death or serious physical injury to any person requires disclosure without delay of communications relating to the emergency; or . . . to a foreign government pursuant to an order from a foreign government that is subject to an executive agreement that the Attorney General has determined and certified to Congress satisfies section 2523.

Id. § 2702(b)(2), (4)–(9). For the most part, the communications can be disclosed only with the permission of the sender or intended recipient, which protects the user, or without their permission in the case of an emergency, such as a missing child.60Id. § 2702(b)(6). Therefore, while individuals are generally protected against voluntary disclosures of their private information by ISPs, it does not mean that the government is unable to obtain this information; it can be compelled through required disclosure under § 2703.

2. Required Disclosure of Customer Communications

Should the government decide that obtaining an individual’s communications is essential for building a criminal case against them, the disclosure of those communications is governed by § 2703.61See id. § 2703. This is where the largest privacy threat to social media users lies, as ISPs are then legally required to turn over the contents of customer communications to law enforcement. How the government goes about getting this information under § 2703, however, depends on a variety of factors, beginning with whether the ISP is categorized as an ECS or an RCS.

If the government requires information from an RCS, there are three ways for it to compel disclosure.62Id. § 2703(b). First, the government can compel disclosure without notifying the customer if “the governmental entity obtains a warrant issued using the procedures described in the Federal Rules of Criminal Procedure (or, in the case of a State court, issued using State warrant procedures . . . ) by a court of competent jurisdiction.”63Id. § 2703(b)(1)(A). Alternatively, if the government provides notice to the customer, it can compel disclosure by using either (1) “an administrative subpoena authorized by a Federal or State statute or a Federal or State grand jury or trial subpoena;” or (2) “a court order . . . [obtained] under subsection [2703](d).”64Id. § 2703(b)(1)(B). Warrants place a higher burden on the government in order to obtain the requested information, while subpoenas and court orders are more easily obtainable. Thus, allowing the government to choose the second or third method to avoid having to obtain a warrant shifts the burden to the individual, who then must object to the subpoena or court order to protect their private information.

Required disclosure from an ECS, on the other hand, is even more complicated because it also considers information about the age of the communication.65See id. § 2703(a); see also Privacy Rights in the Digital Age, supra note 48, at 565. If the communication is 180 days old or less, the government may only compel disclosure “pursuant to a warrant issued using the procedures described in the Federal Rules of Criminal Procedure (or, in the case of a State court, issued using State warrant procedures . . . ) by a court of competent jurisdiction.”6618 U.S.C. § 2703(a). If the communication is more than 180 days old, however, the government can compel disclosure with either a warrant or, if prior notice is provided, a subpoena or court order.67Id. In effect, this makes it easier for investigators to obtain older communications, with no explanation as to why the 180-day mark is significant; thus, in this situation, users are arbitrarily68I use the word “arbitrarily” because it is unclear why Congress chose 180 days to delineate between stored communications and contemporaneous communications. There is no information in the congressional record to indicate why 180 days was chosen. See S. Rep. No. 99-541 (1986), as reprinted in 1986 U.S.C.C.A.N. 3555. Orin Kerr calls the 180-day rule “strange,” and suggests it was chosen by the drafters because they “figured that unretrieved files not accessed after 180 days ha[d] been abandoned.” Orin S. Kerr, A User’s Guide to the Stored Communications Act, and a Legislator’s Guide to Amending It, 72 Geo. Wash. L. Rev. 1208, 1234 (2004). afforded less protections.

B. Disclosure of the Non-Content Data of Social Media Posts

1. Voluntary Disclosure of Customer Records

Section 2702(a)(3) prohibits ECS and RCS providers from “divulg[ing] a record or other information pertaining to a subscriber to or customer of such service . . . to any governmental entity.”6918 U.S.C. § 2702(a)(3). However, § 2702(c) provides an exception to this rule: “A provider . . . may divulge a record or other information pertaining to a subscriber to or customer of such service . . . as otherwise authorized in section 2703.”70Id. § 2702(c). Therefore, while the SCA prevents ECS and RCS providers from voluntarily disclosing non-content information to governmental entities, as with content, the government can still obtain the information by utilizing § 2703’s required disclosure provision.

2. Required Disclosure of Customer Records

Section 2703(c)(1) states that a governmental entity can require an ECS or RCS provider to disclose a record or other information when the governmental entity “obtains a warrant issued using the procedures described in the Federal Rules of Criminal Procedure (or, in the case of a State court, issued using State warrant procedures . . . ) by a court of competent jurisdiction”; “obtains a court order”; “has the consent of the subscriber or customer”; “submits a formal written request relevant to a law enforcement investigation concerning telemarketing fraud”; or “seeks information” under § 2703(c)(2).71Id. § 2703(c)(1). Section 2703(c)(2) allows ECS and RCS providers to disclose the name; address; telephone connection records (or records of session times and durations); length of service and types of service utilized; subscriber number; and “means and source of payment” when the governmental entity “uses an administrative subpoena authorized by a Federal or State statute or a Federal or State grand jury or trial subpoena or any means available under [§ 2703(c)](1)].”72Id. § 2703(c)(2). Again, governmental entities are able to obtain varying amounts of private information about customers from ECS and RCS providers with either a warrant or a court order, sometimes even with only a subpoena. Even more troubling, § 2703(c) does not require the government entity receiving the records or information to provide notice to the customer.73Id. § 2703(c)(3). Thus, subscribers’ privacy may be being infringed without their knowledge, providing them with fewer opportunities to protect themselves.

IV. SOCIAL MEDIA AND THE STORED COMMUNICATIONS ACT

Prior to 2010, no court had specifically addressed whether social media platforms were within the jurisdiction of the SCA.74Crispin v. Christian Audigier, Inc., 717 F. Supp. 2d 965, 977 (C.D. Cal. 2010). In order for the SCA to apply to social media platforms, these ISPs must be considered either ECS or RCS providers. The District Court for the Central District of California was the first to examine whether social media platforms were ECS or RCS providers in Crispin v. Christian Audigier, Inc.75See id. at 980. The district court held that because the three social media platforms in question provided either private messaging or email services, they qualified as ECS providers.76Id. This demonstrated that the SCA could be applied to social media platforms and can, therefore, be used to control the release of social media communications. While Crispin made it clear that Facebook, Instagram, and Snapchat would be governed by the SCA, it remains unclear whether these platforms qualify as an ECS, an RCS, or both, in the context of specific functions. As a result, which regulations should be applied when the government seeks to obtain users’ content (or non-content) from social media platforms during a criminal investigation remains uncertain.

A. Obtaining Contents of Social Media Posts

1. Obtaining Contents from Private Social Media Accounts

The SCA only applies to communications that are not “readily accessible to the general public.”7718 U.S.C. § 2511(2)(g) (“It shall not be unlawful under [the SCA] for any person . . . to intercept or access an electronic communication made through an electronic communication system that is configured so that such electronic communication is readily accessible to the general public . . . .”). Thus, it is important to understand how a user’s varying privacy settings on social media platforms can affect the applicability of the SCA. Facebook, Instagram, and Snapchat each have varying features that provide users with controls to limit who can see the content they have posted on their individual accounts, in some instances allowing the users to limit who can view individual posts as well, and the ability to block other users from viewing their content.78See Facebook Privacy Basics, Facebook, https://www.facebook.com/about/basics/manage-your-privacy [https://perma.cc/NNR8-NFLX]; Facebook Privacy Basics: Posts, Facebook, https://www.facebook.com/about/basics/manage-your-privacy/posts [https://perma.cc/6Y9P-8V73]; Privacy Policy, Snapchat, https://snap.com/en-US/privacy/privacy-policy [https://perma.cc/J3RW-7NUS]; Meta Privacy Center: Privacy Policy, Instagram, https://privacycenter.instagram.com/policy [https://perma.cc/9THM-CLEX]. Accordingly, should a user want their social media content to be private, they have the ability to set those limits using the social media platform settings.

In Crispin, the court held that “[u]nquestionably, the case law . . . require[s] that [user content] be restricted in some fashion . . . [to] merit protection under the SCA.”79Crispin, 717 F. Supp. 2d at 981; see also Ehling v. Monmouth-Ocean Hosp. Serv. Corp., 961 F. Supp. 2d 659, 666 (D.N.J. 2013) (“Facebook wall posts fall within the purview of the SCA.”). Therefore, if a user sets their content visibility to anything other than public, it qualifies as private. This was confirmed in Ehling v. Monmouth-Ocean Hospital Service Corp., in which the District Court of New Jersey found that “when users ma[d]e their Facebook wall posts inaccessible to the general public, the wall posts [we]re ‘configured to be private’ for the purposes of the SCA.”80Ehling, 961 F. Supp. 2d at 668. Similarly, in Facebook v. Superior Court (Hunter), the Supreme Court of California held that social media posts that were configured to be public fell within § 2702(b)(3)’s lawful consent exception, which allows ISPs to disclose a user’s content with the user’s consent.81Facebook, Inc. v. Superior Court (Hunter), 417 P.3d 725, 728 (Cal. 2018). By this logic, if a user’s content is visible to the public, they are consenting to the RCS provider’s disclosure of their content. The SCA, therefore, does not protect social media content that is posted publicly because consent is an exception to the prohibition of voluntary disclosure under § 2702. The Hunter court also held that “restricted communications sent to numerous recipients cannot be deemed to be public—and do not fall within the lawful consent exception.”82Id. In other words, even if social media communications are limited to a large group of people, that does not mean these posts are considered public. According to the Ehling court, “the critical inquiry is whether Facebook users took steps to limit access to the information . . . . Privacy protection provided by the SCA does not depend on the number of Facebook friends that a user has.”83Ehling, 961 F. Supp. 2d at 668. By restricting one’s content with privacy settings, a social media user can therefore take advantage of the SCA’s privacy protections and make it more difficult for the government to obtain their content—by requiring them to get a warrant, for example—for use in a criminal case, but not all users are that savvy or careful.

Based on this jurisprudence, it should not matter how broad the user’s privacy settings are—as long as the individual specifically took steps to limit who can view their content, it becomes protected from voluntary disclosure. This is not foolproof, however, because, as discussed earlier, disclosure may still be permitted if authorized by § 2703.8418 U.S.C. § 2702(b)(2). This remains problematic because, as Justice Sotomayor stated in Jones, a Fourth Amendment search online occurs when the government violates a “subjective expectation of privacy[,]”85United States v. Jones, 565 U.S. 400, 414 (2012) (Sotomayor, J., concurring) (citing Kyllo v. United States, 553 U.S. 27, 31–33 (2001)). and one could argue that when an individual invokes privacy settings, they reasonably expect that their content will be kept private. If obtaining individuals’ social media data constitutes a search, then under Justice Roberts’s logic in Carpenter, a warrant should be required because social media content can contain lots of information about a person’s day, including their location and movements, like the cell site records in Carpenter. Therefore, it stands to reason that all searches of private social media content should require a warrant, which is not currently the case under the SCA.

2. Social Media: Does Disclosure of Its Content Follow ECS or RCS Regulations?

As previously discussed, the SCA has different standards for an ECS than for an RCS—the government can more easily obtain communications from an RCS, whereas obtaining communications from an ECS depends on how long ago the communications were created, thus emphasizing the importance of properly categorizing each social media platform. In Crispin, the court found that social media platforms can be characterized differently depending on the state of the messages: before the messages have been opened, ISPs operate as ECS providers, but once the messages have been opened and retained, the ISPs operate as RCS providers.86Crispin v. Christian Audigier, Inc., 717 F. Supp. 2d 965, 987 (C.D. Cal. 2010). This creates significant complexity and results in variability in how the SCA is applied to each social media platform, given the different standards between RCS and ECS providers and the difficulty in determining which standard will apply.

The Crispin court acknowledged that Facebook wall posts and MySpace comments “present a distinct and more difficult question” as to whether the social media platforms are acting as ECS or RCS providers.87Id. at 988. On one hand, the court stated that Facebook and MySpace were ECS providers with respect to wall posts and comments because they were being held for “backup purposes once read.”88Id. at 989. Here, the court relied on Snow v. DIRECTV, Inc., in which a district court found that because electronic bulletin board services (“BBS”) did not have temporary, intermediate storage, they were actually storing the information for backup purposes and thus were an ECS.89Id. at 988 (citing Snow v. DIRECTV, Inc., No. 2:04-cv-515-FtM-33SPC, 2005 U.S. Dist. LEXIS 48652 (M.D. Fla. May 9, 2005)). The court analogized Facebook and MySpace wall posts and comments to BBS, concluding that these posts and comments were also being stored for backup purposes since they were not deleted after being read, and thus the social media platforms should be considered ECS providers.90Id. at 989.

On the other hand, the court also said that Facebook and MySpace could be considered RCS providers with respect to wall posts and comments because they maintained these communications not only for storage, but also for display purposes, as users wanted their friends to be able to see the communications.91Id. at 990. The court relied on Viacom International Inc. v. YouTube Inc. in this instance, analogizing Facebook wall posts and MySpace comments to private YouTube videos.92Id. In Viacom, the court found that YouTube was an RCS provider because it stored videos on behalf of its subscribers.93See Viacom Int’l Inc. v. YouTube Inc., 253 F.R.D. 256, 264 (S.D.N.Y. 2008). Thus, the Crispin court concluded that Facebook wall posts and MySpace comments, like YouTube videos, can be stored for the purpose of allowing other users to view the content, thus making Facebook and MySpace RCS providers, like YouTube.94Crispin, 717 F. Supp. 2d at 990. Ultimately, the court did not rule whether Facebook and MySpace were ECS or RCS providers with respect to wall posts and comments, remanding the case for further development.95Id. at 991. This complexity demonstrates how ill-suited the SCA currently is to protect individuals’ privacy on social media platforms, as there is no clear and consistent way to apply it. Further, the arguments made in Crispin emphasize just how arbitrary the distinction between an RCS and ECS provider can be when it comes to social media platforms. Because social media platforms do not fit neatly into either category, courts can come to different conclusions as to how these ISPs should be regulated, thus leading to uncertainty regarding the protection of privacy rights of social media users. This arbitrariness can be explained by the fact that the SCA was written in 1986, as articulated in Konop v. Hawaiian Airlines, Inc.:

[T]he ECPA was written prior to the advent of the Internet and the World Wide Web. As a result, the existing statutory framework is ill-suited to address modern forms of communication like [social media platforms]. Courts have struggled to analyze problems involving modern technology within the confines of this statutory framework, often with unsatisfying results.96Konop v. Hawaiian Airlines, Inc., 302 F.3d 868, 874 (9th Cir. 2002).

The Konop court’s words make clear that the SCA has become outdated because Congress was unable to foresee the problems that would arise for privacy protections resulting from not yet existing communication technologies. This is further supported by the fact that the Crispin court was unable to make a decision regarding the status of Facebook and MySpace with respect to wall posts and comments, given the limitations in clearly and consistently applying the SCA to communications on the various social media platforms.97See Christopher J. Borchert, Fernando M. Pinguelo & David Thaw, Reasonable Expectations of Privacy Settings: Social Media and the Stored Communications Act, 13 Duke L. & Tech. Rev. 36, 56 (2015) (“The Crispin court’s reasoning is both conflicted and irresolute, and thus fails to clarify the SCA’s applicability to communications made via social networking platforms.”). Courts’ inability to readily place certain features of social media platforms into existing categories highlights the inadequacy of the SCA in affording privacy rights to users of the prevalent modern technologies and supports that now is the time to change the SCA to clarify its applicability and afford stronger protections for various types of social media communications by creating more appropriate categories that these ISPs can be classified into.

3. Challenges in Applying SCA Content Disclosure to Stories

Stories are a relatively new feature of social media platforms, having only been in existence since 2013.98Wiltshire, supra note 24. Like with the aforementioned difficulty in generally applying the SCA to social media platforms and user content, Stories, which disappear within twenty-four hours, provide another example that highlights the limited applicability of the current statutory framework under the SCA to modern communication technologies. From a privacy perspective, the good news is that most of these posts are removed from ISPs’ servers as soon as the twenty-four hour period is up.99See What Happens to Content (Posts, Pictures) That I Delete from Facebook?, Facebook, https://www.facebook.com/help/121995105053180 [https://perma.cc/DR4U-RXPJ]; Stories, Instagram, https://help.instagram.com/1660923094227526 [https://perma.cc/8VFX-TH2V]; When Does Snapchat Delete Snaps and Chats?, Snapchat, https://support.snapchat.com/en-US/article/when-are-snaps-chats-deleted [https://perma.cc/2JF3-MJQG]. Since the content is no longer on the social media platform’s server, it is not possible for ISPs to disclose this content—even pursuant to a court order, subpoena, or warrant—because the content would no longer be in storage.100See Ian Hoppe, Does Law Enforcement Have Access to Your Snapchat Photos? A Simple Guide, AL.com (Jan. 13, 2019, 8:19 PM), https://www.al.com/business/2014/11/snapchat_
subpeona.html [https://perma.cc/47K9-WJDP] (“Snapchat will not turn over the content of your past Snapchats because it no longer has access to them. Snapchat couldn’t cooperate with law enforcement even if they wanted to, because, as part of their base operations, the content of messages is not available to them.”).
However, concerns remain for any content that remains saved on the server, which might still be obtainable for criminal investigations under the current SCA.

In addition, both Facebook and Instagram Stories can be saved in Story Archives,101Facebook Stories are “only available to [the] selected audience for 24 hours, but after that they can be saved in [the] story archive.” View Your Facebook Story Archive, Facebook, https://www.facebook.com/help/2241356632587629 [https://perma.cc/25Q3-WL7U]. By saving Facebook Stories to the Story Archive, users can still view their stories even though they are no longer visible to anyone else. Users can turn their Story Archive on or off, though Facebook does not specify what happens to Stories when the archive is turned off. Similarly, Instagram Stories are automatically saved to the Stories Archive unless this setting is turned off. Stories, supra note 99. and Snapchat Stories can be saved in Memories.102Snapchat contains a feature called Memories, which is backed up by Snapchat, that allows users to save Snaps and Stories so that they can be looked back on anytime. Snapchat Support, supra note 99. Therefore, although “Snapchat servers are designed to automatically delete all Snaps after they’ve been viewed by all recipients,” users can still elect to save this content on Snapchats servers. Id. This content, therefore, could feasibly be disclosed to the government under the SCA if the proper exceptions and procedures were met. Because part of the appeal of Stories is that posts are only available for twenty-four hours, users likely do not think about how long their content is maintained in storage. Rather, many incorrectly assume that the content has been permanently deleted when the twenty-four hours expire. The problem here is that if Stories are governed by current ECS rules, once Stories are more than 180 days old, they can be obtained with notice and a subpoena or court order. This goes against the intent underlying Justice Robert’s opinion in Carpenter because one could similarly argue that individuals who post Stories believe they have a reasonable expectation of privacy in these Stories that are now only available for their own view, yet they can, in fact, still be obtained with lesser protections than a warrant. Therefore, even though the SCA was intended to extend the protections of the Fourth Amendment to online communications, currently it does so unsuccessfully, particularly in the case of Stories.

Because Stories are so new, there have not been many cases addressing how the SCA applies to them. In Facebook, Inc. v. Pepe, the District of Columbia Court of Appeals considered an allegedly sent “disappearing Instagram ‘Story’ ” for the first time.103Facebook, Inc. v. Pepe, 241 A.3d 248, 252 (D.C. 2020). The court found that the Instagram Story was content under the SCA, and that because James Pepe was an “addressee or intended recipient” under § 2702(b), Facebook was permitted to disclose any Instagram Stories that were responsive to the subpoena.104Id. at 256. However, this addressee or intended recipient exception would not apply if the government were seeking disclosure in a criminal case, as the individual who posted the Story would likely not have invited a government official to view their private Facebook, Instagram, or Snapchat Story. Thus, the inquiry then shifts to consider whether social media platforms are acting as RCS or ECS providers when it comes to Stories.

One could analogize Stories to Facebook wall posts and MySpace comments when applying the SCA to social media Stories. Following the Crispin court, this would mean that ISPs offering Stories could be considered either RSC or ECS providers. The first argument is that Facebook, Instagram, and Snapchat act as ECS providers when individuals post Stories because the individual is “sending” the electronic communication to the people who they have allowed to view it.105See 18 U.S.C. § 2510(15). This would follow from analogizing Stories to wall posts or comments that are in “backup” storage. As per Crispin, if the messages are being stored on the servers solely because they were not deleted, then they are in backup storage and, thus, should be governed by ECS rules. Unfortunately, users do not usually think about deleting this type of content because they know that once it disappears, no one else can see it. However, what they often fail to realize is that these communications are then considered to be in backup storage, meaning they can still be disclosed to the government under the SCA.

Alternatively, Facebook, Instagram, and Snapchat could be considered RCS providers because they are simply storing the Stories on the server for others to view.106See id. § 2711(2). In Crispin, wall posts were compared to YouTube videos that were stored for the purpose of allowing other users to view the content.107Crispin v. Christian Audigier, Inc., 717 F. Supp. 2d 965, 990 (C.D. Cal. 2010). Arguably, Stories are also stored for the purpose of allowing others to view them, not simply because they have not been deleted. Therefore, even though a Story disappears after twenty-four hours, the user can reshare the content from their Archive, similar to changing a YouTube video’s settings to modify who can view it at any point in time.

On the other hand, Stories could also be analogized to private messages, which further complicates the analysis of SCA protections, particularly when considering the reasoning in Crispin, which stated that when a message is unread, the ISP acts as an ECS, but once the message has been read, the ISP then acts as an RCS.108Id. at 987. Stories can be viewed by whomever the user allows, depending on their privacy settings, meaning that at any given point in time, the Story might have been viewed by a portion, but not all, of the potential audience. Thus, is the Story considered “unread” until all possible viewers have seen it, or does it switch to being “read” once at least one individual has viewed it? Alternatively, a Story could be “sent” while it is available for viewing by others but then switched to “read” once the twenty-four hours are up.

Whether or not a Story is considered to be an ECS or an RCS function directly impacts how law enforcement agencies can obtain its contents since the content of a Story would only be protected with a warrant if it were governed by ECS rules and 180 days old or less. Otherwise, Stories could be obtained with either a subpoena or a court order, making them easier to acquire for criminal investigations. These types of questions have not yet been adequately addressed by courts, and because Stories have qualities of both RCS and ECS communications, it is not possible to consistently predict whether RCS or ECS rules should govern in individual cases. The difficulty in determining how to appropriately apply the SCA to Stories supports the need for the proposed changes to the SCA.

B. Obtaining Non-Content Data From Social Media Posts

1. Applying SCA Non-Content Disclosure to Social Media Platforms

Disclosure of non-content data stored by social media platforms is different from disclosure of content in that non-content disclosure does not depend on whether the provider is an ECS or an RCS. While content is defined as including “any information concerning the substance, purport, or meaning of that communication,”10918 U.S.C. § 2510(8). non-content is not well-defined. The SCA does, however, define some non-content data that can be obtained with only a subpoena, including the user’s name, address, and telephone number.110See id. § 2703(c)(2). This stems from the third-party doctrine, which states “the
Fourth Amendment does not prohibit the [government from] obtaining . . . information revealed to a third party.”111United States v. Miller, 425 U.S. 435, 443 (1976) (holding that a defendant had no expectation of privacy in his bank records because he had disclosed his affairs to his bank when opening his accounts); see also Smith v. Maryland, 442 U.S. 735, 745 (1979) (holding that a defendant had no actual expectation of privacy in the phone numbers he dialed and that even if he did, the expectation was not reasonable). This creates an exception to the reasonable expectation of privacy that is protected by the Fourth Amendment: once an individual voluntarily shares information with a third party, they lose any reasonable expectation of privacy in that information.112Miller, 425 U.S. at 443. It can be assumed, however, that non-content data is any information that is not the main substance of the communication, including the metadata incorporated in the communication, for example, the user’s identity, location, payment information, and telephone number.113“One approach to distinguishing content from non-content is to divide electronic communications into ‘payload’ (content) and ‘delivery instructions’ (non-content).” Chris Conley, Non-Content Is Not Non-Sensitive: Moving Beyond the Content/Non-Content Distinction, 54 Santa Clara L. Rev. 821, 830 (2015) (arguing that information such as the IP address from which a comment on social media is posted is non-content). This is problematic because under § 2703(c), non-content data can sometimes be easily obtained by the government with a court order. Because the SCA does not explicitly state which types of non-content data can be obtained with a court order and which require a warrant, a lot of discretion is left to police officers and the courts.

“Some non-content information, particularly associational information and location information, is inherently expressive, capable of directly exposing intimate details of an individual’s life.”114Id. at 831. In the age of social media, people are constantly posting images and videos online;115On Instagram alone, “[a]t least 95 million photos and videos are posted . . . each day.” Jack Flynn, 30+ Instagram Statistics [2022]: Facts About This Important Marketing Platform, Zippia (May 23, 2022), https://www.zippia.com/advice/instagram-statistics [https://perma.cc/ZCH2-FZ4S]. when people take photos, for example, the image files contain metadata that includes the time and date when the image was taken, along with the exact location where the photograph was taken.116Gurpreet Singh, Understanding Metadata for Photographers, Pixpa (June 23, 2020), https://www.pixpa.com/blog/photo-metadata [https://perma.cc/273F-GNJT]. Facebook, Instagram, and Snapchat collect a lot of information about an individual’s daily life, including sensitive location information.117Meta Privacy Center: Privacy Policy, Meta (July 26, 2022), https://www.facebook.com/
privacy [https://perma.cc/N48L-2EM4] (describing data policies for Facebook and Instagram); Privacy Policy, supra note 78. As of October 2021, Facebook Inc., the company that owns both Facebook and Instagram, changed its name to Meta. Mike Isaac, Facebook Renames Itself Meta, N.Y. Times (Nov. 10, 2021), https://www.nytimes.com/2021/10/28/technology/facebook-meta-name-change.html [https://
perma.cc/WUD4-KFLH]. Thus, the Meta Privacy Policy details the information collected by both Facebook and Instagram. See Michel Protti, Here’s What You Need to Know About Our Updated Privacy Policy and Terms of Service, Meta (May 26, 2022), https://about.fb.com/news/2022/05/metas-updated-privacy-policy [https://perma.cc/YW5H-4A2F] (“The updated Meta Privacy Policy covers Facebook, Instagram, Messenger and other Meta products.”).
Like wireless providers, Facebook, Instagram, and Snapchat are all able to collect individuals’ locations from Bluetooth signals, wireless networks, and cell towers.118See Meta Privacy Center: Privacy Policy, supra note 117; Privacy Policy, supra note 78. Additionally, these platforms also store information such as the location, date, and time at which the photograph or file was created.119See Meta Privacy Center: Privacy Policy, supra note 117; Privacy Policy, supra note 78. This information could be used in a criminal investigation to pinpoint the time and place where a crime occurred or where a suspect was located at a particular time, making it highly valuable for the government when charging someone with a crime.120In United States v. Hart, the court held that “any expectation of privacy a person might have had in non-communication records given to a third party is destroyed upon disclosure, even if he disclosed the information on the assumption that it would be used only for a limited purpose.” United States v. Hart, No. 3:08-CR-00109-C, 2009 U.S. Dist. LEXIS 72597, at *45 (W.D. Ky. July 28, 2009). However, the non-content information that the government obtained included login tracker data, such as the date and time of the user’s last log in, and the user’s IP address, which allowed it to determine the exact location from which the email was sent. Id. at *13. This is troubling because it means that the government can easily obtain non-content information without a warrant and track a defendant’s precise location, which would reasonably require a warrant otherwise. Thus, it is important to afford this information the highest level of protection.

Because social media is a newer phenomenon, most courts have yet to address the issue of obtaining non-content data, which can include time and location information from a social media platform. In In re Application of the United States of America for an Order Pursuant to 18 U.S.C. § 2703(d), a magistrate judge ordered Twitter121Twitter is a social media platform that allows individuals to communicate with family, friends, and the general public through “Tweets,” which can be comprised of text, photos, and videos. See New User FAQ, Twitter, https://help.twitter.com/en/resources/new-user-faq [https://perma.cc/DZE7-JMCE] (describing how Twitter works). to turn over information
pertaining to multiple subscribers; this information included “records
of user activity . . . including the date [and] time” as well as
“non-content information associated with the contents of any communication . . . [including] IP addresses.”122In re Application of the U.S. for an Ord. Pursuant to 18 U.S.C. § 2703(d), 830 F. Supp. 2d 114, 121–22, 130–31, 153 (E.D. Va. 2011). The Virginia district court held that because § 2703(d) requires the government to show only “reasonable grounds” that the records sought are relevant and material to an ongoing criminal investigation, and because the third-party doctrine applies to IP address information, the court order was valid.123Id. at 121–22. The court differentiated IP addresses from beeper monitoring because IP addresses are shared with all internet routers when a user accesses Twitter, while tracking a beeper allowed the government to monitor inside a private residence, which was not otherwise open for visual surveillance.124Id. at 132. While this case clarified what one district court believed the SCA means for IP addresses, it does not help to clarify how the SCA applies to exact location information such as the metadata embedded in Facebook, Instagram, and Snapchat posts.

However, courts have addressed the issue of whether obtaining location information from a wireless carrier constitutes a search under the Fourth Amendment. In Carpenter, the Court held that a court order obtained under § 2703(d) was not a permissible means of acquiring a defendant’s historical cell-site location information (“CSLI”) from a wireless carrier.125Carpenter v. United States, 138 S. Ct. 2206, 2221 (2018). The Court found that individuals have a reasonable expectation of privacy in their physical location, and when the government accessed CSLI from the wireless carriers, it violated the defendant’s reasonable expectation of privacy.126Id. at 2217–19. As a result, the Court held that the government “must generally obtain a warrant supported by probable cause” before acquiring records containing location information.127Id. at 2221.

Because the SCA was intended to extend Fourth Amendment rights to online communications, it might be acceptable to infer that obtaining location information from social media platforms would also require obtaining a warrant supported by probable cause. However, the Carpenter Court articulated that its decision was “narrow” and that it does not “address other business records that might incidentally reveal location information,”128Id. at 2220. which means that the metadata contained in the photos and videos posted on social media may not require the government to obtain a warrant, which could compromise people’s privacy rights. As Justice Sotomayor pointed out in her concurrence in Jones, “it may be necessary to reconsider the premise that an individual has no reasonable expectation of privacy” in the information they disclose online.129United States v. Jones, 565 U.S. 400, 417 (2012) (Sotomayor, J., concurring). “This approach is ill suited to the digital age, in which people reveal a great deal of information about themselves to third parties in the course of carrying out mundane tasks.”130Id. Justice Sotomayor is right: in the digital age, individuals post a wealth of information online that they expect—as a result of their privacy settings—to be visible only to those they choose. Thus, it is time to reconsider the notion that revealing this information to third-party social media platforms means that the government should be able to easily obtain their locational information because there is no “reasonable expectation of privacy.”131Id.

2. Challenges in Applying SCA Non-Content Data Disclosure to Stories

Stories provide users with the unique opportunity to create information that can qualify as both content and non-content data at the same time. When an individual posts their Story online, they are able to add “stickers,” which can indicate to those viewing the Story the exact location of the individual and the date and time the Story was posted, among other things. Thus, when a user posts a location in their social media Story, it actually appears as part of a graphic. In this sense, it would appear to be content because it is part of the image. On the other hand, since it is a location, Instagram will likely also collect that information separately from the content. It would then appear that, in this situation, the location information would be both content and non-content data at the same time; how then should a court determine whether a subpoena, court order, or warrant is required to compel the information from Instagram? Unfortunately, this is unclear under the current statutory framework of the SCA.

Former CIA agent Michael Morell admits that “[t]here’s a lot of content in metadata” and that “[t]here’s not a sharp difference between metadata and content . . . It’s more of a continuum.”132Julian Sanchez, Obama Backs Off Real NSA Reform, Daily Beast (Apr. 14, 2017, 1:04 PM), https://www.thedailybeast.com/obama-backs-off-real-nsa-reform [https://perma.cc/2XQT-DZY4] (quoting Michael Morell). If even the government accepts that it is difficult to distinguish between content and non-content data, then the SCA should not be differentiating between the two and allowing weaker protections for non-content data when, in fact, it may reveal information just as sensitive as content. Because the SCA was created prior to the creation of social media, it does not account for the overlap in the types of information that can be obtained from non-content and content data. This is another reason why the SCA needs to be rewritten: to clarify and remove the ambiguity of how sensitive non-content information can be disclosed.

V. REVISING THE STORED COMMUNICATIONS ACT

A. Requiring Warrants for All Compelled Content Disclosures

While the SCA provides some protections for private communications on ISPs, the statute needs to be updated and better tailored so that it is applicable to all the various nuances of modern technologies. Currently, the strongest protections are afforded to unretrieved emails and other temporarily stored files that are 180 days old or less.13318 U.S.C. § 2703(a); see also Kerr, supra note 68, at 1233 (identifying that only transmissions pending for 180 days or less “receive the protection of a full warrant requirement”). All other communications can be more easily obtained with a subpoena combined with prior notice.13418 U.S.C. § 2703(a), (b)(1)(B). Under the Federal Rules of Criminal Procedure, a subpoena “may order the witness to produce any books, papers, documents, data, or other objects the subpoena designates.”135Fed. R. Crim. P. 17(c)(1). This is even less protective of an individual’s right to privacy than having to obtain a court order, which requires that the “governmental entity offers specific and articulable facts showing that there are reasonable grounds to believe that the contents of a[n] . . . electronic communication . . . are relevant and material to an ongoing criminal investigation.”13618 U.S.C. § 2703(d). To obtain a warrant, on the other hand, there must be “probable cause to search for and seize a person or property.”137Fed. R. Crim. P. 41(d)(1). This places a heavier burden on the government and thus ensures that social media users are not losing their right to privacy without stringent protections, which should be the goal of any such legislation.

Because the line between defining a social media platform as either an ECS provider or an RCS provider is so unclear, applying existing laws can lead to variable results that negatively impact users’ privacy rights. As previously discussed, under the SCA, the same ISP can be treated as an ECS for some functions, but an RCS for others; this leaves users with inconsistencies in the treatment of their personal communications, which can infringe on their privacy. Importantly, whether a social media platform is characterized as an ECS or an RCS has a direct impact on the stringency of the procedures that law enforcement must follow to obtain the content. Further, although the SCA does not specifically differentiate between public and private social media accounts, because the SCA was only intended to cover private communications, it inadvertently creates counterintuitive privacy protections. For example, in Crispin, the court held that opened private messages on Facebook and MySpace were covered by RCS rules, while ECS rules covered restricted wall posts and comments.138Crispin v. Christian Audigier, Inc., 717 F. Supp. 2d 965, 991 (C.D. Cal. 2010). Effectively, this meant that wall posts and comments, which can arguably be seen by all of an individual user’s friends, were afforded greater protections than private messages, which are typically only seen by the sender and the intended recipient. This is counterintuitive because it means that less private communications receive greater protection than more private communications.

Consequently, there is a clear need for Congress to reform the SCA now, and as a first step, require warrants for all communications, regardless of whether an ISP is characterized as an RCS or ECS.139In April 2022, the Warrant for Metadata Act was introduced in the House of Representatives, proposing that warrants be required for ECS and RCS disclosures. Warrant for Metadata Act, H.R. 7553, 117th Cong. (2022). Thus, it is clear that at least part of Congress has recognized the need for tighter restrictions to protect the liberties of U.S. citizens; only time will tell if this bill will pass and the SCA will finally be amended, as amendments have been proposed before with no success. See, e.g., Online Communications and Geolocation Protection Act, H.R. 983, 113th Cong. (2013); Electronic Communications Privacy Act Amendments Act of 2013, S. 607, 113th Cong. (2013); Electronic Communications Privacy Act Amendments Act of 2015, S. 356, 114th Cong. (2015); Email Privacy Act, H.R. 699, 114th Cong. (2016). Warrants provide the strongest protection for social media users, and when it comes to individual liberties, the government has an obligation to preserve these liberties with the broadest legal protections possible.140“No person shall be . . . deprived of life, liberty, or property, without due process of law . . . .” U.S. Const. amend. V. A citizen’s right to liberty is derived from the U.S. Constitution, which means that while the “[g]overnment has an obligation to protect the safety and security of its citizens, . . . it has an equally important responsibility to safeguard the freedoms and liberties that are the cornerstones of American democracy.” Anthony D. Romero, In Defense of Liberty at a Time of National Emergency, ABA: Hum. Rts. Mag. (Jan. 1, 2002), https://www.americanbar.org/groups/crsj/publications/human_
rights_magazine_home/human_rights_vol29_2002/winter2002/irr_hr_winter02_romero [https://perma.cc/
9AUZ-CY8D].
This is especially important considering the case law, which argues that individuals have a right to be protected under the SCA if they took steps to protect their content.141Ehling v. Monmouth-Ocean Hosp. Serv. Corp., 961 F. Supp. 2d 659, 668 (D.N.J. 2013). By requiring warrants for the disclosure of all social media communications, the SCA would be able to provide the strongest statutory framework to protect users’ privacy and prevent the unjust use of their social media content against them in criminal court.

B. Removing the Differentiation Between RCS and ECS

The previously highlighted variability and liability in characterizing social media platforms as RCS providers in some instances and ECS providers in others has become even more problematic with the recent emergence of social media Stories. If Stories are analogized to emails or private messages—because the user posts the Story with the intention that others will see it and it will be gone shortly after the message is read—they would be governed by ECS rules, similar to the private messages in Crispin.142Crispin, 717 F. Supp. 2d at 980. Alternatively, Stories considered analogous to YouTube videos—because they are stored for only a limited number of people to view—would be governed by RCS rules.143Viacom Int’l, Inc. v. YouTube Inc., 253 F.R.D. 256, 264 (S.D.N.Y. 2008). The courts have yet to address whether Stories should be governed by ECS or RCS rules, but there are arguments for both sides because Stories do not fit neatly into either category.

Because the SCA was not created to accommodate these newer technologies, it would be more effective to revise the SCA categories rather than attempting to fit new technologies into the existing categories. Because social media platforms offer various functions that involve both message transmissions and electronic storage, the language of the SCA needs to be amended to eliminate the distinction between RCS and ECS altogether. Orin Kerr suggested doing this by identifying that the SCA applies only to “network service providers,” which would encapsulate the current definitions of ECS and RCS and then apply the SCA rules to different types of files held by the network service providers.144Kerr, supra note 68, at 1235. This would alleviate the difficulty of determining which rules apply to social media providers in different situations and would further clarify privacy rights for users by establishing when and how their content is protected. Importantly, this would also provide consistency and give users a better understanding of their rights online, which may, in turn, influence what information they choose to post on social media—especially if they know it could later be used against them in a criminal case. Without this clarity, social media users do not know whether their content is protected and what steps they need to take to protect their private communications, which may, consequently, have a “chilling effect”145A chilling effect is “[t]he result of a law or practice that seriously discourages the exercise of a constitutional right.” Chilling Effect, Black’s Law Dictionary (11th ed. 2019). The constitutional right affected here would be the freedom of speech, as social media users are expressing the right to speak freely when they post content online. on their conduct.

C. Requiring Warrants for All Compelled Non-Content Data Disclosures

As technology has grown and evolved, the distinction between content and non-content data has continued to blur. This is particularly true when individuals include the date, time, and location of their posts in the actual post or Story. When Facebook, Instagram, and Snapchat collect that information, it becomes non-content data, some of which can be disclosed pursuant to only a subpoena, and some of which requires either a court order or a warrant. One way to address this issue would be to require warrants for all compelled disclosures of non-content data. This is in line with the suggestion to require warrants for all compelled disclosures of content.

By requiring warrants for compelled disclosures of non-content data, criminal investigators would then have to show probable cause before obtaining the information, which is the highest standard available. In Carpenter, the Court acknowledged that individuals have a reasonable expectation of privacy regarding their physical location.146Carpenter v. United States, 138 S. Ct. 2206, 2217 (2018). Unlike cell-site records, social media platforms do not collect information on users every time their phone pings a cell tower. Instead, locations are collected when individuals post to social media. Therefore, it is currently unclear whether location information would always be protected by a warrant under the SCA.147In 2013, the 113th Congress proposed the Online Communications and Geolocation Protection Act. This proposed amendment to the SCA included prohibitions on the disclosure of geolocation information to governmental entities. Online Communications and Geolocation Protection Act, H.R. 983, 113th Cong. (2013); see also Dell Cameron, New Bill Would Halt Warrantless Requests for Consumers’ Geolocation Data, Daily Dot (May 29, 2021, 3:18 PM), https://www.dailydot.com/debug/online-communications-geolocation-protection-act [https://perma.cc/V8BT-B3S3] (stating that the lawmakers said that “the ECPA in its current form offers inadequate protections to Americans who rely heavily on mobile devices operating location-based services”). Thus, Congress is aware that the SCA does not adequately protect against disclosure of non-contents containing location information. Although the bill was proposed, it was never passed and thus the problem remains.

While it is true that some non-content data records reveal more than others, advances in metadata analysis have shown that assembling disparate pieces of metadata can lead to larger discoveries. Thus, although one might argue that it would be better to specify which types of records require a subpoena, which require a court order, and which require a warrant, this practice would be difficult to consistently implement.148See Kerr, supra note 12, at 413 (“Identifying the proper particularity standard for noncontent information is difficult because such records exist in many different forms . . . . A list of every email address that a person emailed, together with the time each email was sent, is more sensitive than merely the name on the account.”). Rewriting the SCA to guarantee that such non-content metadata is protected by the highest protection affordable would ensure that social media users are provided their First Amendment rights.

D. Removing the Distinction Between Content and Non-Content Data

Perhaps a simpler solution to this problem of differentiating between content and non-content data would be to eliminate the distinction altogether. The distinction comes from Ex parte Jackson, in which the Court held that “a distinction is to be made between different kinds of mail matter,—between what is intended to be kept free from inspection, such as letters . . . and what is open to inspection, such as . . . printed matter, purposely left in a condition to be examined.”149Ex parte Jackson, 96 U.S. 727, 733 (1878). The Court held that mail can only be opened and examined under a warrant because otherwise it would constitute an illegal search.150Id. Thus, content is what is “intended to be kept free from inspection,” as it is sealed away, and non-content data is what is left in the open.

When the Court first created this distinction in Ex parte Jackson, it made sense to differentiate between the information on the outside of an envelope, which could be openly seen by others, and the content that was stored within an envelope. However, trying to apply that logic to social media now no longer makes sense because the distinction between content and non-content data has become so blurred. For example, when a user posts a picture of their dog on their Instagram profile, they can include a geotagged location to where the photograph was taken. Is the location still non-content data because it is not the “substance” of the post, or is the location content because the user is using it to indicate where the picture was taken and, therefore, it is part of the description? If the latter were true, it would then arguably be content.

If the same information can be considered both content and non-content, it does not make sense to allow law enforcement to obtain the same information with lesser protections solely because they can argue that it is non-content data. Eliminating the distinction between non-content and content data would remove the uncertainty and enable social media users to be confident that all aspects of their posts would be protected.

CONCLUSION

The Ninth Circuit had it right when it said, “until Congress brings the laws in line with modern technology, protection of the Internet and websites such as [social media platforms] will remain a confusing and uncertain area of the law.”151Konop v. Hawaiian Airlines, Inc., 302 F.3d 868, 874 (9th Cir. 2002). Social media platforms, as a whole, do not fit nicely into the existing ECS and RCS categories that Congress created when drafting the SCA in 1986. Some functions of social media platforms lead to the platform being treated as an ECS, while other functions lead to the platform being treated as an RCS. In other instances, it is difficult to determine whether a specific function indicates that the social media platform is acting as an ECS or an RCS. As a result, the SCA can be inconsistently applied to disclosures of social media content. Most importantly, certain functions on social media are arbitrarily afforded stricter protections than others, solely because of how they are inconsistently categorized under the current SCA. The rationale for affording communications greater protections when they are classified as an ECS that is 180 days old or less versus the fewer protections afforded to an ECS that is more than 180 days old or as an RCS is unclear. As a result of these arbitrary distinctions, law enforcement has an easier time searching an individual’s private social media, which may only require a subpoena or court order, than it would going through someone’s diary, which requires a warrant.

Further complicating the application of the SCA to social media today is the fact that in the age of social media, it is becoming more difficult to distinguish content from non-content data. When Congress drafted the SCA, it attempted to apply the Fourth Amendment to online communications and therefore made a distinction between content and non-content data; however, the difference between what constitutes content—analogous to what is contained inside an envelope—and non-content—analogous to what is on the outside of an envelope—in the digital context has become difficult to discern.152See Ex parte Jackson, 96 U.S. 727, 733 (1878). Courts have also considered the third-party doctrine when determining what information could be obtained with a subpoena, reasoning that because the information had been disclosed to a third party, the user had no reasonable expectation of privacy. However, social media users disclose a variety of personal information when signing up for an account, often including, at a minimum, their name, birthdate, and email address, and their posts include lots of additional metadata. The privacy of these data is critical to define because they can be used by law enforcement to piece together where an individual was at the time they posted to social media or where an individual was when the content they posted was retrieved. Whether this very sensitive information should require a warrant or a lesser means to be retrieved by law enforcement is not currently clearly defined in the SCA.

The ECPA—which includes the SCA—was enacted to protect citizens from having their electronic communications intercepted without the proper authorization, but these protections need to change in response to evolving communication technologies. This legislation was intended to extend Fourth Amendment protections to new technologies, but because social media technologies have evolved so rapidly since 1986, the SCA no longer truly affords the intended protections. For citizens to be protected against unreasonable searches of their digital media, Congress needs to restructure the existing legislation to properly address how communication technologies have evolved over the past thirty-six years. Not only can one social media platform function as both an ECS and an RCS provider under the current SCA definitions, but it is now also difficult to determine whether a specific social media function, such as Stories, which has properties of both, should be governed by ECS or RCS rules. Further, there is now duplication of content and non-content data, making it difficult to clearly differentiate them and ensure that all of this personal information is being adequately protected under the SCA.

To ensure the protection of constitutional privacy rights and prevent private social media communications from being unfairly used against their creators in court, Congress should require that all compelled disclosures be governed by the same rules as the Fourth Amendment; that is, it should require that there be a warrant and “probable cause.”153U.S. Const. amend. IV. If all compelled disclosures were to require a warrant, then equal protections would be applied in all situations, as the standard would be consistent across physical and digital searches; this would help ensure that defendants’ due process rights were not violated. Further, because the distinctions between an ECS and RCS, as well as content and non-content data, are no longer appropriate, it would be advantageous for Congress to revise the SCA to better align with modern technologies by drawing the necessary delineations based on the functions being used, not on the specific type of provider. This way, the SCA would not only better apply to modern technology, but it would hopefully also better apply to future emerging technologies.

 

96 S. Cal. L. Rev. 707

Download

Executive Senior Editor, Southern California Law Review, Volume 96; J.D. Candidate 2023, University of Southern California Gould School of Law; M.S. Clinical Research Methods 2020, Fordham University; B.A. Psychology 2015, New York University. My thanks to my parents, Marlene and Lee Allen, and Jennifer Guillen for their input and support throughout the note-writing process. I would also like to thank my Note advisor, Professor Eileen Decker, for her guidance, and the members of the Southern California Law Review for their hard work and thoughtful suggestions.

Battle of the Opinions: Conflicting Interpretations of False Opinions and the Falsity Standard Under the False Claims Act

Congress has let loose a posse of ad hoc deputies to uncover and prosecute frauds against the government . . . . [Bad actors] may prefer the dignity of being chased only by the regular troops; if so, they must seek relief from Congress.1United States ex rel. Milam v. Univ. of Tex. M.D. Anderson Cancer Ctr., 961 F.2d 46, 49 (4th Cir. 1992).

INTRODUCTION

What most people probably do not realize is that approximately ten percent of all government spending is lost to fraud, which amounts to hundreds of billions of dollars annually.2Joel D. Hesch, It Takes Time: The Need to Extend the Seal Period for Qui Tam Complaints Filed Under the False Claims Act, 38 Seattle U.L. Rev. 901, 901 (2015). It should be of no surprise then that public attitudes toward government spending are mixed.3See generally William G. Jacoby, Public Attitudes Toward Government Spending, 38 Am. J. Pol. Sci. 336 (1994) (exploring the nature, sources, and consequences of citizens’ attitudes toward government spending). With the recent COVID-19 pandemic, government spending and the number of fraudulent schemes have both reached unprecedented levels.4See Press Release, U.S. Dep’t of Just., Justice Department Takes Action Against COVID-19 Fraud (Mar. 26, 2021) [hereinafter COVID-19 Fraud], http://www.justice.gov/opa/pr/justice-department-takes-action-against-covid-19-fraud [http://perma.cc/6R6K-YCJW]. This alone is quite alarming from a policy perspective. Furthermore, in combatting this widespread fraud, the government has had to consider an important legal issue, which also happens to be a philosophical concern that permeates life and introduces uncertainty into the legal system.

The distinction between fact and opinion seems quite obvious, but there is more to this dichotomy than meets the eye. Most individuals intuitively understand that facts have an objective basis in reality whereas opinions are merely one’s own subjective interpretation of some matter. It follows that facts can be proven or disproven using an objective metric and that facts can reinforce or contradict any given claim. But what about opinions? Can they be “true” or “false” in the same sense? Can the substance of their truth be invalidated by other opinions? Do opinions gain an elevated legal status if they inevitably result in life-or-death consequences for another individual?

The circuit courts have recently grappled with these difficult questions in the context of Medicare-related claims under the False Claims Act (“FCA”), a civil anti-fraud statute.5See John T. Boese & Douglas W. Baruch, Civil False Claims and Qui Tam Actions
1-5 (5th ed. 2022). See generally 31 U.S.C. §§ 3729–3733 (creating liability for individuals who engage in fraudulent acts against the government).
To prevail on an FCA claim, plaintiffs must prove, inter alia, falsity; that is, the defendant made a false claim for government payment.631 U.S.C. § 3729. The FCA, in its current iteration, does not provide guidance on the standard for proving falsity.7See id. §§ 3729–3733. Normally, this would not present an issue because “absent other indication, ‘Congress intends to incorporate the well-settled meaning of the common-law terms it uses.’ ”8Universal Health Servs., Inc. v. United States, 579 U.S. 176, 177 (2016) (quoting Sekhar v. United States, 570 U.S. 729, 732 (2013)). However, claims for government payment or reimbursement are sometimes based only on a subject matter expert’s evaluation. This is particularly true in the medical field, where doctors are required to treat patients using their clinical judgments.9See infra Section I.C. Thus, proving falsity in these cases necessarily entails disproving expert opinion. Given the subjective nature of opinions, common-law developments have not been uniform, and circuit courts have entrenched themselves on different sides of the aisle.10Compare United States v. AseraCare, Inc., 938 F.3d 1278, 1281 (11th Cir. 2019) (holding that an objective falsehood standard is proper), with United States v. Care Alts., 952 F.3d 89, 91 (3d Cir. 2020) (ruling against an objective falsehood standard), cert. denied, 141 S. Ct. 1371 (2021).

On one side are circuit courts that believe that the FCA requires proof of an “objective falsehood.”11See, e.g., AseraCare, 938 F.3d at 1281. This seems to be the traditional interpretation, with many courts at the district and appellate levels dismissing plaintiffs’ claims when they failed to establish that a defendant’s representation was objectively false.12See infra Section I.D, Appendix A. Most recently, the Eleventh Circuit, in United States v. AseraCare, Inc., considered when the hospice provider certifications regarding a patient’s “terminally ill” status can be considered false under the FCA.13AseraCare, 938 F.3d at 1281. In its holding, the court determined that claims cannot be false based on “a reasonable disagreement between medical experts.”14Id.

Approximately six months after the Eleventh Circuit’s ruling, the Third Circuit, in United States v. Care Alternatives, explicitly rejected the objective falsity standard in favor of a subjective falsity standard, whereby expert testimony challenging a physician’s judgment can be adequate evidence of falsity.15Care Alts., 952 F.3d at 91. The Ninth Circuit seemingly followed suit in Winter ex rel. United States v. Gardens Regional Hospital & Medical Center, Inc. when it proclaimed that a party stating an FCA claim does not need to plead an objective falsehood.16Winter ex rel. United States v. Gardens Reg’l Hosp. & Med. Ctr., Inc., 953 F.3d 1108, 1113 (9th Cir. 2020), cert. denied sub nom. RollinsNelson LTC Corp. v. United States ex rel. Winters, 141 S. Ct. 1380 (2021). The defendants in both cases petitioned the Supreme Court for writs of certiorari; unfortunately, on February 22, 2021, the Court rejected the petitions without comment, leaving the question unaddressed and prolonging the circuit split.17Care Alts., 141 S. Ct. 1371; RollinsNelson, 141 S. Ct. 1380.

This Note explores the aforementioned circuit split and scrutinizes the decisions under various frameworks given the statutory gap regarding falsity under the FCA. In doing so, it will consider relevant common law guidance and regulations and focus on the courts’ adherence to precedent and principles. Few doctrinal analyses on the falsity element of the FCA have been conducted,18Most prior noteworthy analyses have explicated objective falsity through a healthcare lens. See, e.g., Sebastian West, Proof of Objective Falsehood: Liability Under the False Claims Act for Hospice Providers, 90 U. Cin. L. Rev. 328, 328 (2021) (arguing that when narrowly tailored to hospice-related claims under the FCA, the objective falsity standard adopted by the Eleventh Circuit is the correct interpretation but fails to sufficiently guide the lower courts); Elizabeth A. Caruso, Comment, Hospice Care’s Adventures in Fraudland: “Battle of the Experts” & Proving Falsity Under the False Claims Act, 62 B.C. L. Rev. E. Supp. 21, 38–42 (2021) (advocating for objective falsity in hospice certification claims because it aligns with Supreme Court precedent and the Centers for Medicare & Medicaid Services’ intent for the Medicare hospice benefit); Jameson Steffel, End of Life Uncertainty: Terminal Illness, Medicare Hospice Reimbursement, and the “Falsity” of Physicians’ Clinical Judgments, 89 U. Cin. L. Rev. 779, 780 (2021) (concluding that the Eleventh Circuit’s approach is the correct legal and policy interpretation with regards to Medicare-related false claims); Bryce T. Daniels, A Tale of Two Falsities: Objective Falsity and Common-Law Falsity in the False Claims Act 2 (Aug. 1, 2021) (unpublished manuscript), https://papers.ssrn.com/sol3/papers.cfm?abstract_id=3922788 [https://perma.cc/QL6J-WAS2] (claiming that the objective falsity standard should be disfavored because medical opinions, whether construed as opinions or statements of fact, are falsifiable in many contexts). Some articles maintain that there is actually no circuit split over doctors’ FCA liability. See, e.g., Jenna L. Schaffer, Note, Not Quite What the Doctor Ordered: The Third Circuit Pulls the Plug on Objective Falsity in United States Ex Rel. Druding v. Care Alternatives, 67 Vill. L. Rev. 167, 171 (2022) (suggesting that the Eleventh Circuit’s ruling merely “created the perception of a circuit split—even though a split may not actually exist”); Matthew Gill, There Is No Circuit Split Over Doctors’ FCA Liability, Law360 (May 18, 2020), http://
http://www.porterwright.com/content/uploads/2020/05/Law360-There-Is-No-Circuit-Split-Over-Doctors-FCA-
Liability.pdf [http://perma.cc/G866-ATA7].
and to my knowledge, this is the one of the first to propose that (1) the recent disagreement over objective falsity is a nontraditional three-way circuit split, and (2) the falsity standard needs to be flexible to accommodate various controlling regulations and statutes. This Note then argues that the Ninth Circuit has correctly elucidated the issue: courts should not focus on the objective or subjective falsehood standard but rather on the context and circumstances of each case.

Part I of this Note provides a foundational understanding of the FCA, the healthcare industry, and falsity in common law contexts. This includes the FCA’s legislative history, qui tam claims, statistics regarding recovery, medical decision-making, Medicare hospice benefit (“MHB”), and history of objective falsity cases. Part II discusses prior Supreme Court and appellate decisions that provide a useful framework to analyze the circuit split. Part III analyzes the three central cases that have contributed to the recent circuit split: United States v. AseraCare, Inc.,19United States v. AseraCare, Inc., 938 F.3d 1278, 1278 (11th Cir. 2019). United States v. Care Alternatives,20Care Alts., 952 F.3d at 89. and Winter ex rel. United States v. Gardens Regional Hospital and Medical Center, Inc.21Winter, 953 F.3d at 1108. Part IV recommends that courts analyze falsity under the Tenth Circuit and Supreme Court’s common law test defined in United States ex rel. Polukoff v. St. Mark’s Hospital and Omnicare, Inc. v. Laborers District Council Construction Industry Pension Fund.22Omnicare, Inc. v. Laborers Dist. Council Constr. Indus. Pension Fund, 575 U.S. 175, 184–89 (2015); United States ex rel. Polukoff v. St. Mark’s Hosp., 895 F.3d 730, 741 (10th Cir. 2018). One report has previously suggested that courts should simply apply the Omnicare test. See Robert Salcido, When Can Opinions be “False” and Result in False Claims Act Liability: Three Circuit Courts Provide Conflicting Guidance, Salcido Rep.: False Claims Act Pub. Disclosure Alert (Nov. 24, 2020), http://
http://www.akingump.com/en/news-insights/when-can-opinions-be-false-and-result-in-false-claims-act-liability-three-circuit-courts-provide-conflicting-guidance.html [http://perma.cc/URM5-RK3L].
Part IV also argues that FCA-intersecting statutes and regulations have impliedly allowed for both objective and subjective falsity standards to exist. Furthermore, Part IV suggests that the issue may be more efficiently addressed by the legislature than the courts and contextualizes the problem within the broader whistleblower policy debate.

I. BACKGROUND

A. The False Claims Act and Its Legislative History

Originally enacted in 1863 at the request of President Abraham Lincoln,23132 Cong. Rec. H22,339 (daily ed. Sept. 9, 1986) (statement of Rep. Berman). the FCA is America’s first whistleblower law and currently one of the strongest whistleblower laws in the United States.24False Claims Act (Qui Tam) Whistleblower FAQ, Nat’l Whistleblower Ctr. [hereinafter Whistleblower FAQ], http://www.whistleblowers.org/faq/false-claims-act-qui-tam [http://perma.cc/
XNA2-T3MB].
The FCA allowed the federal government to combat widespread fraud committed by defense contractors against the Union Army during the American Civil War.25See Boese & Baruch, supra note 5 (describing Congress’s motivation in enacting the FCA). In a congressional session statement, Senator Jacob Howard noted that “shells for the use of the Army . . . have been filled not with the proper explosive materials . . . but with saw dust” and that “[a]rms have been supplied which, on examination and use, have turned out to be useless and valueless.”26Cong. Globe, 37th Cong., 3d Sess. 955 (1863) (statement of Sen. Jacob Howard). The original Act contained criminal and civil penalties for wrongdoers.27Pamela H. Bucy, Private Justice and the Constitution, 69 Tenn. L. Rev. 939, 945 (2002) (explaining that the penalties were separated in 1874 and the criminal portion can now be found at 18 U.S.C. § 287). But the crucial feature of the Act that allows for its effective implementation is the qui tam provision, which enables private citizens to bring suits on behalf of the government; this essentially “empower[s] . . . ordinary citizens to act as private attorneys general.”28Christopher C. Frieden, Protecting the Government’s Interests: Qui Tam Actions Under The

False Claims Act and the Government’s Right to Veto Settlements of Those Actions, 47 Emory L.J. 1041, 1041 (1998). Claimants in these qui tam actions, known as the “relators,” are incentivized by the fact that they receive a portion of the recovered damages.29See Isaac B. Rosenberg, Raising the Hue . . . and Crying: Do False Claims Act Qui Tam Relators Act Under Color of Federal Law?, 37 Pub. Cont. L.J. 271, 276–78 (2008). Relator is the term found in the FCA statute because the term whistleblower was not in use at the time of statutory enactment.30Whistleblower FAQ, supra note 24. Although the two terms are synonymous, courts and parties often prefer to use the term relator.31See id. Congress believed that it was necessary to “set[] a rogue to catch a rogue” due to the resource constraints that the government would have faced if it investigated and inquired into every business dealing involving its contractors.32See Cong. Globe, 37th Cong., 3d Sess. 956 (1863) (statement of Sen. Jacob Howard). Senator Howard declared that this provision was “the safest and most expeditious way I have ever discovered of bringing rouges to justice.”33Id. Those convicted under the original version of the statute were liable for double the government’s damages in addition to a $2,000 penalty for each false claim.34The False Claims Act: A Primer, U.S. Dep’t of Just. (Apr. 22, 2011) [hereinafter Primer], http://www.justice.gov/sites/default/files/civil/legacy/2011/04/22/C-FRAUDS_FCA_Primer.pdf [http://
perma.cc/NRM2-8KWD].
Relators would have received fifty percent of the total damages.35Charles Doyle, Cong. Rsch. Serv., R40785, Qui Tam: The False Claims Act and Related Federal Statutes 6 (2021).

Nonetheless, since its inception, the FCA has been amended by Congress several times. Given that the Act was made for the purposes of deterring fraudulent profiteers of war while rewarding those who were upstanding, it was only fitting that the statute would be abused and tested during a subsequent major conflict, World War II.36See James B. Helmer Jr., False Claims Act: Incentivizing Integrity for 150 Years for Rogues, Privateers, Parasites and Patriots, 81 U. Cin. L. Rev. 1261, 1267 (2013). Then Attorney General Francis Biddle pursued criminal action against a host of defense contractors using the criminal provision of the FCA.37See id. Concurrently, groups of petitioners filed civil complaints against the same contractors and undoubtedly attempted to piggyback off the government’s work in the hopes of gaining a piece of the settlement.38See id. at 1267–68. This parasitic exploitation of the Act did not go unnoticed, and Congress amended the FCA in 1943.39Doyle, supra note 35, at 7–8. The amendment reduced the relator’s guarantee of fifty percent of recovered damages to a maximum of ten percent.40False Claims Act of 1943, Pub. L. No. 78-213, 57 Stat. 608, 609 (1943). The recovery limit for relators was also capped at twenty-five percent in cases in which the United States did not join.41Id. Most importantly, Congress removed relators’ ability to file suits if “the United States, or any agency, officer or employee thereof” possessed evidence or information of the fraud.42Id. This alteration single-handedly eliminated the majority of qui tam FCA cases.43See Helmer Jr., supra note 36, at 1270.

Approximately forty years later, Congress caught wind of reports of rampant fraud committed by federal contractors.44See 131 Cong. Rec. 17818 (1985) (statement of Rep. Weiss). In 1986, the FCA experienced almost a complete reversal of the strict prohibitions which chilled qui tam cases. The “any prior government knowledge” proscription was replaced with the substantially less restrictive “public disclosure of allegations or transactions” qualification.45Compare False Claims Act of 1943, Pub. L. No. 78-213, 57 Stat. 608, 609 (1943) (creating strict prohibitions), with False Claims Act of 1986, Pub. L. No. 99-562, 100 Stat. 3153, § 3 (1986) (loosening of such restrictions). In addition, recovery for successful relators increased marginally, and liability for perpetrators of fraud increased from double damages to treble damages.46Id. at § 2.

The most recent iteration of the FCA occurred in 2009, when Congress made a somewhat subtle amendment to the statute which limited the scope of claims encompassed by the FCA.47Doyle, supra note 35, at 9. A “material to a false or fraudulent claim” element was added.48Fraud Enforcement and Recovery Act of 2009, Pub. L. No. 111-21, 123 Stat. 1617 (2009). In essence, the wording of the prior FCA iteration allowed one of the critical elements to be met if the government simply paid or approved a fraudulent claim. The new requirement, however, adds a materiality aspect; that is, the government’s decision to pay or approve a claim must have been predicated on a falsity.

This current version of the FCA specifically penalizes, among other offenses, (1) knowingly presenting, or causing to be presented, a false or fraudulent claim for payment,4931 U.S.C. § 3729(a)(1)(A). and (2) knowingly making, using, or causing to be made or used, a false record or statement material to a false or fraudulent claim.50Id. § 3729(a)(1)(B). FCA claims are broken down into the following requirements: falsity, causation, knowledge, and materiality.51United States v. Care Alts., 952 F.3d 89, 94 (3d Cir. 2020), cert. denied, 141 S. Ct. 1371 (2021). The statute provides functional definitions for knowledge but offers no guidance on the definitions of falsity.52See 31 U.S.C. § 3729(b)(1). The knowledge requirement includes (1) actual knowledge that the claim or information was false, (2) deliberate ignorance of the truth or falsity of the information, or (3) a reckless disregard of the truth or falsity of the claim or information. Id.

B. The False Claims Act in the Twenty-First Century

The importance of the FCA in combatting fraud in the twenty-first century should not be underestimated. Approximately ten percent of all government spending is lost to fraud.53Hesch, supra note 2. During fiscal year 2020, the government spent over $6 trillion dollars.54Press Release, U.S. Dep’t of the Treasury, Mnuchin and Vought Release Joint Statement on Budget Results for Fiscal Year 2020 (Oct. 16, 2020), http://home.treasury.gov/news/press-releases/sm1155 [http://perma.cc/5QX5-PAE3]. Accordingly, the government stands to lose hundreds of millions of dollars each year. Like cases in other areas of law, the majority of FCA cases settle or are dismissed before trial.55See Strategic Budgeting Can Result in Early Resolution of False Claims Act Cases, Jones Day (Aug. 2018) [hereinafter Strategic Budgeting], http://www.jonesday.com/en/insights/2018/08/strategic-budgeting-can-result-in-early-resolution [http://perma.cc/3MEP-R6CZ]. Nonetheless, the number of FCA cases and associated monetary payments have substantially amplified in recent years. More than 4,000 new cases have opened since 2015.56Fraud Statistics – Overview: October 1, 1986–September 30, 2020, Civ. Div. U.S. Dep’t of Just. (Jan. 14, 2021) [hereinafter Fraud Statistics], http://www.justice.gov/opa/press-release/file/
1354316/download [http://perma.cc/T74F-E6FG].
In 2020 alone, qui tam relators and the government filed 922 new FCA suits and subsequently obtained more than $2 billion dollars in recovery and settlements.57Id. With the onset of the COVID-19 pandemic, the Department of Justice has already begun investigating and prosecuting the spike in COVID-19 recovery-related programs.58See COVID-19 Fraud, supra note 4; Press Release, U.S. Dep’t of Just., Eastern District of California Obtains Nation’s First Civil Settlement for Fraud on Cares Act Paycheck Protection Program (Jan. 12, 2021), http://www.justice.gov/usao-edca/pr/eastern-district-california-obtains-nation-s-first-civil-settlement-fraud-cares-act [http://perma.cc/H8AB-FEH8]. Fraud cases are more prevalent now than ever, and the FCA creates a necessary foundation with which to combat these issues.

Although historically used to uncover and deter military-based fraud against the federal government, the FCA in the current era has undergone a drastic shift, not based on the substance of law but rather due to policy shifts in healthcare law. The rapid expansion of the healthcare sector and burgeoning government programs are likely responsible for this shift.59See National Health Expenditure Data: Historical, Ctrs. for Medicare & Medicaid Servs., http://www.cms.gov/Research-Statistics-Data-and-Systems/Statistics-Trends-and-Reports/
NationalHealthExpendData/NationalHealthAccountsHistorical [http://perma.cc/9J2N-7LP8]; Nicole Forbes Stowell, Carl Pacini, Nathan Wadlinger, Jaqueline M. Crain & Martina Schmidt, Investigating Healthcare Fraud: Its Scope, Applicable Laws, and Regulations, 11 Wm. & Mary Bus. L. Rev. 479 (2020) (describing the healthcare landscape and prevalence of healthcare fraud).
Over eighty percent of fraud cases against the government are now related to healthcare.60Fraud Statistics, supra note 56. Furthermore, healthcare-related FCA cases account for more recovery than FCA recovery from all other sectors combined.61Id.

C. Medical Decision-Making and Medicare Hospice Benefits

Given that all three cases contributing to the circuit split concern Medicare-related fraud, a general discussion of fraud within the medical practice area is warranted. Fraud in the medical industry is not novel. In particular, the Federal Bureau of Investigation has noted that health care fraud causes several billions of dollars in losses each year.62White-Collar Crime: Health Care Fraud, Fed. Bureau of Investigation, http://www.fbi.
gov/investigate/white-collar-crime/health-care-fraud [http://perma.cc/L86N-LD5U].
Although there are a variety of factors that contribute to the prevalence of health care fraud, the subjectivity inherent in medical decision-making is a prominent one.63See infra note 72 and accompanying text. Relatedly, it is quite possible that doctors undertreat patients due to fears of FCA liability. However, no literature has studied this specific issue. Nonetheless, this theory is not unfounded because there have been instances of lawsuits for the undertreatment of pain, indicating that physicians are capable of undertreating patients in various circumstances. See, e.g., Doctor Tagged with $1.5m Verdict in Landmark Elder Abuse Case: Bergman v. Chin, 3 Andrews Nursing Home Litig. Rep. 3 (2001). There will almost always be another medical professional who does not agree with the course of action taken. Moreover, the medical industry is unique in that the medical opinions of physicians sometimes lack the objective proof to reinforce their actions and regulations often give deference to medical judgments.64See Marissa Fritz, Using Subjective Evidence in FDA Review, Regul. Rev. (July 15, 2020), http://www.theregreview.org/2020/07/15/fritz-using-subjective-evidence-fda-review [http://perma.cc/
MQ7X-T2V3].

The MHB presents a fitting example of a controlling statutory restriction that specifically grants physicians this deference. Due to the growing number of aging individuals enrolled in Medicare, Congress passed the MHB in 1983.65Tax Equity and Fiscal Responsibility Act of 1982, Pub. L. No. 97-248, § 122, 96 Stat. 324, 356–63 (codified at 42 C.F.R. pts. 400, 405, 408, 409, 418, 420, 421, 489). Interdisciplinary teams are composed of health professionals from various specialties including physicians, therapists, spiritual counselors, and social workers. 42 C.F.R. § 418.56 (2020). The MHB allows Medicare beneficiaries to forego traditional curative care in favor of electing interdisciplinary palliative treatment.66See 42 U.S.C. § 1395y(a)(1)(C). It should be noted that Medicare only pays for palliative care services if they are deemed to be “reasonable and necessary.” Id. Curative care refers to care focused on improving a patient’s medical condition whereas palliative care provides relief, emotional and spiritual support, and comfort for patients with a terminal diagnosis. See James F. Barger, Jr., Symposium, Life, Death, and Medicare Fraud: The Corruption of Hospice and What the Private Public Partnership Under the Federal False Claims Act is Doing About It, 53 Am. Crim. L. Rev. 1, 13 (2016). However, eligibility is based on a written confirmation of a “terminally ill” prognosis by a physician or medical director.6742 U.S.C. § 1395f(a)(7)(A); see also 42 C.F.R. §§ 418.20(b), 418.22(a) (2021). Terminally ill is defined as “a life expectancy of 6 months or less if the terminal illness runs its normal course.”6842 C.F.R. § 418.22(b)(1) (2021). This certification must include a written narrative explaining the clinical findings and be accompanied by clinical information and other documentation.69Id. § 418.22(b)(2)–(b)(3). Once these conditions are met, Medicare and Medicaid programs will provide payment to hospice providers for costs incurred under the Social Security Act.70See Michael W. Youtt, H. Victor Thomas & Adam Robison, False Claims Act Actions-The Developing Case Law Regarding If and When Opinions of Medical Necessity Can Be Fraudulent, 27 Health Law. 36, 37 (2015). The regulations have directly acknowledged the “uniqueness of every Medicare beneficiary” and that predicting someone’s end-of-life is not an “exact science.”71Hospice Quality Reporting Requirements and Process and Appeals for Part D Payment for Drugs for Beneficiaries Enrolled in Hospice, 79 Fed. Reg. 50452, 50470–71 (Aug. 22, 2014) (to be codified at 42 C.F.R. pts. 405, 418). This shows that regulators have recognized the subjectivity in medical decision-making. Id. Accordingly, certifications may be renewed by the physician for additional sixty- or ninety-day periods.7242 U.S.C. § 1395f(a)(7)(A); see also 42 C.F.R. § 418.21 (2021).

Following the MHB’s establishment, FCA cases alleging hospice fraud have increased dramatically.73See, e.g., United States v. Care Alts., 952 F.3d 89 (3d Cir. 2020) (litigating FCA charges based on false hospice care claims), cert. denied, 141 S. Ct. 1371 (2021); United States v. AseraCare, Inc., 938 F.3d 1278 (11th Cir. 2019) (same); United States ex rel. Wall v. Vista Hospice Care, Inc., 778 F. Supp. 2d 709 (N.D. Tex. 2011) (same); United States ex rel. Holloway v. Heartland Hospice, Inc., 960 F.3d 836 (6th Cir. 2020) (same); United States ex rel. Lemon v. Nurses To Go, Inc., 924 F.3d 155 (5th Cir. 2019) (same). This includes two of the three circuit split cases.74See infra Sections III.A–B. Predictably, most cases are initiated by whistleblowers in qui tam suits, as foreseen by the legislature.75See Fraud Statistics, supra note 56. In 2016, the MHB provided hospice care to more than one million individuals, and Medicare reimbursed over $16 billion for hospice care.76Off. of Inspector Gen., U.S. Dep’t of health & Hum. Servs., OEI-02-16-00570, Vulnerabilities in the Medicare Hospice Program Affect Quality Care and Program Integrity: An OIG Portfolio 3 (2018). Moreover, unlike FCA litigation in other areas of law, FCA litigation in connection with the MHB demonstrates a unique scenario that has perplexed the courts: stratification of the FCA by a purposefully deferential statute.

D. An Undisputed Era of Objective Falsity

Objective falsity was widely considered to be the standard before the new Third and Ninth Circuit holdings challenged the status quo; a considerable number of courts, including the Third Circuit itself, have previously recognized this standard.77See infra Appendix A. This ostensibly established standard derived from a mix of healthcare- and non-healthcare-related FCA claims,78See infra Appendix A. which likely solidified its acceptance and promulgated its spread across jurisdictions. Some of these cases were decided as early as 200579See United States ex rel. Morton v. A Plus Benefits, Inc., 139 F. App’x 980, 982 (10th Cir. 2005). and are briefly explained below to illustrate the formerly unified landscape which has been shattered by the circuit split.80For a more comprehensive list, see infra Appendix A.

In United States v. Prabhu, the District of Nevada held:

To establish falsity under the FCA, it is not sufficient to demonstrate that the person’s practices could have or should have been better. Instead, plaintiff must demonstrate that an objective gap exists between what the defendant represented and what the Defendant would have stated had the Defendant told the truth.81United States v. Prabhu, 442 F. Supp. 2d 1008, 1032–33 (D. Nev. 2006).

The government alleged that the physician’s claims for pulmonary rehabilitation and simple pulmonary stress tests were false due to insufficient documentation.82Id. at 1010–11. The government interpreted the American Medical Association’s guidance publication to require specific measurements and a written report for a simple stress test.83Id. at 1028. However, the record indicated that Medicare failed to issue specific guidance regarding the precise type of documentation needed to provide care and that there was no physician writing documentation requirement.84Id. at 1016–17. In light of these facts, the parties’ contentions, and the “general confusion” among the government and its own experts, the court believed that “reasonable persons can disagree regarding the billing requirement[]” and the physician’s documentation practices fell within “the range of reasonable medical and scientific judgment.”85Id. at 1016–17, 1032. Furthermore, the government did not establish a concrete violation of a “controlling rule, regulation, or standard” when the physician provided pulmonary rehabilitation services.86Id. at 1032. As a matter of law, the government failed to establish falsity, and the court granted the motion for summary judgment.87Id. at 1026, 1032.

In United States ex rel. Wilson v. Kellogg Brown & Root, Inc., the Fourth Circuit determined that “[an] FCA relator cannot base a fraud claim on nothing more than his own interpretation of an imprecise contractual provision.”88United States ex rel. Wilson v. Kellogg Brown & Root, Inc., 525 F.3d, 370, 378 (4th Cir. 2008). The relators claimed that the defendant contractor, their former employer, falsely certified that it would uphold its contractual duties by maintaining military vehicles in “good appearance” when “it would not, and later did not, abide by those terms.”89Id. at 377. The court outright rejected this assertion because “[i]t is well-established that the FCA requires proof of an objective falsehood.”90Id. (citing United States ex rel. DRC, Inc. v. Custer Battles, LLC, 472 F. Supp. 2d 787, 797 (E.D. Va. 2007)). The court also found no evidence of this claim, as the United States government—the actual party to the contract—never expressed dissatisfaction with the contractor’s performance.91Id. Relying solely on their interpretation of imprecise maintenance provisions, the relators failed to state a valid falsity claim under the FCA.92Id. at 378.

In United States ex rel. Yannacopoulos v. General Dynamics, the Seventh Circuit decided that “[a] statement may be deemed ‘false’ for purposes of the False Claims Act only if the statement represents ‘an objective falsehood.’ ”93United States ex rel. Yannacopoulos v. Gen. Dynamics, 652 F.3d 818, 836 (7th Cir. 2011). The relator contended that amendments to a contract between a company and Greece were “reverse false claims,” false statements used to conceal, avoid, or decrease an obligation to pay or transmit money or property to the government.94Id. at 835. However, the relator simply relied on his interpretation of the terms of agreement without proof of any evidence.95See id. at 836–39. As a result, the court affirmed the district court’s motion for summary judgment.96Id. at 840.

In United States ex rel. Wall v. Vista Hospice Care, Inc., the Northern District of Texas ruled that “[a] testifying physician’s disagreement with a certifying physician’s prediction of life expectancy is not enough to show falsity.”97United States ex rel. Wall v. Vista Hospice Care, Inc., No. 3:07-cv-00604-M, 2016 U.S. Dist. LEXIS 80160, at *56 (N.D. Tex. June 20, 2016). The relator asserted, inter alia, that defendant hospice service providers improperly enrolled and sought reimbursement from Medicare and Medicaid for patients who were not eligible for hospice care.98Id. at *55. Although the relator presented a medical expert’s testimony that ninety percent of the records were ineligible for certification, it was not sufficiently linked to the corporate scheme to falsify records and thus did not create a triable “fact issue as to falsity.”99See id. at *33, *62.

II. BUILDING AN ANALYTICAL FRAMEWORK

Important cases have discussed how opinions relate to the FCA, when opinions may be considered false in the context of medical necessity, and the two theories of falsity.100See infra Sections II.A–C. The totality of these cases provides an analytical framework with which to analyze the circuit split and are discussed below:

 A. When Opinions Can Be False

As a prelude to the circuit split, the Supreme Court in Omnicare, Inc. v. Laborers District Council Construction Industry Pension Fund addressed the issue of when opinions can be false.101See Omnicare, Inc. v. Laborers Dist. Council Constr. Indus. Pension Fund, 575 U.S. 175, 176 (2015). The case involved Omnicare, the largest pharmacy services provider for nursing home residents in the United States, and its filed registration statement with the Securities and Exchange Commission (“SEC”).102Id. at 179. The filing included two statements of opinion. First, the company believed that its “contract[ual] arrangements with other healthcare providers, . . . pharmaceutical suppliers and . . . pharmacy practices [were] in compliance with applicable federal and state laws.”103Id. Second, the company believed that its “contracts with pharmaceutical manufacturers [were] legally and economically valid arrangements that [brought] value to the healthcare system and the patients.”104Id. at 180. The plaintiffs, pension funds that purchased Omnicare stock, alleged that the company’s statements were materially false based on later lawsuit filings from the government stating that the company received payments from drug manufacturers in violation of anti-kickback laws.105Id. In addition to claims of materially false representations regarding legal compliance, the complaint maintained that none of the company’s officers and directors possessed reasonable ground to believe that the opinions offered were truthful and complete.106Id. In support of this, plaintiffs pointed to the fact that one of Omnicare’s attorneys previously warned of a contract that carried a heightened risk of liability under anti-kickback laws.107Id.

The district court granted Omnicare’s motion to dismiss on the grounds that the statements about a company’s belief regarding its legal compliance are only actionable if those who made the statements knew they were untrue at the time.108Id. at 181. The court thus concluded that the plaintiffs’ complaint failed to meet the standard because there were no allegations stating that Omnicare’s officers knew they were violating the law.109Id.

On appeal, the Sixth Circuit reversed the district court’s holding.110Id. The court acknowledged that the opinions related to legal compliance, rather than “hard facts.”111Id. (quoting In re Sofamor Danek Group Inc., 123 F.3d 394, 401–02 (6th Cir. 1997)). Nonetheless, the court proceeded to explain that the plaintiffs simply had to allege that the opinion was objectively false and were not required to contend that an Omnicare employee “disbelieved [the opinion] at the time it was expressed.”112Id. (quoting Fait v. Regions Fin. Corp., 655 F.3d 105, 110 (2d Cir. 2011)).

After granting certiorari, the Supreme Court addressed the following two issues: (1) when an opinion may constitute a factual misstatement; and (2) when an opinion may be considered misleading by the omission of discrete factual representations.113See id. at 186–89. On the first issue, the Court held that sincere statements of pure opinion are not “ ‘untrue statement[s] of material fact,’ regardless [of] whether an investor can ultimately prove the belief [was] wrong.”114Id. at 186. To support its contention, the Court viewed the clause as limiting investors’ ability to “second-guess inherently subjective and uncertain assessments. In other words, the provision is not . . . an invitation to Monday morning quarterback an issuer’s opinions.” Id. Relying on common law principles, the Court illustrated two examples that provided exceptions to when statements of pure opinion can be false.115Id. at 184–86. These exceptions include when (1) the speaker does not actually hold the opinion, or (2) the opinion contains a false, embedded fact.116Id. On the second issue, the Court ruled that opinions may be misleading when a registration statement omits material facts about the issuer’s inquiry into or knowledge concerning a statement of opinion and if those facts conflict with what a reasonable investor would take from the statement itself.117Id. at 189. The Court asserted that this principle is consistent with the common law tort of misrepresentation. Id. at 191–92. Undisclosed facts may constitute a misleading opinion when the expression of such opinion involves an “implied assertion” that the speaker is unaware of any contradictory facts and that the speaker understands facts which justify the opinion. Id. The Court does note, however, that an opinion is not necessarily misleading if it omits facts that “cut[] the other way” and analyses into this inquiry should always consider context.118Id. at 189–90.

Thus, the Supreme Court effectively recognized that individuals make false opinions when (1) they do not actually hold the opinion; (2) the opinion contains a false, embedded fact; (3) they are aware of facts that would preclude such an opinion; or (4) they are not aware of any facts that would justify the opinion.119Id. at 184–89.

B. Opinions Analysis in the Context of Medical Necessity

In United States v. Paulus, the Sixth Circuit conducted an Omnicare-based analysis in the context of a medical case without explicitly referencing the case.120See United States v. Paulus, 894 F.3d 267, 275 (6th Cir. 2018). In Paulus, a cardiologist was criminally prosecuted for health care fraud and false statements.121Id. at 267. Specifically, the cardiologist exaggerated the extent of arterial blockages in his patients in order to perform and bill for medically unnecessary coronary stenting procedures.122Id. at 270–71. The crux of this case depended on the interpretation of angiograms, with the plaintiff using the testimony of nine doctors to testify that the level of blockage differed from what the defendant had reported.123Id. at 273–74. Of note, there were instances in which the defendant reported more than seventy percent blockage when in reality there was no blockage according to expert testimony. Id. The defendant responded by pointing out the subjectivity of angiogram interpretation, including data from multiple studies.124Id. at 272.

During trial at the district court level, the jury convicted the cardiologist of healthcare fraud and making false statements.125Id. at 270. However, the court directed a judgment of acquittal and subsequently ordered a new trial.126Id. at 274–75. The court reasoned that the degree of arterial blockage was a matter of “subjective medical opinion,” and thus the cardiologist’s angiogram interpretations “could be neither false nor fraudulent.”127Id. at 275.

On appeal, the Sixth Circuit reversed because it believed that clinical judgments can trigger FCA liability when an individual (1) asserts an opinion they do not truly believe, or (2) has knowledge of facts that contradict their opinion.128Id. at 275–76. The court reasoned that “[t]he degree of stenosis is a fact capable of proof or disproof.”129Id. at 275. The court then likened the deliberate inflation of blockages on an angiogram to the telling of a lie, which infers the commission of a fraud when paired with the billing of a more expensive procedure.130Id. In its analysis, the court essentially utilized the first two false opinion definitions described in Omnicare: (1) not honestly holding an opinion, and (2) an opinion containing a false, embedded fact.131Compare id. (discussing the two ways in which clinical judgments can be false), with Omnicare, Inc. v. Laborers Dist. Council Constr. Indus. Pension Fund, 575 U.S. 175, 184–86 (2015) (describing the same two factors with different phrasing). The court thought it to be clear that angiograms are facts and implied that angiogram interpretations are obviously not facts “capable of confirmation or contradiction.”132United States v. Paulus, 894 F.3d 267, 275 (6th Cir. 2018). Accordingly, the court pivoted to the idea that the cardiologist did not give an opinion but instead misrepresented facts by lying about the results.133Id. at 276. The court believed that the cardiologist did not simply misread the angiograms but rather “repeatedly and systematically saw one thing on the angiogram and consciously wrote down another, and then used that misinformation to perform and bill unnecessary procedures. The difficulty of interpreting angiograms has no bearing on the capacity of these statements to be false.” Id. Thus, the court reversed the trial court’s judgment and reinstated the jury’s verdict.134Id. at 280.

C. Factual Versus Legal Falsity

In United States ex rel. Polukoff v. St. Mark’s Hospital, the Tenth Circuit identified and distinguished between two types of falsities, factual and legal falsity, prior to conducting a falsity analysis under the FCA.135See United States ex rel. Polukoff v. St. Mark’s Hosp., 895 F.3d 730, 741 (10th Cir. 2018). In this case, a relator, the former co-worker of the defendant, sued the defendant-cardiologist as well as two hospitals under the FCA.136Id. at 734. The complaint alleged that the cardiologist performed thousands of medically unnecessary cardiac surgical procedures and fraudulently certified otherwise to receive reimbursement under the Medicare Act.137Id. Central to this claim was the Centers for Medicare and Medicaid Services’ (“CMS”) “reasonable and necessary” requirement for surgeries.138Id. at 735. Industry guidelines indicated when performing surgeries would be appropriate for specific types of patients, which the cardiologist allegedly ignored.139Id. at 736–37. Instead, he misrepresented on the certifications that he had performed them in accordance with the guidelines.140Id. Thus, this representation was false under the FCA.141Id. at 739.

The district court granted the defendants’ motion to dismiss. The court reasoned that “Medicare does not require compliance with an industry standard as a prerequisite to payment. Thus, requesting payment . . . does not amount to a ‘fraudulent scheme.’ ”142United States ex rel. Polukoff v. St. Mark’s Hosp., No. 2:16-cv-00304-JNP-EJF, 2017 U.S. Dist. LEXIS 8167, at *27 (D. Utah Jan. 19, 2017), rev’d and remanded, 895 F.3d 730 (10th Cir. 2018). Moreover, “because [o]pinions, medical judgments, and conclusions about which reasonable minds may differ cannot be false for the purposes of an FCA claim,” the relator failed to state a claim under the FCA.143Polukoff, 895 F.3d at 739 (internal quotation marks omitted).

On appeal, the Tenth Circuit reversed and remanded because it fundamentally disagreed with the district court’s narrow interpretation of the FCA’s reach.144See id. at 741. The court read the FCA broadly so as to encompass “claims for medically unnecessary treatment.”145Id. at 742. Another reason the court presented was “that an allegedly false statement constitut[ing] the speaker’s opinion does not disqualify it from forming the basis of FCA liability.”146Id. To support this reasoning, the court looked to its bifurcated understanding of falsity in a previously decided case.147See id. at 741. The court held that “false” may indicate factually false or legally false.148Id. (citing United States ex rel. Lemmon v. Envirocare of Utah, Inc., 614 F.3d 1163, 1168 (10th Cir. 2010)). Factually false claims are express claims that are not based in fact (for example, seeking payment for services that were never provided or submitting incorrect information), whereas legally false claims cover instances where an individual certifies compliance with applicable legal requirements when, in fact, the individual knew there was no compliance.149Id. Since the relator’s complaint alleged non-compliance with Medicare regulations, the court’s straightforward, logical analysis of legal falsity was as follows: (1) “[a] Medicare claim is false if it is not reimbursable;” (2) “a Medicare claim is not reimbursable if the services provided were not medically necessary;” and (3) in order for a claim to be medically necessary, “it must meet the government’s definition of ‘reasonable and necessary,’ as found in the Medicare Program Integrity Manual.”150Id. at 742. The procedures, certified by the cardiologist, did not comport with the government’s definition of the phrase, and thus the certifications were false under the FCA.151Id. at 743.

III.  ANALYSIS

While most articles have divided the circuit split issue between objective and subjective falsity,152See West, supra note 18; Caruso, supra note 18. further inspection demonstrates that the circuit split is not binary. All three cases in the circuit split look to the statutory language of the FCA.153See infra Sections III.A–C. The Eleventh and Third Circuit interpretations directly conflict, as they arrived at an objective and subjective falsity standard, respectively, after contemplating the same regulations surrounding the MHB.154See infra Sections III.A–B. The Ninth Circuit case did not involve the MHB but instead considered the statutory language of Medicare programs and the CMS’s definition of “reasonable and necessary.”155See infra Section III.C. Although the Ninth Circuit fundamentally employed the same analysis as the Eleventh Circuit, it explicitly rejected the Eleventh Circuit’s objective falsity standard and implicitly adopted the subjective falsity standard.156See infra Section III.C. Thus, three distinct standards have emerged from the case law.

First, this Section will discuss the Eleventh Circuit’s analysis and decision in United States v. AseraCare, Inc., which establishes a higher burden of proof at the summary judgment stage for relators and the government. Second, this Section will examine the Third Circuit’s holding in United States v. Care Alternatives and why it chose to critique and explicitly depart from the Eleventh Circuit’s adoption of the objective falsity standard. Third, this Section will consider the Ninth Circuit’s more even-handed analysis in Winter ex rel. United States v. Gardens Regional Hospital & Medical Center, Inc. and why it refused to adopt a rigid falsity standard.

A. United States v. AseraCare, Inc.

In AseraCare, the government intervened in a qui tam suit filed by three former AseraCare employees against AseraCare, claiming that the hospice provider had a practice of knowingly submitting unsubstantiated Medicare claims in violation of the FCA.157United States v. AseraCare, Inc., 938 F.3d 1278, 1282, 1284 (11th Cir. 2019). The government intervenes in approximately twenty-five percent of FCA claims. Government Intervention in False Claims Acts, Butler Prather LLP, https://www.butlerwprather.com/practice-areas/government-intervention-in-false-claims-acts [https://perma.cc/Y55G-7NH3]. Generally, the government reviews the information about the claim and initiates an independent investigation of the alleged illegal acts. Id. The government then decides whether to intervene, decline intervention, or move to dismiss the relator’s complaint based on the findings of the investigation. Id. It should be noted, however, that simply because the government intervenes in a case does not mean that the government automatically agrees with the relator’s claims. Id. Rather, the government may have found another basis on which to intervene. Id. The government likely intervened in this case due to the scale of the fraud and amount of monetary loss involved. See infra notes 161–63 and accompanying text. These reckless business practices allegedly enabled the provider “to admit, and receive reimbursement for, patients who were not eligible for [MHB],” resulting in the “misspending” of millions of Medicare dollars.158AseraCare, 938 F.3d at 1284. The court noted this case as falling under the “false certification” theory of FCA liability (in other words, when there is a false implication of having complied with a legal requirement).159Id. This theory is akin to the Tenth Circuit’s legal falsity framework in Polukoff. See United States ex rel. Polukoff v. St. Mark’s Hosp., 895 F.3d 730, 741 (10th Cir. 2018).

To establish its case, the government first identified over 2,000 hospice patients for whom AseraCare had billed Medicare.160AseraCare, 938 F.3d at 1284. The government then narrowed this population to a subset of 223 patients and retained a physician to directly review these patients’ medical records and clinical histories.161Id. at 1284–85. Acting as the government’s primary expert witness, the physician, relying on his own clinical judgment, opined that 123 out of 223 patients were ineligible for hospice benefits at the time AseraCare received reimbursements from Medicare.162Id. at 1285. Critically, the government’s case was substantially weakened when its expert witness conceded that he was unable to affirmatively say whether AseraCare’s medical expert, or any other physician, was wrong about the accuracy of the prognoses at issue.163Id. at 1287. The judgment of AseraCare’s medical expert expectedly conflicted with the judgment of the government’s expert witness. Id. Furthermore, the expert witness (1) never testified that no reasonable doctor could have concluded that the patients were terminally ill at the time of certification, and (2) changed his opinion concerning the eligibility of certain patients over the course of the proceeding.164Id. at 1287–88.

A brief recitation of the procedural posture and history is warranted so as to provide context for the appellate court’s analysis. Following discovery and analysis of relevant patient records, AseraCare moved for summary judgment on the grounds that the government failed to adduce evidence of falsity under the FCA.165Id. at 1285. In its motion, AseraCare specifically asked the district court to apply the “reasonable doctor” standard; that is, “the government must show that a reasonable physician applying his or her clinical judgment could not have held the opinion that the patient at issue was terminally ill at the time of certification.”166Id. at 1286. Even though the district court found this standard convincing, it declined to apply it and denied the motion.167Id. The district court noted that the standard had not been adopted by the Eleventh Circuit, which may have influenced its decision to deny the motion for summary judgment. See id. The court also believed that “fact questions remained regarding whether clinical information and other documentation in the relevant medical records supported the certifications of terminal illness.” Id. The district court then bifurcated the trial into two phases, one on the falsity element and the second on the remaining FCA elements.168Id. at 1286–87. This limited the government’s ability to rebut AseraCare’s expert testimony during the first phase.169Id. at 1288. Nonetheless, the dueling expert testimony was a critical component of trial. The government’s expert and AseraCare’s expert diverged in how they approached analysis of patient life expectancy.170See id. The government’s expert used a “checkbox approach” to assess terminal illness by comparing patient records to medical guidelines.171Id. By contrast, AseraCare’s expert did not formulaically apply guidance and used a more “holistic” approach.172Id. At the trial’s conclusion, the district court provided the following jury instruction: “A claim is ‘false’ if it is an assertion that is untrue when made or used. Claims to Medicare may be false if the provider seeks payment, or reimbursement, for health care that is not reimbursable.”173Id. at 1289. Thus, the jury had to decide which expert was more persuasive, with the less persuasive opinion being deemed a false opinion.174Id. at 1288–89. In its answers to special interrogatories, the jury found that AseraCare had submitted false claims for 104 of the 123 patients at issue.175Id. at 1289.

Following this partial verdict, AseraCare moved for judgment as a matter of law, contending that the district court articulated an incorrect legal standard in its instruction.176Id. at 1290. The court agreed that it had committed reversible error in its instruction and ordered a new trial.177Id. The court believed it should have advised the jury of two “key points of law,” which were not previously acknowledged: (1) “the FCA’s falsity element requires proof of an objective falsehood”; and (2) “a mere difference of opinion between physicians, without more, is not enough to show falsity.”178Id. (emphasis omitted). The court noted that “AseraCare had advocated for this legal standard since the start of trial, but only after hearing all the evidence had the court become ‘convinced’ that ‘a difference of opinion is not enough.’ ” Id. The court then considered summary judgment sua sponte and concluded that the government could not prove the falsity element as a matter of law because the government “presented no evidence of an objective falsehood for any of the patients at issue.”179Id. Summary judgment was granted in AseraCare’s favor, and the government appealed.180Id.

On appeal, the government’s core argument was that competing expert testimony regarding patients’ medical records supporting a terminal illness prognoses was enough to raise a factual question for the jury.181Id. at 1291. In contrast, AseraCare contended that the determinative inquiry was whether the certifying physician exercised genuine clinical judgment.182Id. at 1291–92. If so, the accuracy of such judgment cannot be false as a factual matter.183Id. at 1292. The Eleventh Circuit immediately recognized that “the standard for falsity [was] in the context of the Medicare hospice benefit, where the controlling condition of reimbursement is a matter of clinical judgment.”184Id. at 1291. Accordingly, the Eleventh Circuit was tasked with considering how the FCA intersects the scope of hospice eligibility requirements.185Id.

The Eleventh Circuit initially evaluated whether the falsity claim was a legal or factual falsity.186See id. The court concluded that the case concerned a legal falsity claim because “[t]here is no allegation that the hospice services AseraCare provided were not rendered as claimed.”187Id. Then, the court identified the following two “representations,” which may form the legal basis for an FCA claim: (1) the “representation by a physician to AseraCare that the patient is terminally ill in the physician’s clinical judgment”; and (2) the “representation by AseraCare to Medicare that such clinical judgment has been obtained and that the patient is therefore eligible.”188Id. at 1295–96. The court found that the government’s allegations only referred to the first representation.189Id. at 1296. The first representation, however, made it such that the government’s FCA case rested entirely on the question of when a “physician’s clinical judgment regarding a patient’s prognosis [can] be deemed ‘false.’ ”190Id.

To answer this question, the court heavily relied on applicable regulations and the text of the MHB statute due to the “dearth of controlling case law.”191Id. at 1292–95. The court looked to the plain meaning of the entire statue and regulations instead of focusing on specific words.192Id. at 1292. The general requirements were that (1) hospice providers must submit a certification claim for patients, (2) the certification must be in writing, (3) the certification must be based on clinical judgment, (4) clinical information and other documentation supporting the prognoses must accompany the certification, and (5) the reimbursement must be for “reasonable and necessary” payments for managing terminally ill patients.193Id. at 1292–93. The court subsequently pointed out that several requirements allow for a certain degree of subjectivity.194Id. at 1293. The court noted regulations stating that “[p]redicting life expectancy is not an exact science.” Id. For example, submission of claims must be individually tailored to each patient’s clinical circumstances.195Id. Check boxes and standard language used for all patients are prohibited.196Id. Furthermore, the subjective and objective medical findings of each patient should be considered.197Id. The court believed that this built-in flexibility was fully intended by Congress and that Congress would have used different language if it wanted a more rigid and objective standard.198Id. at 1294. Thus, the court’s role was not to establish a more objective standard against the implied language of the statute and regulations.199See id. at 1294–95.

Although the court emphasized that the regulations intended for MHB eligibility were to simply be predicated on the procurement of a physician’s clinical judgment, the government sought to elevate the standard such that the underlying information must support, “as a factual matter,” the certification.200Id. at 1294. The court disagreed with this framing of the eligibility requirements, stating that it is not consistent with the text or design of the law.201Id. at 1295. The relevant regulations merely require that clinical information and other documentation supporting the medical prognosis accompany the certification and be filed in the medical record.202Id. at 1294. The court therefore determined that supporting documentation does not have to, standing alone, prove the validity of a physician’s initial clinical judgment.203Id. As long as the physician’s interpretation is reasonable, certification requirements are met.204See id.

The Eleventh Circuit ultimately concurred with the district court’s holding that a mere difference of medical opinion alone is insufficient to establish falsity under the FCA; however, it also ruled that the district court had gone too far in sua sponte granting summary judgment.205Id. at 1297, 1302–05. The court recognized that reasonable doctors may disagree on a patient’s condition and that neither one could be wrong.206Id. at 1296. As a result, “[a] properly formed and sincerely held clinical judgment is not untrue even if a different physician later contends that the judgment is wrong.”207Id. at 1297. To reach this conclusion, the court relied on and cited to the Supreme Court’s decision in Omnicare.208See id. Adhering to Omnicare’s general principles, the court acknowledged that opinions regarding terminal illness can be deemed objectively false in various circumstances.209Id. For example, the court noted that a physician’s opinion can be false when the “physician fails to review a patient’s medical records or otherwise familiarize himself with the patient’s condition.” Id. An opinion can also be false when “a physician did not, in fact, subjectively believe that his patient was terminally ill at the time of certification.” Id. Moreover, a physician’s opinion can be false “when expert evidence proves that no reasonable physician could have concluded that a patient was terminally ill given the relevant medical records.” Id. These are essentially the same factors that the Omnicare decision identified. See Omnicare, Inc. v. Laborers Dist. Council Constr. Indus. Pension Fund, 575 U.S. 175, 184–89 (2015). The court, however, maintained that in each of the above examples, the “flaw . . . can be demonstrated through verifiable facts.” AseraCare, 938 F.3d at 1297. The court finally deferred to the legislature or CMS after the government expressed concerns that an objective falsity standard “will likely prove more challenging for an FCA plaintiff.”210AseraCare, 938 F.3d at 1301.

B. United States v. Care Alternatives

Like the AseraCare case, Care Alternatives involved qui tam relators who were former employees of a hospice provider, Care Alternatives.211United States v. Care Alts., 952 F.3d 89, 91 (3d Cir. 2020), cert. denied, 141 S. Ct. 1371 (2021). The government declined to intervene. Id. at 93. It is unclear why it pursued this option. The relators alleged that Care Alternatives admitted ineligible MHB patients and directed its employees to alter the patients’ certifications to reflect eligibility.212Id. at 91. During discovery, both sides produced extensive evidence, which included dueling expert opinions.213Id. at 94. The relators’ expert examined nearly fifty patient records and opined that thirty-five percent of patients’ records did not support a certification of need for hospice care.214Id. The expert went even further and testified that “any reasonable physician would have reached the conclusion he reached.”215Id. Care Alternatives’ expert disagreed and believed that a reasonable physician would have found all of the patients to be hospice-eligible.216Id.

At the district court level, Care Alternatives moved for summary judgment based on the finding that the relators could not satisfy the four elements of the FCA claim.217Id. In particular, Care Alternatives claimed that relators had not produced sufficient evidence of falsity.218Id. Of note, the government submitted a statement of interest urging the district court to reject the objective falsehood standard. Id. The court granted Care Alternatives’ motion “based solely on failure to show falsity.”219Id. To reach its conclusion, the court looked to the holding in AseraCare, finding that a “mere difference of opinion between physicians, without more, is not enough to show falsity.”220Id. (emphasis omitted) (quoting Druding v Care Alts., Inc., 346 F. Supp. 3d 669, 685 (D.N.J. 2018)). The relators appealed.221Id. Thus, the question before the appellate court was whether a reimbursement claim may be considered false under the FCA simply on the basis of conflicting medical expert testimony.222Id. at 95.

In reviewing the appeal, the Third Circuit began its analysis by discussing the MHB.223See id. at 92. For the most part, the court agreed with the Eleventh Circuit’s interpretation in AseraCare of the certification requirements for Medicare reimbursement of terminally ill patients.224See id. Similar to the AseraCare court, the Third Circuit even noted that “making a prognosis is not an exact science.”225Id. at 93. However, departing from the Eleventh Circuit’s reading, the court emphasized that this “inexactitude does not negate the fact that there must be a clinical basis for a certification.”226Id. (internal quotation marks omitted).

Where the Third Circuit truly departed from the Eleventh Circuit was in its common law analysis of the terms “false” or “fraudulent” under the FCA.227See id. at 95. Due to the lack of statutory guidance on the meaning of falsity, the court identified, from its prior cases and the Tenth Circuit’s rationale in Polukoff,228Id. at 98. the following two ways in which a claim may be false: (1) “factually, when the facts contained within the claim are untrue”; and (2) “legally, when the claimant . . . falsely certifies that it has complied with a statute or regulation the compliance with which is a condition for government payment.”229Id. at 96 (quoting Druding v Care Alts., Inc., 346 F. Supp. 3d 669, 682 (D.N.J. 2018)). As applied to the case before the court, Care Alternatives allegedly made incorrect certifications, which qualified the claim under the legal falsity theory.230Id. at 97. The court reasoned that the objective falsity standard is at odds with the concept of legal falsity, which is the appropriate standard, and by adopting the prior standard, the district court limited its analysis to factual falsity.231See id. The court further held that the district court’s objective falsity standard conflated the knowledge and falsity elements of an FCA claim.232Id. at 96. The Third Circuit believed that the district court incorporated the knowledge element into its analysis by finding that the relators “could not prove falsity because they had not produced evidence that any physician lied and received a kickback to certify any patient as hospice eligible” or “certif[ied] any patient whom that physician believed was not hospice eligible.” Id. Thus, by rejecting the objective falsity standard, the court sought to separate the knowledge and falsity analyses to comply with the text of the statute.233Id. Under a legal falsity standard, disagreement between experts as to a physician’s certification may be evidence of falsity under the FCA.234Id. at 97.

The Third Circuit also considered and rejected the district court’s bright-line rule that a doctor’s clinical judgment cannot be “false.”235Id. at 98. In doing so, the court acutely relied on the Paulus opinion.236See id. Underlying the district court’s decision was the premise that medical opinions are subjective and cannot be false.237Id. at 94. The Third Circuit sided with the Sixth Circuit’s emphasis on the fact that medical “opinions are not, and have never been, completely insulated from scrutiny.”238Id. (quoting United States v. Paulus, 894 F.3d 267, 275 (6th Cir. 2018)). The Paulus holding suggested that good faith medical opinions are not punishable but dishonest medical opinions may trigger liability for fraud.239Id. (citing Paulus, 894 F.3d at 275–76). Consequently, in line with its legal falsity analysis, the court believed that whether an individual acted in good faith or misrepresented a fact, thereby committing fraud, was “exclusively” a question for the jury.240Id.

The Third Circuit then went on to explain why it chose to depart from the Eleventh Circuit’s standard. The first issue that the court highlighted was how the Eleventh Circuit framed the falsity question.241Id. at 98–100. The court interpreted its sibling court as having construed the clinical information and documentation requirement of the MHB in an overly narrow fashion when it concluded that the supporting documentation requirement is only designed to address the mandate that there be a medical basis for certification instead of considering “whether the clinical information and other documentation accompanying a certification of terminal illness support[s] . . . the physician’s certification.”242Id. at 99. (alteration in original) (quoting United States v. AseraCare, Inc., 938 F.3d 1278, 1294 (11th Cir. 2019)). Therefore, this limited the inquiry to whether there was sufficient evidence of “the accuracy of the physician’s clinical judgment regarding terminality,” which the court understood to exclude legal falsity and only include factual falsity.243Id. (quoting AseraCare, 938 F.3d at 1296). The court posited that under the legal falsity theory, conflicting medical opinion is relevant evidence of the clinical information and documentation requirements.244Id. at 100. Furthermore, the court characterized the AseraCare court as coming to the conclusion that clinical judgments cannot be untrue, which it fundamentally disagreed with based on its interpretation of common-law definitions.245Id.

Ultimately, the Third Circuit had a drastically different breakdown of the falsity issue as compared to the Eleventh Circuit because it based its entire analysis upon the distinction between what it understood to be factual and legal falsity.

C. Winter ex rel. United States v. Gardens Regional Hospital and Medical Center, Inc.

In Winter, the relator, a registered nurse and former director at Gardens Regional Hospital (“Gardens”), filed a qui tam FCA suit against her former employer.246Winter ex rel. United States v. Gardens Reg’l Hosp. & Med. Ctr., Inc., 953 F.3d 1108, 1112, 1114 (9th Cir. 2020), cert. denied sub nom. RollinsNelson LTC Corp. v. United States ex rel. Winters, 141 S. Ct. 1380 (2021). The procedural history of this case is relatively simple compared to those of the aforementioned cases. The relator alleged in a complaint that Gardens submitted Medicare claims falsely certifying that patients’ hospitalizations were medically necessary.247Id. In support of this claim, the relator pointed to her own after-the-fact review of admission records.248Id. at 1112–13, 1120. Gardens moved to dismiss the complaint for failure to state a claim, which was subsequently granted by the district court.249Id. at 1116. The district court asserted that to prevail on an FCA claim, plaintiffs must show that a defendant knowingly made an objectively false representation. Thus, a statement that implicates a doctor’s clinical judgment can never state an FCA claim because subjective medical opinions cannot be proven to be objectively false.250Id. at 1113. The relator appealed.251Id.

The Ninth Circuit started its analysis by reviewing the medical necessity requirement and the FCA.252See id. at 1113–14. Medicare reimburses providers for inpatient hospitalization only if the expenses incurred are “reasonable and necessary.”253Id. at 1113 (quoting 42 U.S.C. § 1395y(a)(1)(A)). CMS administers the Medicare program and has defined a reasonable and necessary service as one that “meets, but does not exceed, the patient’s medical need, and is furnished in accordance with accepted standards of medical practice for the diagnosis or treatment of the patient’s condition.”254Id. (internal quotation marks omitted). Similar to the MHB, the Medicare program allows doctors to form their own clinical judgment based on complex medical factors.255Id. However, the language specifically provides that factors must be documented in the medical record and the regulations consider medical necessity a question of fact.256Id. Thus, a physician’s certification has no presumptive weight in determining medical necessity and must be evaluated in the context of medical evidence.257Id. The court subsequently reasoned that the relator’s allegations fall under the “false certification” theory of FCA liability.258Id. Since medical necessity is a condition of payment, every Medicare claim includes an express or implied certification of necessary treatment.259Id. Accordingly, claims for unnecessary treatment are false claims.260Id. The court stated that many other circuits, including the Tenth in Polukoff and Third in Care Alternatives, reached the same conclusion regarding the scope of FCA claims.261Id. at 1118.

The Ninth Circuit then proceeded to analyze the application of opinions to the FCA by interpreting the language of the statute.262See id. at 1116–18. The court interpreted the FCA broadly, citing congressional intent and the Supreme Court’s refusal to “accept a rigid, restrictive reading” of the FCA.263Id. at 1116 (quoting United States v. Neifert-White Co., 390 U.S. 228, 232 (1968)). Due to the lack of statutory guidance on what constitutes a false or fraudulent claim, the court looked to common-law definitions.264Id. at 1117. The court noted, however, that Congress actually intended for the FCA to be broader than the common law based on the knowledge requirement. See id. In doing so, the court referred to treatises and a number of cases, including Paulus and Omnicare, that a subjective opinion may be fraudulent if (1) it is “not honestly held,” (2) it implies the existence of nonexistent facts, (3) the speaker knows facts that would preclude such an opinion, and (4) the speaker does not know facts that justify it.265Id. The court additionally explained that the “knowing presentation of what is known to be false” does not mean “scientifically untrue.”266Id. (internal quotation marks omitted). Although a scientifically untrue statement is false, it may not be actionable if it was not made with the requisite intent.267Id. Likewise, an opinion with no basis in fact can be fraudulent if expressed with knowledge.268Id.

The court considered and outright rejected the request from Gardens and amici curiae for the court to hold that the FCA requires plaintiffs to plead an objective falsehood.269Id. The court stated that the plain language of the FCA “does not distinguish between ‘objective’ and ‘subjective’ falsity or carve out an exception for clinical judgments and opinions.”270Id. The court further noted that policy arguments cannot supersede the “clear” statutory text and it could not engraft that requirement onto the statute.271Id. at 1113, 1117. The court therefore held that the FCA does not require plaintiffs to plead an objective falsehood.272Id. at 1119.

Interestingly, the court claimed that the Eleventh Circuit’s decision in AseraCare was not “directly to the contrary.”273Id. at 1118. First, the court noted that the Eleventh Circuit, notwithstanding the language about objective falsehoods, did not consider all subjective statements to be incapable of falsity.274Id. at 1118–19. Second, the court believed that the Eleventh Circuit narrowly confined the objective falsity standard to the MHB, which granted deference to physician judgment.275Id. at 1119. In the court’s view, its sister circuit did not necessarily apply the standard to a physician’s certification of medical necessity by (1) explicitly distinguishing Polukoff, and (2) explaining that the less-deferential medical necessity requirement remained an important safeguard to its reading of the MHB eligibility framework.276Id.

Given that litigation was at the motion to dismiss stage, the court ruled that the relator’s complaint plausibly alleged false certifications of medical necessity.277Id. at 1119, 1121. The relator (1) showed correlations between the spike in admissions and timing of the scheme; (2) presented both irregular admission trends and admission statistics; (3) alleged a specific number of false claims, each in great detail; and (4) set forth anecdotal evidence which supported both an inference of knowledge and falsity.278Id. at 1120. The court also plainly dismissed Gardens’ argument and the district court’s characterization of the relator’s allegation as simply being her own competing opinion.279Id. First, according to the court, opinions can establish falsity.280Id. Second, the court believed that even if the relator’s own evaluations of the medical record were discounted, there were enough facts alleged to suffice the plausibility of fraud.281Id.

In sum, while the Ninth Circuit disagreed with the Eleventh Circuit about the objective falsehood standard, it applied the same common law rule regarding when an opinion can be false for the purposes of an FCA claim.

IV. DISCUSSION

AseraCare, Care Alternatives, and Winter highlight a growing tension between the different approaches and standards within the falsity element of the FCA. The hospice context has been the battleground between the Third and Eleventh Circuits, which have attempted to solve the issue of whether dueling expert testimonies, without more, create a triable issue of fact for the jury.282See supra Sections III.A–B. Nonetheless, it is quite evident that the imposition of a rigid falsity standard lends itself to application in FCA claims which have no basis in hospice care certifications, as seen in Winter.283See supra Section III.C. Furthermore, how courts analyze false opinions according to laws and regulations as well as the intent behind them is of great importance because it forms the conceptual foundation for constructing a proper framework and reaching the most legally sound conclusion. The following questions naturally follow: How should courts analyze false opinions and the falsity standard? And is the objective or subjective falsity standard the more appropriate reading of the FCA statute?

Section IV.A argues that Polukoff and Omnicare provide a comprehensive framework for the courts to categorize types of FCA claims and, if the alleged conduct includes opinions, whether the opinion is false. Section IV.B argues that, given the reach of the FCA, objective and subjective falsity standards are appropriate depending on the applicable regulations. Section IV.C suggests that, as a practical matter based on policy concerns, Congress amend the FCA to create special definitions and provisions for professional medical judgment. Finally, Section IV.D addresses the competing policy trade-off of over-incentivization to file false claims and contextualizes the arguments made in this Note to the broader whistleblower policy debate.

A. The PolukoffOmnicare Common Law Test

Unlike the Care Alternatives court’s factual and legal falsity breakdown, the AseraCare and Winter courts utilized the PolukoffOmnicare common law framework to reach their conclusions; this is the proper way to analyze the falsity element of the FCA. First, the PolukoffOmnicare framework fully encompasses all types of FCA claims. The Polukoff court divides FCA claims into factual and legal claims.284See United States ex rel. Polukoff v. St. Mark’s Hosp., 895 F.3d 730, 741 (10th Cir. 2018). The Omnicare decision sets out the four different ways in which an opinion may be false: (1) the actor does not actually hold the opinion; (2) the opinion contains a false, embedded fact; (3) the actor is aware of facts that would preclude such an opinion; or (4) the actor is not aware of any facts that would justify the opinion.285See Omnicare, Inc. v. Laborers Dist. Council Const. Indus. Pension Fund, 575 U.S. 175, 184–89 (2015). The first step in any FCA claim determination should be the Polukoffanalysis. Courts can properly distinguish the entire universe of FCA claims into two categories and decide where the claim before them fits. Moreover, if legal claims are not implicated, the standard automatically defaults to an objective falsehood standard.286See infra Section IV.B. Courts should subsequently consider whether a legal, FCA claim fits into one of the four Omnicare false opinion types. Regardless of whether courts adopt an objective or subjective falsehood standard, the Omnicareframework remains pertinent because it defines the totality of false opinions. Adhering to this analytical procedure will not only ensure that common law precedent has been properly followed but also unofficially standardize the framework across circuits. As discussed above, the Eleventh and Ninth Circuits identically and correctly applied this framework.287See supra Sections III.A, III.C.

Second, while the Third Circuit correctly relied on Paulus to identify that opinions can be false, it fully ignored when opinions can be false according to the common law; it would not have made this fatal error if it used the Omnicare framework. The Paulus court specifically stated that “opinions may trigger liability for fraud when they are not honestly held by their maker,or when the speaker knows of facts that are fundamentally incompatible with his opinion.”288United States v. Paulus, 894 F.3d 267, 275 (6th Cir. 2018) (emphasis added). The “when” conjunctions in the statement are critical to understanding the common law reasoning behind false opinions. However, the Third Circuit seemingly disregarded the dependent clauses, so that it could adduce some misconstrued holding from another circuit to support its conclusion regarding subjective falsity. In a similarly reductive fashion, the Third Circuit mischaracterized the Eleventh Circuit’s holding in AseraCare to state that clinical judgments are never false.289United States v. Care Alts., 952 F.3d 89, 100 (3d Cir. 2020), cert. denied, 141 S. Ct. 1371 (2021). The primary reason the government was unable to successfully make its case in AseraCare was that, in lieu of available evidence, it solely used an expert witness who was unable to claim that no reasonable physician could have reached the contested conclusions.290United States v. AseraCare, Inc., 938 F.3d 1278, 1287 (11th Cir. 2019). The Third Circuit, however, conflated the lack of evidence with the Eleventh Circuit’s framing of the issue. In reality, the Eleventh Circuit noted that opinions can be false, as it directly followed and cited to the Omnicare decision.291Id. at 1297. The Third Circuit invoked the common law but never identified any evidence to suggest a false opinion under the Omnicare categories. The Third Circuit completely discounted the Supreme Court’s principle that, as a general rule, sincere statements of pure opinion are not “untrue statement[s] of material fact” even if the speakers are ultimately wrong.292Omnicare, Inc. v. Laborers Dist. Council Const. Indus. Pension Fund, 575 U.S. 175, 186 (2015). Thus, if the certifying physicians in Care Alternatives truly believed that their patients were terminally ill, the Third Circuit, without conducting a proper Omnicare-based analysis, would have controverted existing Supreme Court precedent.

Third, the Third Circuit’s entire analysis is based on its understanding of factual and legal falsity, but the court fundamentally misconstrued the relationship between objective falsity and the factual/legal falsity distinction in the Polukoff holding. The Polukoff decision indicated that factual falsity refers to express claims which are entirely based on fact, whereas legal falsity refers to any claim where legal requirements were not met.293United States ex rel. Polukoff v. St. Mark’s Hosp., 895 F.3d 730, 741 (10th Cir. 2018). Accordingly, a subset of legal falsity claims includes claims where the legal requirement was not met due to negligent, reckless, or deceitful conduct, which implicates some extent of knowledge or lack thereof (in other words, implied claims).294Id. The Polukoff court simply demarcated the types of FCA claims which could reasonably be brought by plaintiffs. The Third Circuit, however, proclaimed that objective falsity is incompatible with legal falsity.295United States v. Care Alts., 952 F.3d 89, 97 (3d Cir. 2020), cert. denied, 141 S. Ct. 1371 (2021). The underlying assumption with this assertion is that objective facts may only be employed to challenge facts and not opinions. On a theoretical level, this line of logic is problematic because facts are objectively more concrete than opinions. Accordingly, as a matter of law, facts take precedence over opinions in the hierarchy of proof. Although “pure” opinions cannot be rebutted with facts,296See Omnicare, 575 U.S. at 186. not all opinions are “pure,” especially those of medical professionals. Generally, professional opinions have some foundation in fact, which essentially places them on a spectrum between fact and opinion. This hybridization makes professional opinions susceptible to dispute by both facts and opinions. Thus, relegating the objective falsity standard to factual claims of falsity severely misses the extent of the standard’s applicability. On a practical level, the Supreme Court in Omnicare codified these observations into common law.297See id. at 184–89. For example, an opinion which contains a false, embedded fact is considered a false opinion.298Id. at 185–86. False opinions naturally fall under the Third Circuit’s legal falsity umbrella. The Supreme Court stated that if the embedded fact is proven to be false, the opinion is also false.299Id. This type of false opinion clearly allows for rebuttal with a contradictory factual finding, so an objective falsity standard is not necessarily improper when applied to legal falsity claims.

Finally, the Third Circuit improperly accused the Eleventh Circuit of wrongfully conflating the FCA’s knowledge and falsity elements; in fact, the Third Circuit was the court that conflated these elements. In AseraCare and Winter, the Eleventh and Ninth Circuits acknowledged that some evidence applies to proving both knowledge and falsity.300See United States v. AseraCare, Inc., 938 F.3d 1278, 1302–05 (11th Cir. 2019); Winter ex rel. United States v. Gardens Reg’l Hosp. & Med. Ctr., Inc., 953 F.3d 1108, 1120 (9th Cir. 2020), cert. denied sub nom. RollinsNelson LTC Corp. v. United States ex rel. Winters, 141 S. Ct. 1380 (2021). This necessarily makes it difficult to analyze these elements separately. Even the district court in Care Alternativesrealized this when it rejected the relators’ claim because they failed to establish evidence of the physicians’ underlying knowledge (such that any physician lied or actually believed that the certified patients were not hospice-eligible).301United States v. Care Alts., 952 F.3d 89, 96 (3d Cir. 2020), cert. denied, 141 S. Ct. 1371 (2021). The Third Circuit made clear that “in our Court, findings of falsity and scienter must be independent from one another for purposes of FCA liability.”302Id. at 100. This is a misunderstanding of what common law principles apply. The fundamental distinction between an honest opinion and an opinion made in bad faith is the prerequisite knowledge used in forming the opinion. Therefore, when determining whether an opinion is false, the common law specifically looks to the speaker’s intent, which necessarily implies a knowledge requirement. The Third Circuit clearly subverted Omnicare by ruling that an after-the-fact reasonable disagreement between physicians can show falsity. Furthermore, as a practical concern, the Third Circuit’s falsity and knowledge separation could substantially increase the risk that the juror’s perception becomes tainted. The district court in AseraCare bifurcated its trial because it feared that evidence related to the knowledge element, particularly AseraCare’s flawed admissions policies and certification procedures to determine if a patient was terminally ill, would be inferred by the jury to satisfy the falsity element.303See AseraCare. 938 F.3d at 1287. This did, in fact, confuse the jury’s analysis of the threshold falsity question.304Id. Conceptually, general corporate practices have no bearing on whether a particular hospice claim is false if the medical evidence points to the fact that the patient was terminally ill. Accordingly, the Third Circuit’s interpretation of the knowledge and falsity elements potentially writes the falsity element out of the FCA statute by allowing evidence of knowledge to cloud a jury’s perception of falsity.

B. Courts May Reasonably Reach Different Falsity Standards

The objective and subjective falsehood standards are not necessarily diametrically opposed in the broad legal sense. In remaining true to Congress’s intent, courts have used the FCA “to reach all types of fraud, without qualification, that might result in financial loss to the Government.”305Winter, 953 F.3d at 1116 (quoting United States v. Neifert-White Co., 390 U.S. 228, 232 (1968)). In doing so, the FCA has been interpreted alongside other applicable laws and regulations since its inception. Prior to the recent medical FCA cases, this was not an issue because fraud was never predicated solely on subjective professional opinions without a tangible associated fact.306See supra Section I.D. In Polukoff terms, the entire realm of FCA claims were factual claims and non-opinion legal claims. Due to the factual basis for these types of claims, courts had to adopt an objective falsehood standard, which slowly resulted in uniformity among jurisdictions. However, as noted by the Winter court, the Supreme Court “ ‘has consistently refused to accept a rigid, restrictive reading’ of the FCA.”307Winter, 953 F.3d at 1116 (quoting Neifert-White, 390 U.S. at 232). The same reasoning can be extrapolated to the falsity standard to the extent that it is an element of an FCA claim. Interactions between the FCA and applicable laws and regulations thus do not inherently allow for a universal falsity standard but rather the possibility of different falsity standards to be adopted in specific circumstances. This flexibility in the legal interpretation of falsity is also the better policy approach that allows for a more robust legal system.

The objective falsehood standard is an appropriate legal interpretation based on CMS’s guidelines and the MHB’s purposeful deference to physician judgment. The MHB and CMS’s guidelines for hospice eligibility repeatedly reference the subjectivity involved in determining terminal illness.308See AseraCare, 938 F.3d at 1293, 1295, 1304. First, as a general matter, Congress has not amended the hospice eligibility criteria.309See id. at 1295. Second, the MHB specifically prohibits the use of check boxes and requires a narrative explanation of the diagnosis.310Id. at 1293. Third, the MHB allows for unlimited recertifications.311Id. at 1283. Fourth, the MHB requires physicians to consider subjective and objective medical findings.312Id. at 1293. Finally, the MHB explicitly declared that predicting life expectancy is not an exact science.313Id. Taken in totality, these factors show the imprecise nature and complexity of hospice certifications. Thus, based on the lack of any statistical or medical measurement for longevity, medical professionals have been afforded the utmost deference by Congress. If courts were to adopt a subjective falsity standard with regards to hospice care, FCA trials would devolve into a meaningless battle of expert opinions, neither of which may be false. This is exactly what happened at the district level in AseraCare.314Id. at 1287. The objective falsity standard is more sensible and provides a safeguard to trivial FCA claims based on falsity. Congress’s intention was not to allow “rogues” to take a doctor to court based on their certification that a patient had six more months to live simply because they found another doctor who believed the same patient had a remaining life expectancy of six and a half months. By adopting an objective standard, juries are not forced to become a “third doctor” who simply evaluates purely medical judgments.

At the same time, the subjective falsehood standard is an appropriate legal interpretation based on Medicare regulations and CMS’s definition of “reasonable and necessary.” Medicare reimbursements for inpatient hospitalizations are contingent on the provided services being reasonable and necessary, as defined by the CMS.315Winter ex rel. United States v. Gardens Reg’l Hosp. & Med. Ctr., Inc., 953 F.3d 1108, 1113 (9th Cir. 2020), cert. denied sub nom. RollinsNelson LTC Corp. v. United States ex rel. Winters, 141 S. Ct. 1380 (2021). Similar to the MHB, Medicare regulations demand that doctors evaluate complex medical factors to form their clinical judgment. In contrast to the MHB, however, Medicare regulations do not give physicians “unfettered discretion.”316Id. at 1114. The regulations explicitly defer to the accepted standards of medical practice but provide no presumptive weight to a physician’s certification.317Id. Expert opinions must be analyzed in the context of medical evidence.318Id. An objective falsehood standard is not necessarily incompatible with Medicare hospitalization claims but would be redundant. Since the Medicare program already requires medical evidence for initial certifications, facts are presumably available in every case. The focal point of these cases surrounds the interpretation of these facts. Therefore, medical expert testimony offers more value than just competing medical theory. Testimony effectively provides valuable, logical medical inferences and contextualizes the interpretation of medical data, which is substantially less subjective than end-of-life determinations. Due to the fact that judgments are less rooted in medical theory and more rooted in medical practice, juries are able to make more substantiated findings, as they did so in Paulus and Polukoff. Accordingly, the subjective falsity standard is the more appropriate standard under these circumstances.

C. Legislative Action for the FCA

Given that rejections of the objective falsity standard have occurred exclusively in medical-related FCA claims, the unique challenges associated with the medical realm may be more efficiently handled through legislation. Judicial interpretations of falsity have been effective in filling the statutory gap in the FCA until the current circuit split, where false opinions in the medical context have divided the common law landscape. The crux of the issue is that the medical sector is quite anomalous when compared to other areas of practice, but courts cannot simply apply a medical-specific standard.319See Frank H. Easterbrook, Cyberspace and the Law of the Horse, 1996 U. Chi. Legal F. 207, 207–08 (1996). As a general matter, courts do not derogate legal standards based on the field of application (for instance, the financial sector does not receive a different legal standard from the technology sector simply because the fields are different).320Id. Common law seeks to prescribe a set of legal rules and principles that can be consistently applied.321Id. Absent some countervailing statute or regulation, the common law is standardized across all fields.322Id. The countervailing statute in AseraCare and Care Alternatives was the MHB. The countervailing regulation in Winter was Medicare. However, the plain language of the MHB statute and Medicare regulations grants different degrees of deference to doctors. As a result, it is unclear how courts can adopt one standard without subverting congressional deference to doctors. If courts adopt a bright-line objective falsity standard, they will comply with the MHB and protect medical professionals from frivolous FCA suits but impose a higher standard of proof for plaintiffs, which will prevent valid suits involving false Medicare certifications from getting through trial. On the other hand, if courts adopt a subjective falsity standard, false Medicare certification claims could be handled appropriately through FCA litigation while frivolous claims will be brought against physicians who genuinely certify hospice care, which neither the FCA nor the MHB protects against. As a result, while the Supreme Court can attempt to resolve the circuit split in the future, the resolution may not be desirable as the adopted standard may lack the nuance needed to accommodate both medical laws and regulations.

From a policy perspective, it is also in Congress’s interest to clarify how and when physicians should be held accountable for their clinical opinions. The legislative branch, beholden to the people, creates laws while the judiciary promotes fairness and justice through the interpretation of such laws. Congress originally wrote the MHB and authorized Medicare programs after appreciating the host of factors that go into complex medical decision-making. The objective was to strike a balance between accountability and scrutiny within different medical settings. It is simply not the courts’ job to engage in judicial policymaking that overrides congressional intent. Courts cannot require stricter or looser scrutiny of physician judgments as they see fit and would effectively be doing so by adopting a bright-line standard. Moreover, the majority of FCA claims for the past several years have been from the medical field.323See Fraud Statistics, supra note 56 (showing that fraud cases in the Department of Health and Human Services have drastically increased from 1986 to 2020 and now comprise the vast majority of all fraud cases). Based on this consistent trend and America’s aging population, the composition of FCA claims for the foreseeable future will remain dominated by healthcare-related claims. Thus, it is imperative that Congress provide explicit, meaningful guidance on this issue.

D. Policy Considerations for the FCA

Although this Note primarily focuses on the fraud deterrence aspect of the FCA, a comprehensive discussion would not be complete without addressing the competing policy tradeoff—over-incentivization of whistleblowers to file false or frivolous FCA claims. In 2020, the relator share awards totaled over $300 million with only 672 qui tam claims filed.324Id. (presenting the number of qui tam claims filed in column two and the total relator share awards in the last column). Although the potential for monetary gain differs for each case, it can be extrapolated from this data that whistleblowers can win hundreds of thousands, if not millions, of dollars if they prevail on a claim. This is a powerful incentive for unscrupulous individuals hoping to profit from this well-intentioned statute. A legal falsity standard certainly eases their ability to do so. However, the hurdles that would have to be overcome by such individuals virtually eliminate the risk of undeserving payouts. First, after the filing of a qui tam complaint, the government is required to investigate the allegations and can move to dismiss if the findings show that the relator has no grounds for the complaint.325Primer, supra note 34 (describing the government investigation process under the qui tam provisions section). Even if the government does not move to dismiss, approximately 52% of FCA cases are resolved at this stage by agreed dismissal or settlement.326Strategic Budgeting, supra note 55. Second, at the motion-to-dismiss stage, the complaint may fail as a matter of law due to lack of specificity. Third, at the summary judgment stage, the relator must show a genuine dispute of material fact. In the event that a legal falsity standard is applied, the relator may not have any issues convincing the court. Nonetheless, nearly 80% of FCA cases were resolved before the summary judgment stage.327Id. Thus, it is statistically unlikely for a relator, much less a dishonest one, to even reach this stage. Finally, the relator must prevail at trial. But more than 99% of FCA cases settle or are dismissed before reaching trial due to the high stakes nature of FCA litigation.328Id.; see Pamela H. Bucy, Games and Stories: Game Theory and the Civil False Claims Act, 31 Fla. St. U.L. Rev. 603, 608 (2004). Consequently, the systemic barriers and costly litigation process should sufficiently dissuade fraudulent rogues and assuage any concerns regarding overburdening of the judicial system.

In addition, while the extracted case law from the described cases does not provide an exactly useful model for FCA litigation in non-medical practice areas, the proposed non-duality falsity standard concept can be utilized to address the broader whistleblower policy debate. Whistleblower laws typically attempt to strike a balance between protecting the rights of whistleblowers and respecting an employer’s rights to remove personnel.329Philip Berkowitz, The Anti-Money Laundering Act (AMLA): Defending Whistleblower Claims in the Financial Services Industry, A.B.A. (Apr. 28, 2021), http://www.americanbar.org/groups/
business_law/publications/blt/2021/05/amla [http://perma.cc/8EYV-YRQE].
Lawmakers must therefore decide who whistleblowers can report information to while still receiving sufficient protections from employer retaliation.330See id. This has been the subject of scholarly debate and criticism for years, which still rages on today.331See, e.g., Elletta Sangrey Callahan & Terry Morehead Dworkin, The State of State Whistleblower Protection, 38 Am. Bus. L.J. 99, 100 (2000) (describing the competing incentive and protection approaches of various federal whistleblower laws); Thomas M. Devine, The Whistleblower Protection Act of 1989: Foundation for the Modern Law of Employment Dissent, 51 Admin. L. Rev. 531, 532–35 (1999) (discussing the relative effectiveness of the Whistleblower Protection Act of 1989 over the Civil Service Reform Act of 1978). In January 2021, the Anti-Money Laundering Act was enacted, which expanded the recipient list for employees of financial services institutions.332Berkowitz, supra note 329; 31 U.S.C. § 5323. However, the Act still imposes a rigid report recipient requirement.333See 31 U.S.C. § 5323(g)(1) (allowing whistleblowers to only report compliance violations to their employer, the attorney general, secretary of treasury, regulators, and members of Congress). Similar to how a rigid falsity standard fails to account for the plethora of intersecting laws and regulations, an unduly restrictive recipient list likely cannot match the diversity of situations that whistleblowers find themselves in. It is evident that the issue of rigid standards permeates the whistleblower legal arena. Moving forward, open-ended or flexible standards may provide the nuance necessary to usher in a new era of comprehensive whistleblower reforms.

CONCLUSION

AseraCare, Care Alternatives, and Winter are the first cases to adopt the objective or subjective falsehood standard for FCA claims in the context of medical certifications based on “false opinions.” The specific question at issue is whether dueling expert opinion, without more, creates a triable issue of fact for the jury. The objective falsity standard posits that conflicting opinions are not enough whereas the subjective falsity standard believes contradictory judgments are sufficient. The practical result of courts adopting the subjective standard is that relators and the government are more likely to survive the pleading and summary judgment stage by simply providing dueling expert opinion. An objective falsity standard makes it more difficult for plaintiffs to prevail on an FCA claim. The Eleventh Circuit in AseraCare adopted the objective standard after analyzing the plain language of the FCA and MHB. The Third Circuit in Care Alternatives arrived at the subjective standard after analyzing the same statutes. The Ninth Circuit in Winter implicitly agreed with the subjective standard after it considered the FCA and Medicare regulations. Superficially, there is a clear circuit split over the falsity standard. The two standards are at odds, but each one is applicable in different medical settings based on Congress’s intent and the plain meaning of the governing statutes and regulations. More importantly, however, is not what standard each circuit adopted but how the courts arrived at their conclusions.

While there are countless examples of FCA certifications requiring a medical opinion or exercise of discretion, the above trifecta of cases perfectly contrasts how courts should and should not invoke common law. The Eleventh and Ninth Circuits, while reaching different conclusions, employed the same common law framework and principles in their analyses. They primarily relied on (1) the Tenth Circuit’s Polukoff holding to distinguish factual and legal falsity, and (2) the Supreme Court’s Omnicare decision discussing when opinions may be deemed false. Conversely, the Third Circuit stated that it looked to common law for guidance while (1) misconstruing case law, (2) ignoring common law precedent, and (3) failing to apply common law in its lackluster analysis. Unlike the Third Circuit, courts should utilize the PolukoffOmnicare framework because it categorically constricts the universe of FCA claims into a logical, comprehensive framework with which to analyze false opinions.

The Supreme Court missed an opportunity to at least resolve the analytical differences between the circuits when it denied certiorari for Care Alternatives and Winter. As a matter of policy, the decision to adopt or reject a rigid falsity standard will have wide-ranging consequences, and it should be up to the legislature to insulate or scrutinize physicians for their certifications. Aside from adjudging these exercises of discretion as true or false, the Supreme Court has the responsibility of correcting circuits when they falter in their representation of common law principles. Omnicare is arguably the most relevant common law precedent in terms of providing an analytical framework for determining false opinions. Thus, the Third Circuit’s disregard of Omnicare sets an extremely disruptive example for other courts. Moving forward, the Supreme Court should announce that courts must abide by Omnicare when engaging in an FCA analysis involving opinions.

 

APPENDIX:  COURTS ADOPTING OR REJECTING OBJECTIVE FALSITY

Courta

Case Name

Year

Type of Legal Claim

Adopting Courts

Tenth Circuit

United States ex rel. Morton v. A Plus Benefits, Inc.b

2005

Medicaid

District of Nevada

United States v. Prabhuc

2006

Medicare

Fourth Circuit

United States ex rel. Wilson v. Kellogg Brown & Root, Inc.d

2008

Contract

Seventh Circuit

United States ex rel.Yannacopoulos v. General Dynamicse

2011

Contract

Third Circuit

United States ex rel. Hill v. University of Medicine & Dentistry of New Jerseyf

2011

Research Grant

Third Circuit

United States ex rel. Thomas v. Siemens AGg

2014

Contract

Northern District of Texas

United States ex rel. Wall v. Vista Hospice Care, Inc.h

2016

Medicare/Medicaid

Eleventh Circuit

United States v. AseraCare, Inc.i

2019

Medicare

Rejecting Courts

Third Circuit

United States v. Care Alternativesj

2020

Medicare/Medicaid

Ninth Circuit

Winter ex rel. United States v. Gardens Regional Hospital & Medical Center, Inc.k

2020

Medicare

Notes:  This list merely demonstrates the uniformity of the legal landscape prior to the Third and Ninth Circuit decisions in 2020 and is not intended to show every single jurisdiction that has ruled on the issue. aOf note, the Third Circuit originally adopted the objective falsity standard in 2011, reaffirmed the standard in 2014, and rejected the standard in 2020. This is quite peculiar, as it is the only Circuit that has switched its opinion on the issue. The Tenth Circuit adopted the objective falsity standard in 2005 but moved away from that decision in United States ex rel. Polukoff v. St. Mark’s Hospital. However, the Tenth Circuit has not explicitly embraced a subjective falsity standard. Sources:  bUnited States ex rel. Morton v. A Plus Benefits, Inc., 139 F. App’x. 980 (10th Cir. 2005). cUnited States v. Prabhu, 442 F. Supp. 2d 1008 (D. Nev. 2006). dUnited States ex rel. Wilson v. Kellogg Brown & Root, Inc., 525 F.3d 370 (4th Cir. 2008). eUnited States ex rel. Yannacopoulos v. Gen. Dynamics, 652 F.3d 818 (7th Cir. 2011). fUnited States ex rel. Hill v. Univ. of Med. & Dentistry of New Jersey, 448 F. App’x 314 (3d Cir. 2011). gUnited States ex rel. Thomas v. Siemens AG, 593 F. App’x 139 (3d Cir. 2014). hUnited States ex rel. Wall v. Vista Hospice Care, Inc., No. 3:07-cv-00604-M, 2016 U.S. Dist. LEXIS 80160 (N.D. Tex. June 20, 2016). iUnited States v. AseraCare, Inc., 938 F.3d 1278 (11th Cir. 2019). jUnited States v. Care Alts., 952 F.3d 89 (3d Cir. 2020), cert. denied, 141 S. Ct. 1371 (2021). kWinter ex rel. United States v. Gardens Reg’l Hosp. & Med. Ctr., Inc., 953 F.3d 1108 (9th Cir. 2020), cert. denied sub nom. RollinsNelson LTC Corp. v. United States ex rel. Winters, 141 S. Ct. 1380 (2021).

 

96 S. Cal. L. Rev. 665

Download

* Executive Postscript Editor, Southern California Law Review, Volume 96; J.D. Candidate, 2023 University of Southern California, Gould School of Law; M.P.H. Global Epidemiology 2018, Emory University Rollins School of Public Health; B.S. Integrative Biology 2016, University of Illinois at Urbana-Champaign. I would like to thank Professor Jonathan Barnett for guidance, Professor Eileen Decker for serving as my advisor, and the Southern California Law Review for excellent editorial assistance.

Lenity and the Meaning of Statutes

Ordinary canons of statutory interpretation try to encode linguistic rules into jurisprudence. Their purpose is to figure out the meaning of a text, and their outcome is to determine the meaning of the text. Both the purpose and the outcome are linguistic.

The rule of lenity is not an ordinary canon of statutory interpretation. The rule of lenity’s outcome is to determine the meaning of a text, giving ambiguous criminal statutes a narrow interpretation, but its purpose is public policy, protecting defendants when ambiguous statutes failed to give fair notice that their actions would be punished. Unlike the ordinary canons of statutory interpretation, lenity encodes into jurisprudence not a linguistic rule, but a policy rule. Thus, a discrepancy arises: lenity’s outcome is linguistic, but its purpose is non-linguistic.

This Article makes the following three contributions. First, it analyzes the nature of the discrepancy between lenity’s purpose and outcome. Second, it demonstrates that this discrepancy leads to doctrinal issues in how the rule of lenity is applied. Sometimes the rule of lenity is over-inclusive: it is applied even when there is no violation of fair notice. Sometimes the rule of lenity is under-inclusive: the rule of lenity fails to protect certain defendants that were misled by ambiguous criminal statutes. Third, this Article argues that we can align lenity’s purpose and outcome by reforming lenity into an excuse in criminal law, and this theoretical reformation will resolve the aforementioned doctrinal issues.

INTRODUCTION

Short-barreled rifles are often used for criminal purposes because their shorter length allows them to be more easily concealed.1See United States v. Thompson/Ctr. Arms Co., 504 U.S. 505, 517 (1992). For that reason, § 5821 of the Internal Revenue Code levies an excise tax on the manufacture of short-barreled rifles, while no such tax is levied on the manufacture of long-barreled rifles.2I.R.C. § 5821.

Thompson/Center Arms, a firearms manufacturer, packaged as one unit the following separate parts that were to be put together by the customer: a shoulder stock, a pistol, and a barrel extension.3Thompson/Ctr. Arms, 504 U.S. at 507. For convenience, I will call this unit of three parts the “Thompson/Center kit.” Putting the three pieces together—attaching the shoulder stock to the handle of the pistol and the extension to the barrel of the pistol—the customer would end up with a long-barreled rifle.4Id. at 508. If the customer only attached the shoulder stock to the pistol handle without using the barrel extension, then they would end up with a short-barreled rifle.5Id.

Thus, the following legal issue arose in United States v. Thompson/Center Arms Co. Is Thompson/Center Arms liable for the § 5821 excise tax? Does the manufacture of the Thompson/Center kit count as an instance of manufacturing a short-barreled rifle?

The Supreme Court stated that § 5821 is ambiguous about what counts as the manufacture of a short-barreled rifle and that the Thompson/Center kits sat squarely in the penumbra.6Id. at 513–24. On one hand, Thompson/Center Arms intended for the kits to be put together into a long-barreled rifle, but on the other hand, the kit made it tremendously easy for consumers to put together a short-barreled rifle regardless of Thompson/Center Arms’s intention.7Compare id. at 523 (Scalia, J., concurring), with id. at 524–25 (White, J., dissenting). Were the Court to construe § 5821’s language broadly, Thompson/Center Arms would be liable for the excise tax on short-barreled rifles, but were the Court to construe the statute’s language narrowly, Thompson/Center Arms would not be liable.

To resolve whether § 5821 should be given a broad or narrow reading, the Court applied the rule of lenity, which gives all ambiguous criminal statutes a narrow meaning, thus absolving Thompson/Center Arms of liability on the excise tax.8Id. at 517–18. This is a surprising application of the rule. The rule of lenity is a rule of statutory interpretation meant to apply only to criminal statutes to protect criminal defendants, yet it was applied in Thompson/Center to determine the meaning of a civil tax statute in favor of a civil plaintiff. Because the company had already paid the tax and was suing for a refund, no criminal penalties were at stake for Thompson/Center Arms.9Id. at 505.

Thompson/Center’s holding presents a major problem for the administration of tax law. The standard rule in civil law grants deference to an administrative agency’s interpretation of the relevant laws.10See Ryan D. Doerfler, Can a Statute Have More than One Meaning, 94 N.Y.U. L. Rev. 213, 233 (2019); Kristin E. Hickman, Of Lenity, Chevron, and KPMG, 26 Va. Tax Rev. 905, 912–21 (2007); Cass R. Sunstein, Chevron Step Zero, 92 Va. L. Rev. 187, 210 n.106 (2006); Cass R. Sunstein, Law and Administration After Chevron, 90 Colum. L. Rev. 2071, 2115–16 (1990). The topic of deference to the IRS’s interpretations of the tax law is much discussed, but it begins with the case law Chevron U.S.A. Inc. v. Natural Res. Def. Council, Inc., 467 U.S. 837, 842–44 (1984); Nat’l Muffler Dealers Ass’n v. United States, 440 U.S. 472, 476–77 (1979); Skidmore v. Swift & Co., 323 U.S. 134, 139–40 (1944). The rule of lenity runs in the opposite direction, interpreting statutes in favor of the taxpayer over the agency, the Internal Revenue Service (“IRS”).11See infra Section III.B.2. This poses a special danger to the IRS’s enforcement efforts against abusive tax shelters that prey on indeterminacies in the tax law.12See also Marvin A. Chirelstein & Lawrence A. Zelenak, Tax Shelters and the Search for a Silver Bullet, 105 Colum. L. Rev. 1939, 1950 (2005) (analyzing the formation of tax shelters and their interplay against countervailing measures).

Despite this problem, the Court’s hands were bound by a technicality. According to the rule of lenity, criminal statutes should be interpreted narrowly such that uncertainty about the meaning of the statute is resolved in a way lenient to the defendant.13United States v. Kozminski, 487 U.S. 931, 952 (1988). Section 5821 is, like tax law generally, a civil statute, but it is also a criminal statute because its meaning has implications for criminal liability. Under § 5871, criminal penalties would be imposed for non-compliance with § 5821.14I.R.C. § 5871. Section 5821 plays a dual role, determining how much tax one is required to pay and, thereby, defining the actus reus for criminal liability. Thus, although Thompson/Center Arms was litigating the civil matter of how much tax it owed, because the outcome of this case might have criminal implications down the road, § 5821 would need to be read narrowly, following the rule of lenity.15United States v. Thompson/Ctr. Arms Co., 504 U.S. 505, 517–18 (1992).

Thompson/Center thus establishes that the rule of lenity applies to statutes that serve both a criminal and civil purpose, even if the issue at bar is a purely civil one,16See id. because the interpretation of dual-purpose statutes in the civil context necessarily carries over to define criminal liability.17See id. at 518 n.10. Tax laws generally play this dual role since they determine civil tax liability, and criminal penalties are imposed for non-compliance with tax law.18See infra Section III.B.1. Using lenity to narrowly interpret the meaning of a tax statute will both limit the reach of criminal sanctions for tax evasion and also limit the assignment of civil tax liability.19See Thompson/Ctr. Arms, 504 U.S. at 506, 517–18.

The purpose of the rule of lenity, however, is to protect fair notice for criminal defendants.20Dan M. Kahan, Lenity and Federal Common Law Crimes, 1994 Sup. Ct. Rev. 345, 345 (1994). When statutes are ambiguous, citizens can be misled into thinking that their actions were permitted rather than prohibited. The law fails to communicate the expected standard of behavior. Given the severity of criminal punishment and the moral condemnation that attaches, we ought to be especially concerned about criminal defendants who did not receive fair notice of the law.21See infra Section I.B. Thus, when a defendant’s act is a borderline case of an ambiguous criminal statute, the law absolves them of criminal liability as a recognition of its own failure to provide fair notice that such an act would be punished.

Since the rule of lenity was supposed to provide fair notice in punishment, its application to civil tax law, where no punishment is at stake, grossly oversteps its purpose.22United States v. Fisher, 6 U.S. 358, 390 (1805); see also Andy S. Grewal, Why Lenity Has No Place in the Income Tax Laws, 81 Mo. L. Rev. 1045, 1051–53, 1051 n. 45 (2016) (arguing that there is no unique taxpayer-favorable interpretation as lenity would require); Hickman, supra note 10, at 932–33 (noting that tax shelters will be harder to police if lenity is applied to civil tax law). Even if a taxpayer loses a case determining their civil tax liability, so long as they continue to pay said tax liabilities, they would avoid criminal penalties.23See I.R.C. § 7201. I call this overstep of lenity’s purpose the “too much lenity” problem.

On my analysis, the central theoretical issue with the rule of lenity is the discrepancy between the rule’s purpose and outcome. The rule of lenity’s purpose is to ensure fair notice about which actions are punished under the law.24Notice of the laws that govern individuals has long been held to be a central tenet of the rule of law. 2 St. Thomas Aquinas, Summa Theologica pt. I–II, q. 90, art. 4; 1 Jeremy Bentham, Essay on the Promulgation of Laws, and the Reasons Thereof, in The Works of Jeremy Bentham 155, 157 (Edinburgh, William Tait 1843); Lon L. Fuller, The Morality of Law 39 (1964); John Locke, Second Treatise of Government 83–84 (Richard H. Cox ed., Harlan Davidson, Inc. 1982) (1690) (noting that lack of notice leads to uncertainty about the future); Antonin Scalia, The Rule of Law as a Law of Rules, 56 U. Chi. L. Rev. 1175, 1179–80 (1989); see Roscoe Pound, Theories of Law, 22 Yale L.J. 114, 117 (1912) (noting that publication of laws demonstrating the importance of fair notice extends back to ancient Greece). This value has been considered doubly important where the laws impose criminal punishment. United States v. Fisher, 6 U.S. 358, 390 (1805); John Gardner, Introduction to H.L.A. Hart, Punishment and Responsibility: Essays in the Philosophy of Law, at xiii, xxxix–xliii (2d ed. 2008) (putting forth that rule of law values “apply with particular force to the criminal law” because punishment is intended to inflict suffering on the punished and clarity in law makes statutes more effective in guiding action); Scalia, supra, at 1180. The rule’s outcome, as a canon of statutory interpretation, is to determine the meaning of a statute.25See infra note 60 and accompanying text. The rule of lenity has a linguistic outcome, but a non-linguistic purpose. Thus, lenity’s purpose and outcome are not consistent with one another.26See infra Part II.

This application of lenity as a canon of statutory interpretation, which I call the “semantic rule of lenity,” is incongruous with its normative purpose of fair notice in criminal law, resulting in its encroachment into civil matters where no punishment is at stake. Unlike other canons of statutory interpretation, which aim to figure out the meaning of a statute, substantive canons, like the rule of lenity, aim to implement normative principles, like fair notice.27See infra Part II. Therein lies the disconnect. The purpose of the rule of lenity does not have anything to do with the ascertainment of meaning, yet the rule ends up determining the meaning of the statute.28See infra note 60 and accompanying text. The resulting problem of too much lenity demonstrates that this disconnect leads to real consequences.

But notice that this is a contingent feature of the rule of lenity. Lenity need not be applied as a canon of statutory interpretation. Its purpose merely requires us to let go those criminal defendants who never received fair notice of punishment. Other legal doctrines that require us to absolve certain defendants of guilt—for example, excuses such as insanity or duress—do not involve determining the meanings of statutes.29See, e.g., Model Penal Code §§ 2.09(1), 4.01 (Am. L. Inst., Proposed Official Draft 1962). So why should the rule of lenity perform this odd, dangerous, vestigial function of determining the meaning of statutes? If the proximate aim is to absolve defendants of liability when their actions were not unambiguously criminalized by Congress, we can and ought to do so without invoking the semantics of statutes.

Challenging the standard semantic application of lenity, I will instead argue for the unorthodox position that lenity should be reworked from a canon of statutory interpretation to an excusing condition specific to criminal law.30Other academics have proposed less radical revisions that are more amenable to agency deference such as Dan Kahan, Is Chevron Relevant to Federal Criminal Law?, 110 Harv. L. Rev. 469, 507–11 (1996). These less radical approaches, however, fail to solve the linguistic ambiguity problem and the higher-order vagueness problem outlined in Part III. In that way, lenity would be applied in the same manner as the doctrines of duress or insanity, as an affirmative defense to prosecution rather than a canon of statutory interpretation. Without any of the semantic baggage that currently burdens the rule of lenity, excuses can apply in criminal law without extending into civil law and thus avoid the too much lenity problem. For instance, when a taxpayer is just litigating the issue of how much taxes they will have to pay for such-and-such economic transaction because they disagree with the IRS about the meaning of a statute, the courts should use ordinary interpretative principles that would best allow the tax law to serve its function of justly and efficiently collecting revenue.31See infra Section III.B.2. But if that same taxpayer was being tried for tax evasion because the statute at issue was ambiguous—as § 5821 was with regard to Thompson/Center kits—then lenity should be applied as an excuse, an affirmative defense, in order to protect fair notice of punishment.32See infra Part IV.

Viewed top-down, this Article can be understood to present the following argument for my conclusion that lenity should be applied as an excusing condition in criminal law rather than as a canon of statutory interpretation: First, I demonstrate that lenity’s purpose of fair notice of punishment does not match its outcome of determining the meaning of statutes. Second, I analyze three distinct doctrinal problems that stem from this mismatch between purpose and outcome. Third, I solve these problems by showing how the legal system can unite lenity’s purpose and outcome by instituting lenity as an excuse rather than a rule of statutory interpretation. Because of this conceptual harmony, the three aforementioned problems are solved if we apply lenity as an excusing condition in criminal law. Each step presents novel contributions to the literature.

Part I explicates the rule of lenity and justifies the doctrine as upholding the structural rule of law value of fair notice. Fair notice is best understood as a structural consideration about the legal system. The laws must be structured so as to provide a path safe from punishment along which ordinary citizens can walk.33For a discussion of the theory underlying excuses in criminal law, see H.L.A. Hart, Legal Responsibility and Excuses, in Punishment and Responsibility: Essays in the Philosophy of Law, supra note 24, at 28; Sanford H. Kadish, Excusing Crime, 75 Calif. L. Rev. 257, 263–65 (1987). In our society, this path is marked by published statutes delineating which acts are permissible and which are impermissible. Fair notice is thus essential to providing a genuine choice to avoid punishment.

Part II shows that lenity’s purpose and outcome are at odds with one another. Whereas ordinary rules of statutory interpretation have the purpose of trying to figure out the meaning of a statute and the outcome of determining the meaning of a statute, the rule of lenity has the purpose of protecting criminal defendants and the outcome of determining the meaning of a statute.34See William Baude & Stephen E. Sachs, The Law of Interpretation, 130 Harv. L. Rev. 1079, 1111, 1127 (2017). Thus, while ordinary rules of statutory interpretation have a semantic purpose and semantic outcome, the rule of lenity has a semantic outcome and a non-semantic purpose.

Part III demonstrates three doctrinal problems that arise from the mismatch between the rule of lenity’s purpose and outcome.

Section III.A presents the linguistic ambiguity problem. To use a stylized example, suppose a statute ambiguously imposes criminal penalties for starting a fire next to a “bank.” Defendant A started a fire next to a financial bank. Defendant B started a fire next to a river bank. Because of the ambiguity, neither Defendant A nor B had fair notice that their actions were prohibited. However, because neither interpretation of the word “bank” lets both defendants go free, the rule of lenity cannot resolve the fair notice problem here. This is the problem of linguistic ambiguity.

Section III.B presents the too much lenity problem, introduced above. In this Section, I consider the impact of a lenity-driven tax regime both in terms of the areas of tax law where lenity is most likely to be applied and its contrast to the deference regime it replaces.

Section III.C demonstrates the problem of higher-order vagueness. Applying the semantic rule of lenity to a vague statute that prohibits a certain category of actions changes the meaning of the statute to prohibit only clear cases of that category of actions.35See Michael S. Moore, Semantics, Metaphysics, and Objectivity in the Law, in Vagueness and Law 127, 134 (Geert Keil & Ralf Poscher eds., 2016). For instance, a statute may say “do not drive dangerously,” but after the court applies the rule of lenity, the statute means “do not drive clearly dangerously.”36Though heavily simplified, the vagueness of “do not drive dangerously” is not too far off from the vagueness of actual safe driving statutes. See infra note 171 and accompanying text. The problem is that the new meaning that the rule of lenity has assigned will itself be vague. Just as vague predicates have borderline cases of which items qualify as members of the category, there is also vagueness one level up about which items qualify as borderline cases.37See Moore, supra note 35; Joseph Raz, Legal Reasons, Sources, and Gaps, in The Authority of Law: Essays on Law and Morality 53, 73–74 (1979). If the vagueness of “do not drive dangerously” violates fair notice, then construing the statute to mean “do not drive clearly dangerously” will not satisfy fair notice because what counts as “clearly dangerous” is itself a vague matter as some driving is clearly clearly dangerous and some driving only borderline clearly dangerous. The semantic rule of lenity is thereby under-inclusive, creating vagueness at a higher-order but failing to take that second-order vagueness into account for purposes of fair notice.38See infra Section III.C.

Part IV connects the legal theory set out in Parts I and II with the doctrinal analyses of Part III to support my ultimate proposal that lenity be provided solely as an excuse in criminal law instead of its current application as a canon of statutory interpretation. In criminal law theory, excuses are most often understood in comparison to justifications, another category of affirmative defense. Whereas justifications typically serve to make an act permissible—for instance, killing another is not morally wrong if done in self-defense—excuses absolve an actor of criminal liability for their wrongful conduct when the actor lacked a genuine choice to follow the law.39See Michael S. Moore, Choice, Character, and Excuse, 7 Soc. Phil. & Pol’y 29, 32–35 (1990). For instance, a browbeater may have threatened to bust the defendant’s kneecaps unless the defendant commits a criminal act for the browbeater’s benefit. In such a situation, because the browbeater’s coercive threat left the defendant no choice in the matter, the law affords the defendant an excuse of duress.40Model Penal Code § 2.09(1) (Am. L. Inst., Proposed Official Draft 1962).

The semantic rule of lenity functions more closely to justification; by assigning a narrow meaning to a statute, it shrinks what counts as impermissible. The semantic rule of lenity, when it applies, concludes that the defendant’s actions were not prohibited by law.41See, e.g., United States v. Kozminski, 487 U.S. 931, 952 (1988). However, I argue that the purpose of lenity instead aligns most closely with that of an excuse. Though lenity may seem an unlikely bedfellow to doctrines such as duress or insanity, I demonstrate that all of these doctrines aim to protect citizens who lacked a genuine choice to follow the law. In cases such as duress, one lacks the choice because of some coercive threat. In cases of lenity, one lacks the choice because one was not given fair notice about which acts would be punished. Although, in contrast to justification, the defendants may have done some prohibited act in these cases, punishing them would nevertheless go against the rule of law principle of preserving a path safe from punishment.

By shedding lenity of its semantic cloak, jurisprudence can avoid the three aforementioned doctrinal problems. Providing lenity as an excuse rather than fixing the meaning of a statute would allow the law to absolve both Defendant A and Defendant B (from the “bank” example above) of criminal liability since both defendants lacked fair notice that their actions would be punished. By restricting lenity to criminal law, taking the form of an excuse stops lenity from creeping into civil law, thereby solving the too much lenity problem. Because the excuse would not determine the meaning of the statute, issues of higher-order vagueness do not require additional iterations of lenity, thereby solving the higher-order vagueness problem.

Part V considers two counterarguments to the excuse of lenity. The first counterargument states that the excuse is unnecessary because strict construction of the civil tax code42By “tax code” and “the Code” I mean to refer to the Internal Revenue Code of 1986, as amended. is good jurisprudence. In response, I analyze the ways that the teleology of tax law is distinct from the teleology of criminal law. Tax law helps citizens figure out how much to contribute to the public fisc as a matter of distributive justice.43Jeesoo Nam, Taxing Option Luck, 11 U.C. Irvine L. Rev. 1067, 1115–17 (2021). Unlike criminal law, tax law is not meant to sanction prohibited behaviors—a tax on income, for instance, is not meant to morally condemn those who earn income.44Criminal law, in contrast, carries moral condemnation. United States v. Bass, 404 U.S. 336, 348 (1971) (noting also the general seriousness of criminal penalties); Doerfler, supra note 10. Since tax law is not meant to serve as a system of incentives, ex-ante notice is far less important. Furthermore, choosing strict construction over the best interpretation will undo the effort to justly allocate social burdens, to the detriment of the very people who relied on the tax law to serve this function. The second counterargument against the excuse of lenity states that a legislature could satisfy the requirement of fair notice by letting citizens know by statute that the rule of lenity will not be applied to the criminal code. I argue that such a move is tantamount to notifying the public that there will be no fair notice given.

I. THE RULE OF LENITY

At its core, the rule of lenity is a rule of statutory construction that resolves any “uncertainty concerning the ambit of criminal statutes” in favor of the defendant.45See, e.g., United States v. Kozminski, 487 U.S. 931, 952 (1988). Often, such uncertainty can arise due to linguistic indeterminacy, the most common type of which is vagueness.46See Lawrence M. Solan, Multilingualism and Morality in Statutory Interpretation, 1 Language and L. 5, 8 (2014). In these cases, the meaning of the vague statute is narrowly interpreted to include only clear, prototypical cases of the criminal statute.47Moore, supra note 35. Such narrow construction is justified by the rule of law value of fair notice. Fair notice allows citizens who wish to avoid punishment to seek safety in reading the statute and choosing to avoid those actions that carry criminal penalties.

A. Lenity’s Outcome: Statutory Construction

Indeterminacy of meaning (linguistic indeterminacy) is a universal feature across natural languages.48The legal philosopher Joseph Raz went as far as to say that not only is indeterminacy of meaning universal across natural languages, but that indeterminacy of meaning is also universal within a natural language. Raz claims, “all, and not only some, nouns, verbs, adverbs, and adjectives of a natural language are vague.” Raz, supra note 37, at 73. It is difficult to see how this could be true. As a counterexample, consider that we sometimes use the verb is to denote numerical identity, the relation between an object and itself. For instance, we may say “Superman is Clark Kent” or even “Clark Kent is Clark Kent.” The word “is” in these cases do not admit of borderline cases; of any object, once we had enough information regarding that object, we would definitively be able to say that it is or is not Clark Kent. See Gareth Evans, Can There Be Vague Objects?, 38 Analysis 208, 208 (1978) (formally proving that there can be no indeterminate cases of identity). Statutes, since they are written in natural language, sometimes have indeterminate meanings.49See Raz, supra note 37, at 73. Such instances can give rise to what we may call hard cases or legally ambiguous cases,50See Ronald Dworkin, Hard Cases, 88 Harv. L. Rev. 1057, 1057 (1975). where the statute gives no direction one way or another to those cases that straddle the indeterminacy.51This is not the only way that statutes can fail to guide. The meaning of a statute may simply be unknown to many readers. Perhaps this lack of knowledge is best exemplified by the following account of the Constitutional Convention’s discussion of the phrase “direct tax” in U.S. Const. art. I, § 9, cl. 4. See generally Ari Glogower, A Constitutional Wealth Tax, 118 Mich. L. Rev. 717 (2020) (detailing the constitutional apportionment requirement for direct taxes and the interpretative difficulties surrounding the term “direct tax”). “Mr. King asked what was the precise meaning of direct taxation? No one answered.” James Madison, Notes of the Debates in the Federal Convention of 1787, at 494 (Ohio Univ. Press 1966) (1787).

Of course, even when the statute gives no direction, a case at bar cannot go unresolved. One way to resolve a case in which there is no resolution provided by the statute itself is to have what legal theorists call a “closure rule.”52Moore, supra note 35. A closure rule simply determines which way a judge should rule when the law is unclear one way or another, acting as a tie breaker of sorts.53Id.; Lawrence M. Solan, Law, Language, and Lenity, 40 Wm. & Mary L. Rev. 57, 115 (1998). The rule of lenity is often held by jurisprudents to be a paradigm closure rule.54Moore, supra note 35. When a criminal statute fails to resolve a case because its meaning is indeterminate with respect to the facts at bar, then the judge must assign the statute a narrow meaning that favors the criminal defendant.

In law, by far the most common sort of linguistic indeterminacy arises from vagueness.55See Solan, supra note 46. Although vagueness is the most common sort of linguistic indeterminacy, it is certainly not the only sort. See id. Consider, for instance, the well-trodden “no vehicles in the park” statute:

NO VEHICLES IN THE PARK ACT: Any person who brings or drives a vehicle into a federal park shall be guilty of a misdemeanor, which may be punished by a fine.56This standard example originates from H.L.A. Hart, Positivism and the Separation of Law and Morals, 71 Harv. L. Rev. 593, 607–15 (1958). The variation I use here is closer to that from William N. Eskridge, Jr., No Frills Textualism, 119 Harv. L. Rev. 2041, 2041 (2006).

From the language of the statute, no one can deny that driving an automobile into a federal park is prohibited. In contrast, we may be quite uncertain about whether someone who pushed a wheelbarrow into a federal park is criminally liable under the statute since it is uncertain whether a wheelbarrow is or is not a vehicle in this context. The term “vehicle” is vague since it admits of borderline cases where the application of the term “vehicle” is indeterminate.

An instance of a vague predicate is prototypical or core if and only if it is clearly a member of the predicate’s category.57See Hart, supra note 56, at 607. A borderline or penumbral case of a vague predicate is an object that is neither clearly a member nor clearly not a member.58Id. Thus, a sedan is a prototypical vehicle while wheelbarrows are borderline cases that do not clearly fall into nor outside of the vehicle category.

When it comes to vague statutes, the rule of lenity is best cashed out using this distinction between clear and borderline cases.59See, e.g., Moore, supra note 35. Under the rule, a vague criminal statute will be assigned a narrow meaning that includes only the clear cases of the vague categories. For instance, the narrow interpretation of “vehicle” includes automobiles but not wheelbarrows. The rule of lenity, like statutory interpretation more generally, is semantic in that it operates to determine the meaning of the statute.60See United States v. Thompson/Ctr. Arms Co., 504 U.S. 505, 518 n.10 (1992) (“The rule of lenity, however, is a rule of statutory construction whose purpose is to help give authoritative meaning to statutory language.”); Grewal, supra note 22, at 1053; Hickman, supra note 10, at 916–17. Statutory interpretation is ordinarily a matter of construing the meaning of a statute. Steven A. Dean & Lawrence M. Solan, Tax Shelters and the Code: Navigating Between Text and Intent, 26 Va. Tax Rev. 879, 880 (2007); see also Lawrence M. Solan, Statutory Inflation and Institutional Choice, 44 Wm. & Mary L. Rev. 2209, 2213, 2213 n.14 (2003) (“But once the courts interpret a statute . . . , the ruling becomes part of the meaning of the statute . . . .”).

Courts are supposed to employ the rule of lenity in the realm of criminal law.61William N. Eskridge, Jr., Abbe R. Gluck & Victoria F. Nurse, Statutes, Regulation, and Interpretation 494–95 (2014). Under such a rule, the destruction of a fish was not found to be a violation of a statute prohibiting the destruction of “tangible objects” in a federal investigation,62Yates v. United States, 574 U.S. 528, 528–30 (2015). and transporting a stolen airplane did not count as transporting a stolen “vehicle.”63McBoyle v. United States, 283 U.S. 25, 26–27 (1931). Though McBoyle does not mention the rule of lenity by name, it is nevertheless understood to be, and is cited for, applying the rule. E.g., United States v. Lanier, 520 U.S. 259, 266 (1997). These were, in the eyes of the court, not prototypical cases of the statutes’ language.

B. Lenity’s Purpose: Fair Notice

Courts have typically appealed to fair notice, sometimes referred to as “due-process notice,”64See, e.g., Nicholas Quinn Rosenkranz, Federal Rules of Statutory Interpretation, 115 Harv. L. Rev. 2085, 2094 (2002). There are also other justifications that appear to be distinct from the fair notice value, such as non-delegation—courts cannot legislate criminal law, United States v. Wiltberger, 18 U.S. 76, 92 (1820)—and that the rule has a long history in criminal law interpretation, Antonin Scalia, A Matter of Interpretation: Federal Courts and the Law 29 (new ed. 2018). These alternative reasons for the rule of lenity are not counterarguments to what I present herein in that their truth does not imply the falsity of my conclusions. My argument is unmotivated only if one thinks the rule of lenity is not justified by the fair notice principle. as the principal justification for the rule of lenity.65See Kahan, supra note 20. For clear statements of the fair notice principle, see Liparota v. United States, 471 U.S. 419, 427 (1985) (“[T]he rule of lenity ensures that criminal statutes will provide fair warning concerning conduct rendered illegal . . . .”); McBoyle, 283 U.S. at 27. The rule of law value of fair notice is also the most popular justification in academia. Kahan, supra note 20, at 349, 349 n.12. Punishment of criminal activity is serious both in the severity of its costs on the punished and in the moral condemnation that attaches to it.66See United States v. Bass, 404 U.S. 336, 347–50 (1971); Doerfler, supra note 10. For the exercise of the sword of government in doling out punishments to individuals, rule of law is of principal order. A central criterion of rule of law is that those who are subject to the threat of such force be given fair warning that they are under such threat.67See supra note 24 and accompanying text. In our society, such notice is primarily given by the publication of criminal statutes. But publication is only the first step. Statutory notice is fair only when the content of the prohibitions can be readily ascertained from the published statute. Thus, when Emperor Caligula posted new statutes high on the top of Roman columns to prevent the citizenry from reading them, he failed to give fair notice to his citizens.68Timothy Lynch, Introduction to In the Name of Justice: Leading Experts Reexamine the Classic Article “The Aims of the Criminal Law”, at vii, xi (Timothy Lynch ed., 2009).

Posting laws where no one can read them is not the only way to violate fair notice. Notice can be unfair due to a statute’s linguistic indeterminacy. Similar to how linguistic indeterminacy can fail to give guidance to judges on how to rule on hard cases, linguistic indeterminacy fails to provide guidance to citizens on what sort of behavior is prohibited by law. Consider the following illustration of this aspect of fair notice employing hypothetical expectations about a vague statute.

When one reads a vague criminal statute, so one version of the fair notice story goes, one thinks not of the borderline cases, but instead the prototypical cases. For instance, it is most likely that bringing a wheelbarrow into the park never crosses an individual’s mind as they read the words, “Any person who brings or drives a vehicle into a federal park shall be guilty of a misdemeanor.” The mental representation of the concepts conveyed by a statute typically does not include borderline cases.69Solan, supra note 53, at 65–75, provides a helpful look into the scientific research on how individuals cognitively represent concepts through the use of “prototypes for categories.” As a result, punishing someone for a borderline violation of a criminal statute would go against the natural reading of the statute. They would not have been given fair notice that their conduct would be subject to punishment but rather misled into thinking that they were following the law by the vagueness of the statute.70See United States v. Santos, 553 U.S. 507, 523 (2008) (noting that lenity must apply “lest those subject to the criminal law be misled”). In order to preserve the important rule of law value of fair notice, the rule of lenity requires a narrow construction of such statutes.

Leading cases on the rule of lenity often explicitly endorse a similar story regarding the expectations that readers of a vague statute are likely to have. For instance, in McBoyle v. United States, the Supreme Court applied the rule of lenity to rule that airplanes were outside the scope of the phrase “motor vehicle” as it was used in a federal criminal statute.71McBoyle v. United States, 283 U.S. 25, 27 (1931). The opinion justifies excluding airplanes from the motor vehicle category by stating that the “motor vehicle” phrase “evoke[s] in the common mind only the picture of the vehicles moving on land.”72Id. Courts are worried about the lay citizen reading a statute and naturally having only the prototypical instances come to mind.

Importantly, the analysis just described is meant to be focused on the statute itself rather than the defendant. That is, for any given statute, the test is not to see if the defendant in the instant case actually read the statute. Many, perhaps most, defendants have not.73See id.; Dru Stevenson, Toward a New Theory of Notice and Deterrence, 26 Cardozo L. Rev. 1535, 1536, 1536 n.8. Tax law presents somewhat of an exception to this general observation since individuals who aim to get around the tax law typically employ agents who do take the time to read the tax code and advise them of what is and is not permissible behavior with regard to paying one’s taxes. See Kahan, supra note 20, at 400. Thus, the expectations story is less of a fiction when it comes to tax law. Instead, the analysis looks at the statute itself and how the text comes across to the ordinary reader. If the indeterminacy of a statute risks misleading readers, the rule of lenity attempts to limit punishment in such instances by requiring a narrow construction of the statute.74See McBoyle, 283 U.S. at 27. The rule of lenity aims to correct a deficiency in the law itself.75See also id. (reinforcing the value of fair notice even if criminals do not “carefully consider the text of the law”); United States v. R.L.C., 503 U.S. 291, 309 (1992) (Scalia, J., concurring) (citing McBoyle, 283 U.S. at 27); Hart, supra note 33, at 50 (“[T]he fact that only a few people, as things are, consider the question Shall I obey or pay?, does not in the least mean that the standing possibility of asking this question is unimportant . . . .”); Paul H. Robinson, Fair Notice and Fair Adjudication: Two Kinds of Legality, 154 U. Pa. L. Rev. 335, 372 (2005).

H.L.A. Hart’s rule of law account of excusing conditions to criminal liability can provide additional theoretical grounding to the concept and value of fair notice. On Hart’s account, people should be able to avoid law’s sanctions if they so choose.76Hart, supra note 33; cf. Moore, supra note 39, at 31–40 (presenting arguments in favor of the choice theory of excuses at the individual level of moral responsibility). There is an important security provided by knowing that we will be safe from punishment so long as we choose to follow the laws set out for us.77Hart, supra note 33, at 48; see Kadish, supra note 33, at 263 (noting that on Hart’s account, excuses further “the satisfaction people derive in knowing that they can avoid the sanction of the law if they choose.”). However, if we read a statute and naturally think only of the prototypical cases, then we will think that we are following the law when we commit borderline violations of that statute. Punishments for non-prototypical violations of a criminal law statute subvert the safety of choice to follow the law. The park-goer does not think that they violate the “no vehicles in the park” statute when pushing a wheelbarrow across the park gates. If it were not for the rule of lenity, their having read the law and intention to follow it would provide no assurance that they are safe from punishment; the court could arrive at an interpretation that they had never expected by considering a wheelbarrow a vehicle.

Hart’s position here can be understood as a safe path argument. It is a minimal requirement of a legal system that it provide at least one path safe from punishment along which ordinary citizens can walk. The clearest violation of a safe path is the criminalization of both an action and its absence. For instance, suppose that criminal law both required citizens to wear a face mask and forbid citizens from wearing a mask. It may even be the case that both laws, understood separately, are reasonable—perhaps the legislature passed the first law to minimize transmissions of an infectious disease and the legislature passed the second law because the purchase of face masks by laypeople caused a shortage for healthcare workers.78One can see similar, though not identical, policy considerations at play in N95 Respirators, Surgical Masks, Face Masks, U.S. Food and Drug Admin., (June 14, 2020) (on file with author). However, having both laws at once clearly violates the safe path requirement. There would be no way for a citizen to avoid punishment in a system that punishes both an action and its absence. In this situation, we would say that there is no safe path at all.

The absence of fair notice likewise violates the safe path principle. This is because the presumed safe path for ordinary citizens is the option to read the law and avoid the prohibited acts. When fair notice is violated, for instance by the punishment of non-prototypical violations of law, this safe path is upturned. These citizens’ reading of a vague statute would mislead them into thinking that they are outside the reach of punishment only to have the rug pulled out from under their feet. The state cannot be said to have provided its citizens a genuine choice to avoid punishment because the citizens were misled about which actions would lead to punishment. Vague statutes thus compromise the availability of a path safe from punishment along which ordinary citizens can walk. Lenity aims to protect for citizens a genuine choice to avoid punishment.79Lenity, however, is not the only way to preserve a genuine choice to avoid punishment. For example, another way to preserve a “safe path” would be for the crime’s mens rea to require knowledge that one’s action is a rule violation. See infra Section III.B.1. Alternatively, one might see a company like Thompson/Center Arms Co. as following yet another safe path—it paid the required taxes, then litigated for a refund, thereby avoiding punishment. Such a method, however, is not a safe path along which ordinary citizens can walk. This maneuver is made possible in the first instance by the fact that the company noticed the indeterminacy of the statute as it came to Thompson/Center kits. As the Court expressed in McBoyle, many readers may not recognize that there is indeterminacy in a statute. Second, the costs of litigation can be prohibitively expensive, making this option practically unavailable in many instances.

In sum, it is a rule of law principle that the government may not punish an individual without having first given fair notice that such actions would be punished. Such a principle protects the ability of citizens to find out which acts are punished and avoid committing such acts. The statutory notice that government gives to citizens is fair only insofar as citizens can naturally discern which of their acts are prohibited from reading the statute. Without lenity, citizens can be misled by a vague statute into thinking that they are safe from punishment. The significance of this rule of law value has been thought by some to endow on the rule of lenity a “quasi-constitutional status” due to its role in protecting fair notice.80Kahan, supra note 20, at 346.

II. A MISMATCH BETWEEN PURPOSE AND OUTCOME

Part I, in the process of explicating the rule of lenity, has presented two propositions that deserve further consideration:(1) the rule of lenity determines the meaning of criminal statutes,81See supra note 60 and accompanying text. and (2) the rule of lenity is best justified by the normative principle of fair notice. The second proposition has to do with the rule of lenity’s purpose. The first proposition has to do with lenity’s mechanism; in order to carry out its purpose of fair notice, the rule of lenity stipulates a narrow meaning to a linguistically indeterminate statute. Though each proposition is well-accepted—one would not have trouble finding Supreme Court opinions82See, e.g., United States v. Thompson/Ctr. Arms Co., 504 U.S. 505, 518 n.10 (1992); Liparota v. United States, 471 U.S. 419, 427 (1985); McBoyle v. United States, 283 U.S. 25, 27 (1931). or law school casebooks83See, e.g., Eskridge et al., supra note 61, at 494–95; Sanford H. Kadish, Stephen J. Schulhofer, Carol S. Steifer & Rachel E. Barrow, Criminal Law and Its Processes 159–60 (9th ed. 2012). that repeat these truths—it nevertheless seems to me that the two propositions are at odds with one another. Lenity’s purpose is normative, but its outcome is semantic.

By definition, to interpret a text is to ascertain its meaning.84See, e.g., Jack M. Balkin, Framework Originalism and the Living Constitution, 103 Nw. U. L. Rev. 549, 559–60 (2009). The rule of lenity is not an attempt to ascertain the meaning of a statute; it instead stipulates a narrow meaning to a statute in order to protect fair notice.85See supra Part II. Thus, it is odd that the rule of lenity, which does not even purport to ascertain the meaning of a statute, is nevertheless a canon of statutory “interpretation.”86Eskridge et al., supra note 61, at 494. If the purpose of the rule of lenity is something other than figuring out the meaning of a statute, then why does it end up determining the meaning of the statute?

This oddity of the rule of lenity may be best understood in contrast to more ordinary canons of statutory interpretation. For example, many canons rely on “maxims of word meaning”87Id. at 450–57. or rules of grammar88Id. at 458–64. to help piece together the meaning of a text. For these canons, their purpose and outcome are aligned. These rules rely on linguistic premises to ascertain the meaning of a text,89See id. at 448–49. so it makes sense that the outcome of applying these rules is to determine the meaning of statutes.

Canons of statutory interpretation can be analytically divided into three categories. While textual canons “[find] meaning from the words of the statute” and reference canons determine “what other materials might be consulted to figure out what the statute means,” substantive canons like the rule of lenity instead implement normative principles external to the task of interpretation like fair notice.90Id. Substantive canons, in contrast to the other two types of canons, are not concerned with “finding” or “figuring out” what the statute means.91See id.; see William Baude & Stephen E. Sachs, The Law of Interpretation, 130 Harv. L. Rev. 1079, 1111, 1127 (2017). Substantive canons are grounded in normative policy principles rather than interpretative principles.

Putting these distinctions to work, one can only conclude that the rule of lenity is a canon of statutory “interpretation” in name only. The purpose of the rule of lenity is to protect criminal defendants who failed to receive fair notice that their conduct would be punished.92See supra Section I.B. Rather than interpreting a text, the rule assigns the words of a statute narrow meaning in order to implement normative principles concerning rule of law values. The canon is not a rule of interpretation properbecause it never seeks to interpret, that is, ascertain the meaning of, a statute.93See also Balkin, supra note 84 (aligning “interpretation proper” with “the ascertainment of meaning”). The rule of lenity has a semantic outcome—determining the meaning of a statute—which is flatly inconsistent with its non-semantic purpose.

Notice also that the problem I have outlined here does not depend on any particular theory of statutory interpretation. The discrepancy between the rule of lenity’s purpose and outcome relies only on the distinction between figuring out a meaning and stipulating a meaning. The rule of lenity stipulates the meaning of a statute instead of trying to figure out what the statute means. On no theory of statutory interpretation is providing fair notice for criminal defendants a way of figuring out the meaning of a statute.94See Eskridge et al., supra note 61, at 318–46 (explicating various views about textualist and purposive approaches to statutory interpretation). Providing fair notice is, on its face, neither a way of getting at the plain or ordinary meaning of a text nor uncovering the purpose of a statute, so it cannot be understood as either a textualist or purposive doctrine of interpretation.95See id. at 301. The discrepancy between the rule of lenity’s purpose and outcome should worry legal scholars of all stripes.

III.  THREE DOCTRINAL PROBLEMS

The theoretical disconnect between lenity’s purpose and outcome just outlined in Part II entails thorny doctrinal consequences. This Part explores three such doctrinal consequences: the rule of lenity cannot handle linguistic ambiguity,96See infra Section III.A. the rule oversteps its boundaries and enters civil law,97See infra Section III.B. and the rule fails to resolve issues of fair notice that result from higher-order vagueness.98See infra Section III.C.

A. Linguistic Ambiguity

Consider the following hypothetical. The word “bank” may refer to either the financial institution (“financial bank”) or the land next to a river (“river bank”). Suppose the Bank Safety Act criminalizes starting a fire within one hundred feet of a bank, and it is indeterminate which of the two meanings should be applied to the term “bank.” As argued in Section I.B, such indeterminacy of meaning violates the principle of fair notice. Suppose further that two defendants are on trial, Defendant A for having set fire next to a financial bank, and Defendant B for having set fire next to a river bank.

The rule of lenity states that an indeterminate text must be interpreted in favor of the defendant.99See United States v. Kozminski, 487 U.S. 931, 952 (1988); Lawrence M. Solan, Pernicious Ambiguity in Contracts and Statutes, 79 Chi.-Kent L. Rev. 859, 861 (2004). But which one? Giving the statute either meaning will absolve one of the defendants but still condemn the other. The rule of lenity is like the Buridan’s ass unable to choose between two identical stacks of hay. If the court rules that “bank” refers to financial banks, then Defendant A will be held criminally liable, and if the court rules that “bank” refers to river banks, then Defendant B will be held criminally liable.100Recall that the rule of lenity assigns meaning to a statute, see supra Section I.A, and a statute can have just one meaning, United States v. Santos, 553 U.S. 507, 522–23 (2008). This is because the statute’s indeterminacy arises from linguistic ambiguity rather than vagueness.

Linguistic ambiguity should be understood as distinct from another kind of ambiguity discussed earlier, what one might call legal ambiguity.101See supra note 50 and accompanying text. What judges and practicing lawyers most often mean when they use the term ambiguity is a general kind of uncertainty about the application of a statute.102See Solan, supra note 99. Legal ambiguity can arise for a variety reasons. One such reason for legal ambiguity, discussed in the previous part of this Article, is the vagueness of language.103See supra note 55 and accompanying text. Another reason for legal ambiguity is the kind of linguistic indeterminacy we saw with the two meanings of bank, what I refer to here as linguistic ambiguity.

Vagueness in language concerns how far out to draw the boundaries of certain terms, for instance how broadly we draw the category of manufacturing a short-barreled firearm.104Solan, supra note 46. When linguists use the term ambiguity, they are instead referring to terms that can have disparate meanings altogether, such as the two possible meanings of the term bank.105Id. Put succinctly, vagueness concerns interpretations that differ in degree while ambiguity concerns interpretations that differ in kind. Whereas in cases of vagueness, the court can choose between broad and narrow readings because the narrow reading is a proper subset106“Set A is a proper subset of set B (A ⊂ B) if all of the elements of set A are members of set B, but there is at least one element of set B that is not a member of set A (A ≠ B).” Proper Subset, Mathematics Glossary, http://www.learnalberta.ca/content/memg/Division03/Proper%20Subset/
index.html [https://perma.cc/T6N9-AQJH].
of the broad reading, in cases of ambiguity, there is no narrow interpretation because neither the river bank meaning nor the financial bank meaning is a proper subset of the other. Either reading of bank holds one defendant culpable while letting the other go free.

Plainly, this result of the rule of lenity is inconsistent with the demands of the rule’s fair notice purpose. Neither Defendant A nor Defendant B had fair notice that their action was punishable because the statute was ambiguous between their two readings. One could read the Bank Safety Act and come away thinking that it permits starting fires next to financial banks or come away thinking that it permits starting fires next to river banks. Given the indeterminacy of meaning, both are natural readings of the statute. The law does not clearly mark the path safe from punishment. Since neither defendant received fair notice, it would be unfair to punish either defendant.

Thus, the rule of lenity’s outcome is under-inclusive with respect to its purpose. Though the rule of lenity’s purpose of protecting fair notice would dictate absolving both defendants of criminal liability, its semantic outcome is unable to provide such a result.107This also means that the rule of lenity cannot be a closure rule, a rule that dictates for judges how to resolve cases where the law is unclear, since there is a class of cases (namely linguistic ambiguity cases) where the rule of lenity does not provide any resolution. See Part I.A. for a discussion of closure rules.

At this point, the astute reader might raise the following objection. Thus far, by focusing on the fact that bank has just two possible meanings, I have obscured a third option that would work best. When it comes to linguistically ambiguous statutes, the objection states, the rule of lenity should say that the statute has no meaning at all. That is, the Bank Safety Act should be construed not to criminalize any behavior because its use of the term bank has no meaning. Following this rule, both Defendants A and B would go free, and the result would thus comport with the demands of fair notice.

In response to this objection, suppose that there is a third defendant, Defendant C. Defendant C started a fire next to a financial bank that happened to be located on a river bank. On either meaning of bank, Defendant C is guilty and, thus, had fair notice their actions were prohibited by law. Defendant C cannot possibly claim that the ambiguity in the statute would mislead someone into thinking that their actions were permissible. If the court construes the Bank Safety Act to have no meaning at all, then it would let Defendant C go, despite the fact that they had fair notice of punishment. The rule would still fail to serve its purpose.

The hypothetical Bank Safety Act demonstrates one way the disconnect between the rule of lenity’s purpose and outcome could lead to its being under- or over-inclusive, but a critic may nevertheless contend that such ambiguities appear rarely in the actual law. When will a reader actually be faced with the term “bank” in a statute and be unable to figure out whether it refers to river banks or financial institutions? Usually, the context and purpose of a statute will make one meaning the clearly right interpretation for a linguistic ambiguity, thereby eliminating any indeterminacy.108See Lawrence M. Solan, Linguistic Issues in Statutory Interpretation, in The Oxford Handbook of Language and Law 87, 89 (Lawrence M. Solan & Peter M. Tiersma eds., 2012).

In part, I agree with the critic and, in part, I disagree. I concede I have no quantitative measurement of how often courts are faced with linguistic ambiguity, so these cases may indeed be rare. Scholars have noted real examples where the courts have had to interpret linguistically ambiguous statutes, but it is not obvious how often such ambiguities appear.109Lawrence M. Solan, The Interpretation of Legal Language, 4 Ann. Rev. Linguistics 337, 342–43 (2018). Where I disagree with the critic is that I fail to see how this is a criticism. There is, at minimum, a conceptual problem at issue—the rules and principles of our legal system fail to conceptually form a coherent whole. The hypothetical example I used here lays bare a real incoherence in our legal system. Uncovering this previously unnoticed incoherence deepens our understanding of the rule of lenity. Moreover, even if linguistically ambiguous statutes are rare, the incoherence of the rule of lenity will still have other critical real-world consequences as the next Section of this Article will show.

B. Tax Law’s Rule of Lenity

Incongruous with its purpose to provide fair notice of punishment, the rule of lenity leads to narrow constructions of texts even outside of the criminal context. For instance, in United States v. Thompson/Center Arms Co., the Supreme Court, relying on the rule of lenity, assigned a narrow meaning to the phrase “making of a firearm” with regard to an excise tax levied on the manufacture of firearms under I.R.C. § 5821.110United States v. Thompson/Ctr. Arms Co., 504 U.S. 505, 518–19 (1992). The statute’s definition for “firearm” included short-barreled rifles, but excluded pistols and long-barreled rifles. The taxpayer packaged as one unit three parts that could be connected together: a shoulder stock, a pistol, and a barrel extension. As before, let us call this unit of three parts a “Thompson/Center kit.” Putting the three parts together would create a long-barreled rifle, on which no excise tax is laid. Putting just the shoulder stock and pistol together would create a short-barreled rifle on which excise tax is laid.

The Court stated that the manufacture of a Thompson/Center kit was not clearly an instance of making a firearm, but was also not clearly not an instance of making a firearm.111Id. at 513–17. Its next move, surprisingly, was to apply the rule of lenity. The Court assigned a narrow meaning such that only clearly making a firearm would count under § 5821. Since making the Thompson/Center kit is not clearly an instance of making a firearm, § 5821 does not here apply. Therefore, under the rule of lenity, the taxpayer need not pay any excise tax on the manufacture of a Thompson/Center kit.

The imposition of tax is a civil matter, not a criminal one. Thompson/Center Arms had paid the excise tax and was merely bringing suit to get a refund of those payments.112Id. Recall that the rule of lenity was justified under the context of punishment and the special kind of notice that the harshness of punishment demands.113See supra note 24 and accompanying text. It seems no more appropriate to apply the rule of lenity in a civil matter than it would to apply a beyond a reasonable doubt standard of evidence to civil trials.114See also Thompson/Ctr. Arms, 504 U.S. at 525–26 (Stevens, J., dissenting) (analyzing the incongruence between the purposes of civil law and the rule of lenity). So why was the rule of lenity being used within the context of tax law? The Court’s winding reasoning proceeds as follows. To begin, § 5871 imposes criminal penalties for nonpayment of the § 5821 excise tax on firearms. I.R.C. § 5821 is, by that fact, both a criminal statute and a tax statute (or “dual-purpose statute”).115Id. at 518 n.10. Therefore, the rule of lenity should apply to § 5821 within the context of criminal law to assign a narrow meaning to the phrase “making of a firearm.” The meaning of a single statute cannot fluctuate depending on what the statute is being used for.116United States v. Santos, 553 U.S. 507, 522–23 (2008). Other cases also echo this point. Justice Stevens, in his dissent for Thompson/Center, states that we should cabin the rule of lenity to criminal law. Thompson/Ctr. Arms, 504 U.S. at 525 (Stevens, J., dissenting). None of the other Justices agreed. I side with the eight Justices on the linguistic point, though I side with Justice Stevens that lenity must be cabined. See infra Section IV. This principle of consistency in statutory interpretation is well grounded.117But Ryan Doerfler argues that dual-purpose statutes sometimes have multiple meanings: one meaning in the civil context and another meaning in the criminal context. Doerfler, supra note 10, at 228–38. Given the technical nature of this topic, I would need a separate essay to address the multiple meanings argument in full. For the moment, I merely relegate a brief summary of my disagreement to this footnote. Almost everyone holds the Thompson/Center view of interpretation that statutes are univocal, with just one meaning across different contexts. Doerfler himself speaks as though almost everyone agrees that statutes are univocal—presumably, such universal assent is what makes Doerfler’s contrary conclusion so interesting. Id. at 213, 216–18, 223 (stating that courts would find Doerfler’s own conclusion to be “madness”). Central to Doerfler’s claim is his premise that Congress sometimes intended multiple meanings. Id. at 243. But how can Congress have the intention for multiple meanings if everyone believes that statutes have just one meaning? As a general principle, one cannot intend what one believes will fail. See generally Stephanie Rennick, Things Mere Mortals Can Do, but Philosophers Can’t, 75 Analysis 22, 23–24 (2015) (noting that this necessary condition for intention is widely accepted). For instance, I cannot intend to jump from the sidewalk to the roof of a skyscraper because I know I will not make it. (If you have doubts, I urge you to form such an intention yourself.) Similarly, legislators should believe readers will not interpret their statutes to have multiple meanings since the generally accepted view of interpretation, as mentioned above, is that statutes have just one meaning. Therefore, applying the principle that one cannot intend what one believes will fail, legislators cannot intend their statutes to communicate multiple meanings. (Using technical language of utterances, types, and tokens, it is easier to state this proposition more precisely: though a single utterance type may have multiplicity of meaning depending on context, a single utterance token cannot.118Lawrence B. Solum, The Fixation Thesis, 91 Notre Dame L. Rev. 1, 38 (2015). Types are the general abstractions, and tokens are the “particular concrete instances.” Id. at 37. Thus, the word “I,” qua type, could refer to any speaker of the term. Since the identity of the speaker is a feature of the context under which the word is being used, we say that the term “I” is context dependent. The word “I,” when used within a particular context, qua token, only refers to one person, the actual speaker. The same can be said of legal expression types, such as the Model Penal Code, which is replicated across many tokens by the state-by-state uptake of the model. Id. Note also that I use the term “utterance” broadly to encompass inscriptions. For helpful further discussion of the type-token distinction in the context of constitutional interpretation, see id. at 35–41. ) Therefore, if the rule of lenity requires assigning a narrow meaning to “making of a firearm” in the criminal law context, the narrow meaning assigned to “making of a firearm” applies to cases of civil tax law as well.

Thompson/Center stands for the principle that the rule of lenity properly applies to dual-purpose statutes. This abstract principle has left unresolved the concrete questions of exactly how lenity will change the interpretation and administration of civil law. Does lenity apply to all tax statutes? Where lenity does apply, what is its effect, counterfactually speaking? Though there is a lot of uncertainty in this area of jurisprudence, the following Sections analyze these two questions in order.

It should be noted that there are also dual-purpose statutes outside of the tax realm in areas ranging from securities law to environmental law, where violations of civil law can carry criminal penalties.119Doerfler, supra note 10 at 221. Thus, I intend for my analysis of lenity in tax law to be valuable in itself as well as serving as an illuminating case study for the problem more generally across the variety of dual-purpose statutes in the law.

1The “Willfulness” Requirement

The dual-purpose nature of tax law will serve as the starting point of the inquiry.120See Solan, supra note 60, at 2237–51 (discussing the limited application of the rule of lenity to dual purpose statutes). I.R.C. § 5821 is not the only statute that carries criminal penalties for non-compliance. I.R.C. §§ 7201 and 7203 assign criminal penalties to nonpayment and evasion of any tax imposed under the tax code, Title 26.121See Hickman, supra note 10, at 938–40. Similar provisions assign criminal penalties for various procedural violations.122E.g., I.R.C. § 7202. So, one might reasonably conclude that the rule of lenity ought to apply to the interpretation of tax laws generally.

However, the Court in Thompson/Center implies that the rule of lenity need not be applied to all tax laws because § 7201 and related statutes can only be violated if the taxpayer acts “willfully.”123United States v. Thompson/Ctr. Arms Co., 504 U.S. 505, 506 (1992). The willfulness requirement of § 7201 already builds in notice as a pre-condition of punishment since the willfulness requirement is a requirement that the taxpayer know of and understand the law that they are breaking. Though the opinion is not explicit about either the rule or the underlying principle, the Court appears to be taking the position that the willfulness requirement satisfies the requirement of fair notice,124However, the protection provided by the willfulness requirement is not strictly greater than the protection provided by the rule of lenity. Suppose that Thompson/Center Arms Co. was being tried criminally, and we had conclusive proof that the company believed it was breaking the law when it did not pay any excise taxes on the manufacture of Thompson/Center kits. This would be an instance of a defendant believing it is violating the law when it is only doing so with a borderline case. In such an instance, applying only the rule of lenity would provide protection for the defendant and applying only the willfulness requirement will not. Therefore, the protection provided by the willfulness requirement is not strictly (in the logical sense) greater than the protection provided by the rule of lenity; there are some cases in which defendants would prefer a rule of lenity over the willfulness requirement. The argument that willfulness acts as a dam against applying the rule of lenity must instead be grounded in the notion that fair notice exists to protect expectations. Insofar as the defendants believed they were breaking the law, it violates no expectation to punish them. As we have seen, however, the notice value is best understood as a structural claim about the legal system itself rather than any particular defendant. See supra Part I. so no application of the rule of lenity to the general tax law is required.125See Thompson/Ctr. Arms, 504 U.S. at 506–18; cf. United States v. O’Hagan, 521 U.S. 642, 644 (1997) (stating that a willfulness requirement negates what would otherwise be unfairness from applying an “indefinite” statute).

The Thompson/Center opinion’s use of the willfulness requirement as a dam against applying the rule of lenity is colorable, but not without cracks. The first crack in the dam is that not all tax statutes require willfulness. Thompson/Center presented just such a case, as § 5871 had no willfulness language. In those instances, it is clear that the rule of lenity should apply. The second crack in the dam is that it is not some necessity of tax law that its violations be punished only if such violations are willful. The willfulness requirement of tax law, as this Section argues, is contestable and contingent.

Generally, ignorance of the law is not an excuse.126Some lawyers may more easily recognize this doctrine in its Latin formulation as ignorantia legis neminem excusat. E.g., Vartelas v. Holder, 566 U.S. 257, 280 (2012) (Scalia, J., dissenting) (setting out the principle that ignorance of law is no excuse and its Latin formulation). One is not released from criminal liability for not having known about the existence of a law criminalizing that particular conduct. Justice Oliver Wendell Holmes gave an oft-cited defense of the doctrine, “to admit the excuse at all would be to encourage ignorance where the law-maker has determined to make men know and obey.”127Oliver Wendell Holmes, Jr., The Common Law 41 (Routledge 2019) (1881). Such a deterrence rationale satisfies the utilitarians.128See also Joshua Dressler, Understanding Criminal Law 159–60 (8th ed. 2018) (characterizing Justice Holmes’s “most commonly accepted explanation for the general no-defense rule” as utilitarian). See generally Jeremy Bentham, An Introduction to the Principles of Morals and Legislation (Batoche Books 2000) (1781) (discussing the general principles of utilitarian views on punishment). Retributivists, in contrast, have tended to consider the mistake of law doctrine a thornier problem.129Though most contemporary legal philosophers are retributivists, mistake of law has mostly received utilitarian justifications. Douglas Husak, Mistake of Law and Culpability, 4 Crim. L. & Phil. 135, 135–36 (2010). In particular, retributivists have found the lack of an excuse for mistake of law unfair when applied to mala prohibitaoffenses130Dressler, supra note 128, at 158; Michael L. Travers, Comment, Mistake of Law in Mala Prohibita Crimes, 62 U. Chi. L. Rev. 1301, 1322–23 (1995). and in cases where the defendant was not culpable for his ignorance of the law.131Dressler, supra note 128, at 159; Husak, supra note 129, at 139 (characterizing the state of the literature and proposing some rebuttals).

Tax law presents an exception to the rule that mistake of law does not excuse. I.R.C. § 7201 provides that “[a]ny person who willfully attempts in any manner to evade or defeat any tax imposed by this title or the payment thereof shall, in addition to other penalties provided by law, be guilty of a felony.”132I.R.C. § 7201. The “willfulness” requirement of § 7201, as interpreted in Cheek v. United States, is a legislative exception to the rule that ignorance of law does not excuse.133Cheek v. United States, 498 U.S. 192, 205 (1991) (“[T]he willfulness requirement in the criminal provisions of the Internal Revenue Code . . . require[s] proof of knowledge of the law.”). However, statutes explicitly requiring willfulness are not always interpreted this way. At times, such statutes are construed to require only knowledge of the facts rather than the law. See, e.g., United States v. Overholt, 307 F.3d 1231, 1246 (10th Cir. 2002).

As the court stated in Cheek, the central idea behind this exception is that “[t]he proliferation of statutes and regulations has sometimes made it difficult for the average citizen to know and comprehend the extent of the duties and obligations imposed by the tax laws.”134Cheek, 498 U.S. at 199–200. Knowledge of a body of law as complicated as tax law requires either ability and effort devoted to understanding the requirements of tax law or the resources to hire an able person who has devoted time to studying the tax law. When the barrier to knowledge of tax law is so high, it would be unfair to punish individuals who have violated the tax law due to ignorantia legis.135See Dressler, supra note 128, at 158, 164–65. The Court’s reasoning thus echoes the aforementioned retributivists’ fairness concerns regarding cases in which defendants are not culpable for their ignorance of the law. For this reason, knowledge of law has been understood to be required for criminal liability across cases interpreting several criminal tax statutes.136Sharon L. Davies, The Jurisprudence of Willfulness: An Evolving Theory of Excusable Ignorance, 48 Duke L.J. 341, 344, 344 n.10 (1998).

Whether ignorance of tax law should be an excuse is a matter of balancing costs and benefits of such a rule. As the law becomes more complex, the unfairness of punishing a mistake of law increases.137Dressler, supra note 128, at 158. However, exempting mistake of law cases adds costs to the litigation process and lowers deterrence effects when people who know the law can credibly claim in court that they did not.1381 John Austin, Lectures on Jurisprudence: Or the Philosophy of Positive Law 498 (4th ed. 1879); Dressler, supra note 128, at 165 (“Courts would become hopelessly enmeshed in insoluble questions regarding the extent of a defendant’s true knowledge of the relevant law.”). Whereas legislatures have typically found that the balance tips against allowing ignorance of law as an excuse to criminal liability generally, Congress has found the balance tips in favor of allowing ignorance as an excuse when it comes to issues of tax law.139Though Congress has shown ready willingness to amend willfulness statutes in other areas when the courts have interpreted them to require knowledge of law, John Shepard Wiley Jr., Not Guilty by Reason of Blamelessness: Culpability in Federal Criminal Interpretation, 85 Va. L. Rev. 1021, 1077 (1999), there has been no such amendment to I.R.C. § 7201.

Reasonable minds, of course, can disagree with Congress about the outcome of the cost-benefit analysis. The cost-benefit balancing is contingent on not only the complexity of law and our valuation of the competing normative principles, but also the positive facts. For instance, suppose that the Treasury Department could provide a pre-populated tax return for low- and middle-income individuals. State-level implementation in California has been successful in providing pre-populated returns for those with simple tax situations,140Randall Stross, Why Can’t the I.R.S. Help Fill in the Blanks?, N.Y. Times (Jan. 23, 2010), http://www.nytimes.com/2010/01/24/business/24digi.html [https://perma.cc/9KSA-4U2E]. Those with simple tax situations amount to roughly forty percent of California taxpayers. Joseph Bankman, Simple Filing for Average Citizens: The California ReadyReturn, 107 Tax Notes 1431, 1431 (June 13, 2005). and pre-populated tax returns could plausibly be implemented at the federal level as well,141Implementation at the federal level would require solving a few procedural issues, the main issue being “the lack of timely wage data at the federal (although not the state) level.” Bankman, supra note 140, at 1434. the proposal even having been a part of then-Senator Barack Obama’s presidential campaign platform.142Stross, supra note 140. Furthermore, most developed nations have return-free filing for low- and middle-income taxpayers, and such a system is not outside the realm of possibility in the United States.143Bankman, supra note 140, at 1434 (noting that such a reform would require certain changes in substantive tax law for accurate withholding at the source of income). If we were to resolve the compliance difficulties currently in our system for low- and middle-income taxpayers, then the case for removing willfulness becomes much stronger,144Cf. Cheek v. United States, 498 U.S. 192, 202 (1991) (noting that a tax return form and attached instructions could serve as evidence of knowledge of the contents of the instructions). perhaps overwhelming the reasons for keeping the willfulness requirement. If the Treasury were to do all of the legwork for the taxpayer, then complying with the tax law would require no greater intellectual sophistication than following criminal law generally.

Regardless of how one would, from one’s preferred moral and political valuations, balance the costs and benefits, I take it that we all agree that if the balance of reasons weighed against the willfulness requirement, Congress should be able to revise the language of I.R.C. § 7201 (and corresponding criminal tax statutes) to delete the word “willfully.” An amendment by the legislature that ignorance of the law does not absolve one of criminal liability in tax law, which is a matter of retributive justice, should not have enormous implications for the distribution of tax liabilities, a matter of distributive justice. Yet this is precisely the consequence of the too much lenity problem.

2. From Deference to Strict Construction

Following the question of to which statutes the rule of lenity will apply, the second question is what effect such an application will have when the rule does apply to a dual-purpose statute. Recall that the rule of lenity requires finding in favor of the defendant when the law is unclear. This interpretative stance is striking as an approach to tax law. Indeterminacy is a persistent problem for statutes, and the tax code is no exception.145See Hickman, supra note 10, at 908. Moving from an approach of uncovering the best interpretation of tax statutes to a taxpayer-wins approach in hard cases is a harsh blow to the tax law’s aims. An application of the rule of lenity is particularly harmful to the IRS’s enforcement efforts, as it is in the gaps of legal ambiguity where tax shelters thrive.146Id. at 932.

This issue becomes the clearest when comparing the rule of lenity to the general doctrines granting deference to the Treasury, and by extension the IRS, in interpreting the tax law. This comparison serves to analyze the counterfactual, the interpretative approach that would govern were lenity not to apply. An examination of the counterfactual brings to light just how starkly lenity contrasts in terms of both its purpose and effect.

Generally, when statutes are ambiguous, administrative agencies are granted deference (often called “Chevron deference”) by the courts in the agencies’ interpretation of the statutes they administer.147The deference regime is complex, but it begins with the case law Chevron U.S.A. Inc. v. Natural Res. Def. Council, Inc., 467 U.S. 837, 842–44 (1984); Nat’l Muffler Dealers Ass’n v. United States, 440 U.S. 472, 476–77 (1979); Skidmore v. Swift & Co., 323 U.S. 134, 139–40 (1944). The deference given to the IRS helps it to fill in the gaps of statutes in a way that comports with the aims of the tax code, collecting revenue in a just and efficient manner.148See also Hickman, supra note 10, at 909 (noting the role of deference in stopping abusive tax shelters).

In contrast, the Department of Justice, which prosecutes federal criminal offenses, receives no such deference in its interpretation of criminal statutes. Instead, it is well established that to afford it deference would be to run completely opposite the rule of lenity. Whereas the rule of lenity is a pro-defendant approach to interpretation, affording deference to the Justice Department would be pro-prosecution. As Justice Scalia has put it, to afford deference to the Justice Department would “turn the normal construction of criminal statutes upside-down” into “a doctrine of severity.”149Crandon v. United States, 494 U.S. 152, 178 (1990) (Scalia, J., concurring).

The doctrines of deference and lenity clearly juxtapose two distinct considerations about the right approach to take with regard to gaps in the law.150Hickman, supra note 10, at 912–17. On the lenity account, legal indeterminacy represents a rule of law failure and, in order to protect a path safe from punishment, citizens who fall under that penumbra cannot be punished. This account makes sense given the role of criminal law in carrying out retributive justice aims of punishment and moral condemnation. On the deference account, the gaps in the law ought to be filled by the expert, policy-driven approach of administrative agencies.151Id. at 932. This account makes sense given the role of tax law in coordinating distributive justice and revenue-raising functions. By cabining lenity to criminal law and deference to civil law, these opposing doctrines would have been kept aligned to their respective purposes, but under the Thompson/Center holding, lenity would apply to dual-purpose statutes that are being interpreted in the civil context. Even in cases that solely determine civil tax liability, instead of the interpretive regime that would best carry out the purposes of the tax law, the courts must employ a rule built to protect criminal defendants. The rule of lenity is incongruous with its purpose.

Without the kind of policy-driven approach permitted by Chevron, it is hard to imagine that there can be effective policing of tax shelters.152Id. at 932–33 (citing Chevron, 467 U.S. at 844–45). In order to distinguish between abusive tax shelters and permissible tax planning, the agencies must look to the general purpose of the tax laws. This is because tax shelters follow the letter of the tax law while going against the fundamental spirit of the tax code.153Dean & Solan, supra note 60, at 882–83; Steven A. Dean, Lawrence M. Solan & Lukasz Stankiewiez, Text, Intent and Taxation in the United States, the United Kingdom and France, in The Routledge Companion to Tax Avoidance Research, 139, 146 (Nigar Hashimzade & Yuliya Epifantseva eds., 2018); see also Noël B. Cunningham & James R. Repetti, Textualism and Tax Shelters, 24 Va. Tax Rev. 1, 2, 4 (2004) (noting that the rise of textualism has led tax advisors to be more aggressive in planning tax structures that go against the underlying purpose of tax law). Whereas deference allows the IRS to interpret statutes in line with the spirit of the law, lenity swings much closer to the textualist “letter of the law” interpretation. Foreign jurisdictions applying ordinary meaning textualist approaches to interpretation have struggled to strike down tax shelters,154Cunningham & Repetti, supra, note 153 at 27. and Thompson/Center threatens the same for the US system.155Cf. Dean & Solan, supra note 60, at 903–04 (noting that the most effective interpretative approach against tax shelters would swing far towards the purposive side of the spectrum).

As with the willfulness dam limiting the statutes to which lenity applies, the Court has partly walled off the deference due to some agency interpretations from Thompson/Center’s assault.156Nicholas R. Bednar & Kristin E. Hickman, Chevron’s Inevitability, 85 Geo. Wash. L. Rev. 1392, 1426 (2017) (noting also that the relationship between lenity and Chevron is still unresolved). Though the tension is not yet fully resolved, the Supreme Court has laid out a middle way between the two competing doctrines for some dual-purpose statutes in Babbitt v. Sweet Home Chapter of Communities for a Greater Oregon, such that not all administrative interpretations will be stripped of deference.157Babbitt v. Sweet Home Chapter of Cmtys. for a Great Or., 515 U.S. 687, 704 n.18 (1995) (citing United States v. Thompson/Center Arms Co., 504 U.S. 505, 517–18 (1992)). According to the middle way, agency interpretations of dual-purpose statutes will still be granted deference if they satisfy fair notice principles. In Sweet Home, the Court noted that the agency interpretations satisfied fair notice because they came in the form of regulations that had been published for twenty years.158Id.

But the Sweet Home middle way is limited. Not all administrative interpretations come by longstanding published regulations. Thompson/Center, for instance, presented a case in which no regulations were present. In the spaces where the IRS has not passed longstanding, formal, law-like regulations or has passed regulations with language itself subject to competing interpretations, it appears that fair notice will not have been provided.159Hickman, supra note 10, at 923. These gaps are significant.160See also Chirelstein & Zelenak, supra note 12 (“As Congress closes one loophole, tax shelter designers find other glitches in the Code around which to build new shelters.”); Dean & Solan, supra note 60, at 904 (noting the importance of dealing with “individual shelters”). Practitioners (or indeed anyone familiar with the tax system) would vouch for the importance of informal, nonbinding IRS guidance on tax matters.161See Hickman, supra note 10, at 942. Abusive transactions exploiting legal ambiguities in the tax code are often noticed by the IRS only after a taxpayer has engaged in such transactions.162Id. at 932. For these cases, Thompson/Center would severely hinder the Service’s efforts in effectuating the purpose of the tax laws by shifting from a deference regime to lenity.163See id. at 942 (demonstrating the impact of lenity on IRS enforcement efforts).

Furthermore, the Sweet Home approach to deference has also drawn academic criticism for failing to coincide with the non-delegation principle, which would confine the morally laden task of drafting criminal law statutes to elected officials in the legislature.164Id. at 922–23. Chevron is essentially a delegation doctrine, recognizing the delegation of interstitial lawmaking authority from the legislature to the administrative agencies.165Chevron U.S.A. Inc. v. Nat. Res. Def. Council, Inc., 467 U.S. 837, 843–44 (1984) (“If Congress has explicitly left a gap for the agency to fill, there is an express delegation of authority to the agency to elucidate a specific provision of the statute by regulation. . . . Sometimes the legislative delegation to an agency on a particular question is implicit rather than explicit. In such a case, a court may not substitute its own construction of a statutory provision for a reasonable interpretation made by the administrator of an agency.”) Since dual-purpose statutes serve criminal functions, allowing agency interpretations deference essentially puts the agencies in the role of filling in the criminal law and thereby violates the non-delegation principle. Agency deference ought to be limited to civil law just as the rule of lenity ought to be limited to the criminal law.

3. Legislative Solutions to the Too Much Lenity Problem

As I hope to demonstrate in this Article, I think that there are solutions to the too much lenity problem. Before getting to my preferred solution in Part IV, I discuss in this Section a possible legislative response and the difficulty it faces.

One possible response to the problem of too much lenity is for Congress to draft a separate criminal tax code and civil tax code. The problem of too much lenity arises when a criminal tax law refers to the language of a civil tax law. For instance, § 5871 states, “Any person who violates or fails to comply with any provision of this chapter shall, upon conviction, be fined not more than $10,000, or be imprisoned not more than ten years, or both.”166I.R.C. § 5871. The phrase “this chapter” refers to chapter 53 of the Internal Revenue Code, which governs the taxation of machine guns, destructive devices, and certain other firearms.167Id. §§ 5801–5872. It thereby requires substantive tax laws within chapter 53 to now perform double duty, assigning civil tax liability and serving as part of the criminal actus reus for § 5871.

Separating the two contexts through drafting may seem a reasonable solution at first, but thinking through how such a solution could be carried out leads to a primary difficulty. How could the legislature be able to draft language regarding the violation of tax law without referring to such laws? The content of the crime set out in § 5871 is that someone violated the tax law. And if this violation of the tax law is what we hold to be criminal, then it is hard to see how the criminal statute could be drafted without reference to the civil tax law.

The act/omission distinction partly explains the issue at hand. The distinction is ordinary and, so, should be familiar to most. To water a plant involves carrying out some willed bodily movement, an action.168See Michael S. Moore, Act and Crime: The Philosophy of Action and Its Implications for Criminal Law 28 (paperback ed. 2010). Omissions can best be understood negatively as the absence of a certain act. If you have agreed to water your friend’s plants while they are on vacation, then your failing to do so is an omission—an absence of the act of watering.169I borrow this example from Sarah McGrath, Causation by Omission: A Dilemma, 123 Phil. Stud. 125, 125 (2005). The law typically criminalizes acts; a major exception is in tax law, where omissions are criminalized.

Consider the language of 18 U.S.C. § 1584, which punishes “[holding another person] to involuntary servitude.”17018 U.S.C. § 1584. Holding another person to involuntary servitude is an act. The statute reflects the prohibition against involuntary servitude laid out in the Thirteenth Amendment171U.S. Const. amend. XIII, § 1. but, importantly, does not directly reference the Thirteenth Amendment.172Consider, by contrast, 18 U.S.C. § 241, which punishes conspiracy against any “right or privilege secured to [another] by the Constitution.” 18 U.S.C. § 241. This is a direct reference to the Constitution, which means the Constitution has criminal implications. Since § 1584 assigns punishment to an act, it need not refer to any other provision. It can merely replicate the language of the Thirteenth Amendment and punish holding others to “involuntary servitude.” And although this is an instance of replication between the Constitution and a statute, it is not hard to see how the same could be accomplished with replication between criminal law and civil law. The civil code can set out civil penalties for the conduct of such-and-such act and the criminal code can set out criminal penalties for the conduct of such-and-such act without either needing to directly reference the other.

In contrast, I.R.C. § 5871, and tax crimes more generally, punish non-compliance with respect to some legally required conduct, an omission. Since the omission is defined by the required conduct that one is omitting to do, one cannot spell out the omission without reference to the law that sets out the required conduct in the first place; insofar as that required conduct is a matter of civil tax law, that means that the criminal tax law must refer to the civil tax law. I.R.C. § 5871 must refer to § 5821 since § 5821 sets out the required conduct, the omission of which is punishable.

C. Higher-Order Vagueness

To make it easier to talk about the rule of lenity, let us stipulate another law and some facts about language. Suppose that there is a law prohibiting driving dangerously. The safe driving statute reads:

Whoever operates a motor vehicle or motorcycle on the public roads or highways at a dangerous speed, having regard for width, traffic, use, and the general and usual rules of such road or highway shall be fined not more than twenty-five dollars.

The half-fictive statute is based on former Oregon General Code Section 12603, which was upheld as a valid statute in State v. Schaeffer.173State v. Schaeffer, 117 N.E. 220, 226 (Ohio 1917). I borrow this example from Jeremy Waldron, Vagueness and the Guidance of Action, in Philosophical Foundations of Language in the Law 58, 59 (Andrei Marmor & Scott Soames eds., 2011).

Table 1.  Table of Stipulations (Stated Again Infra)

Statute’s Meaning

Analysis of Statute’s Meaning (Includes Borderline Instances)

Citizen’s Mental Representation of Statute’s Meaning (Only Prototypical Instances)

Dangerous Driving

60 mph or faster

70 mph or faster

Clearly Dangerous Driving

70 mph or faster

80 mph or faster

Clearly Clearly Dangerous Driving

80 mph or faster

90 mph or faster

Note: I encourage the reader to refer to this table while working through the following paragraphs. In order to state the problem, some unusual and technical locution must be used, so the graphical component of this table will aid in comprehension.

All reasonable people will admit that what counts as dangerous driving admits of borderline cases and is, thus, a vague predicate. Suppose by stipulation that 60 miles per hour (“mph”) is the cutoff for driving dangerously on Birch Avenue at 10 a.m. on Wednesday—one is dangerous if and only if one is driving at 60 mph or faster. Of course, driving at 60 mph is not prototypically dangerous, it is instead a borderline case. In fact, it is the border! Let us then stipulate that driving on Birch Avenue is clearly dangerous if and only if the car is going 70 mph or faster.174For those more technically inclined, I should specify that I am here, for exposition’s purpose, speaking under the assumption of truth of an epistemic theory of vagueness on which category membership is definite but sometimes unknowable. Stephen Schiffer, Philosophical and Jurisprudential Issues of Vagueness, in Vagueness and Law 23, 25, 26 n.3 (Geert Keil & Ralf Poscher eds., 2016). The problem with the rule of lenity noted in this Part, however, is not dependent on any particular theory of vagueness.

When a person reads the safe driving statute, their mental representation includes only these prototypical, clear instances of dangerous driving, or so the story of fair notice goes.175See supra notes 69–72 and accompanying text. Driving at 60 mph, borderline dangerous driving, never crosses the mind of Average Joe as dangerous as he drives down Birch Avenue at 60 mph. Thus, when Joe goes on trial, the judges apply a rule of lenity. They construe the statute to mean that Average Joe can only be found guilty for dangerous driving if he has driven clearly dangerously, not just borderline dangerously.176Moore, supra note 35. To do otherwise would be unfair to his natural reading of the statute and violate fair notice as a rule of law value. So a rule of lenity, which caters to expectations, now requires judges to only find a defendant guilty of dangerous driving if the car was moving at 70 mph or faster, for it is these speeds that are clearly dangerous. Joe has not violated the safe driving statute, the court rules.

From here, the story unravels. The key observation is that someone who knows about the rule of lenity will now actually have a narrower realm of expectation. Recall that the rule of lenity, as a canon of statutory interpretation, assigns meaning to the statute.177See supra note 60 and accompanying text. After Joe’s trial, the meaning of the statute changed from prohibiting dangerous driving to prohibiting clearly dangerous driving.178See Solan, supra note 60, at 2213 (“But once the courts interpret a statute . . . the ruling becomes part of the meaning of the statute . . . .”). So suppose Steve knows that courts have applied the rule of lenity with respect to the safe driving statute because he read the opinion from Joe’s verdict. Whereas Joe read the statute to mean that “dangerous” driving is prohibited, Steve rightly reads the statute to mean “clearly dangerous” driving is prohibited. The ultimate authorities on the meaning of statutes are the courts,179H.L.A. Hart, The Concept of Law 141 (3d ed. 2012); see Marbury v. Madison, 5 U.S. 137, 177 (1803) (“It is emphatically the province and duty of the judicial department to say what the law is.”). and the courts have stated that the safe driving statute means do not drive clearly dangerously. Steve knows from reading the opinion from Joe’s case that if he drives dangerously but only barely so such that he is still a borderline rather than prototypical case of dangerous driving, he will then be outside the ambit of the statute. The rule Joe follows is do not drive dangerously. The rule Steve follows is do not drive clearly dangerously. Since Joe and Steve have different propositional contents for the rules that they are following, they will also have different mental representations. If Steve expects that he will only be in violation of the statute for clearly dangerous driving, he will conjure the mental image of a prototypical clearly dangerous speed, not a borderline clearly dangerous speed. In other words, if mental representations of concepts are just those of prototypical instances, as discussed in Section I.B., then the mental representation that Joe has is of clearly dangerous driving while the mental representation that Steve has is of clearly clearly dangerous driving. The crux of the issue is that “clearly dangerous” is itself a vague predicate—what counts as clearly dangerous driving admits of both clear and borderline cases. This is the recursive phenomenon of higher-order vagueness, vagueness about the borderline cases.180For an account of higher-order vagueness in law, see also Moore, supra note 35, at 134, 134 nn.18–19; Raz, supra note 37.

Driving at 70 mph is a borderline case of clearly dangerous driving. Driving at 70 mph, however, is not clearly clearly dangerous driving. It is merely clearly dangerous. The mental representation of dangerous driving that Steve has upon reading the statute with the rule of lenity in mind—the propositional content of which is do not drive clearly dangerously—is driving at 80 mph or greater. Thus, Steve does not expect to be found guilty of dangerous driving when he drives at 70 mph. Applying exactly the same sort of reasoning that justified having the rule of lenity in the earlier case, a court system ought now to adopt a double rule of lenity to deal with the issues caused by second-order vagueness; otherwise, they will violate Steve’s expectations and the rule of law value of fair notice. Steve can be found guilty of dangerous driving only if he drove clearly clearly dangerously—at 80 mph or greater.

Such reasoning can continue ad-infinitum, adding the clearly adverb with each iteration of higher-order vagueness.181Just as there is second-order vagueness, there is third-order vagueness, fourth-order vagueness, and so forth. See also Raz, supra note 37 (discussing higher-order vagueness as a requisite of any plausible theory of vagueness). In order to protect fair notice, there must be the triple rule of lenity, the quadruple rule, the quintuple . . . . But surely this is absurd.182I note here that higher-order vagueness may be asymptotic such that, once there are enough clearly adverbs, there are no real differences in the velocity of a clearly^n dangerous speed and a clearly^n+1 dangerous speed. If higher-order vagueness is so asymptotic, an infinite rule of lenity may be more palatable than if higher-order vagueness is not so asymptotic, but I suspect that most will find the infinite rule of lenity absurd even if higher-order vagueness were asymptotic. Since we plainly ought not adopt an infinite rule of lenity—lest we let many dangerous drivers go free—and fair notice does seem to be an important rule of law value in criminal law, something has gone quite wrong. Citizens who read a statute after the rule of lenity has been applied are failing to receive fair notice of punishment. Call this the “higher-order vagueness problem.”

Many readers, when presented with my argument above, have responded that the court ought to draw clear boundaries in order to avoid the higher-order vagueness problem. On their account, instead of changing the meaning from dangerous to clearly dangerous, the court should instead state something akin to “we hereby stipulate that any speeds at 70 mph or greater will count as dangerous driving for the purpose of the safe driving statute.” Whereas “clearly dangerous” is vague, “70 mph or greater” is a bright line rule. No problem of higher-order vagueness is presented for “70 mph or greater.” Steve, when reading this opinion, should have a clear mental representation that 70 mph driving is prohibited by law.

The problem with such a response is that it fails to notice that this discussion has thus far been using elliptical construction to hide the context dependence of the statute. The safe driving statute states that the notion of dangerous speed must be understood in the context of “width, traffic, use, and the general and usual rules of such road or highway.” Even if the court draws clear boundaries in one context, it leaves the other contexts open. 70 mph is a clearly dangerous speed for driving on Birch Avenue at 10 a.m. on Wednesday. But what counts as a dangerous speed on Grove Street at 8 p.m. on Saturday or MLK Boulevard at 4 p.m. on Tuesday? Surely, the court cannot delineate what counts as dangerous for every width, traffic, use, and the general and usual rules of every road and highway. And what of vague predicates that reject quantification altogether, such as the No Vehicles in the Park statute? How would a court draw up a bright line rule for the meaning of “vehicle”? The courts are severely limited in their ability to draw bright line rules. In most cases, they must simply apply the rule of lenity to restrict the meaning of a vague statute to only its prototypical instances, thus leading to the higher-order vagueness problem.

1. Technical Bookkeeping

For most legal scholars, the above Section should be convincing on its own. For these scholars, I recommend skipping this addendum on the more technical workings of the intuitive story set out above. Those more inclined to debate the theoretical foundations of law may disagree with how I have presented the issues above. Here, I respond to such disagreements.

In the above example of Steve and Joe, some theoretical premises were implicit in how I laid out the example. Premise one, legal realism is false. Premise two, judges assign meaning when applying the rule of lenity. Premise three, there is a fact of the matter about the borders of vague predicates, but such facts are unknowable (in other words, epistemicism). The higher-order vagueness problem is not dependent on these premises. Even if all three premises were false, I would need to revise only the manner in which the problem is laid out, not the substance.

The first two premises get to at what point Steve can rightly have the expectation that the law only punishes clearly dangerous driving. For instance, suppose the first premise is false and legal realism is true. According to legal realism (or, more precisely, legal realism as characterized by H.L.A. Hart), the law is whatever a judge will say it is.183Hart, supra note 179, at 65, 65 n.1, 146; Brian Leiter, American Legal Realism, in The Blackwell Guide to the Philosophy of Law and Legal Theory 50, 61 (Martin P. Golding & William A. Edmundson eds., 2005). If that is the case, then Steve need not wait for the court to actually apply any rule of lenity for he knows they will. Legal realism states that the fact the court will apply the rule of lenity makes it currently the case that the statute has a narrow meaning. And if the future fact that judges will apply the rule of lenity is current law, then Steve should think, even before Joe’s case is heard, that the law prohibits clearly dangerous driving. The only difference here is a matter of timing. Was the meaning of the statute made narrow by the rule of lenity or was it always narrow since the rule of lenity will be applied when the meaning of the statute is litigated? Either way, the problem of higher-order vagueness stands.

Regarding the second premise, recall the earlier argument in Part II that the rule of lenity stipulates rather than figures out what the statute means. Though canons of statutory interpretation typically seek to figure out the existing meaning of a statute, substantive canons like the rule of lenity instead assign meaning to a statute based on normative considerations.184Eskridge et al., supra note 61, at 448–49. The rule of lenity is not a rule of interpretation in substance since it is not concerned with figuring out what the words mean.185See also Balkin, supra note 84 (distinguishing between construction and interpretation proper). The construction/interpretation distinction also explains why the higher-order vagueness problem does not have a parallel issue in ordinary cases of interpretation. Where the statute is being interpreted to figure out its meaning, the court’s ruling about a specific case does not assign or change the meaning of the statute. Without any change in meaning, there is no new expectation to have, other than perhaps the knowledge that one or another thing is included or excluded from a general category set out in the statute. When the courts are applying the rule of lenity, it is often within the space of indeterminacy, where meaning has run out.186Callanan v. United States, 364 U.S. 587, 596 (1961). That courts change, rather than interpret, the meaning of a statute when they apply the rule of lenity (premise two above) was a key part of how I originally framed the higher-order vagueness problem.

Suppose, arguendo, the second premise is false and that the rule of lenity is a way of uncovering the existing meaning of the statute. That is, the safe driving statute already has a narrow meaning before it is ever litigated, and in litigation, judges are merely uncovering the existing meaning rather than changing the meaning to implement normative principles. This would make the rule of lenity a rule of statutory interpretation in substance. Even so, the higher-order vagueness problem remains. Again, the only thing that changes is that Steve, if he understands the already existing meaning of the statute, should think that only clearly dangerous driving is prohibited without needing to know about Joe’s case. As with the legal realism premise, the only change here is a matter of timing.

Finally, I have been speaking as if there is a definite fact of the matter about the category membership of borderline instances of a predicate and that we do not know such facts. I find the supposition of epistemicism an easy way to talk about vagueness,187See supra note 172 and accompanying text. but its falsity does not solve the higher-order vagueness problem. The higher-order vagueness problem arises from the general features of vagueness that all theories of vagueness must accommodate:(1) vague predicates have borderline cases that cannot be clearly categorized either as or as not members of such predicates; (2) when reading a vague statute, the reader’s mind tends to conjure up only the clear cases and not the borderline cases; and (3) the question of which items are clear or borderline cases of vague predicates is itself infected with vagueness, thus necessitating distinctions between, for example, clearly clearly dangerous driving and borderline clearly dangerous driving. Features (1) and (2) necessitate a rule of lenity to provide fair notice, and feature (3) kicks the problem one level up each time that the rule of lenity is applied such that features (1) and (2) now apply to the higher level. All three features are theory-independent phenomena.

IV. LENITY AS EXCUSE: REVISING THE DOCTRINE

I have thus far noted the discrepancy between the rule of lenity’s purpose and outcome as well as three doctrinal problems that arise from the discrepancy. The rule of lenity cannot resolve cases of linguistic ambiguity. The rule of lenity extends into civil law. The rule creates higher-order issues of fair notice. Further, I have argued that such problems are foundational to the rule of lenity as it is currently applied. If my arguments are sound, then we must revise the jurisprudential approach to indeterminate criminal law at the foundation. But what should such revisions look like? This Part examines the fundamental nature of the courts’ current lenity jurisprudence and how it ought to be rectified in a way that maintains rule of law values.

On my diagnosis, the issue is that the courts have understood the rule of lenity to be a canon of statutory construction. As a canon of statutory construction, it determines the meaning of the statute to which it applies.188See supra note 60 and accompanying text. Call such a doctrine the semantic rule of lenity. The meaning of statutes is not the right instrument by which to implement the demands of notice in punishment. As I have thus far argued in this Article, statutory interpretation is too blunt a tool for the fine purpose of protecting fair notice.

In some sense, it should not be surprising that the semantic rule of lenity runs into technical problems. The originators of the rule of lenity likely did not foresee the three doctrinal problems I have listed here. The rule of lenity, which traces back to sixteenth century England, predates both the advent of the Internal Revenue Code and contemporary linguistics.189Livingston Hall, Strict or Liberal Construction of Penal Statutes, 48 Harv. L. Rev. 748, 750 (1935). Ideally, we should like to reconceptualize the rule of lenity such that we avoid the three doctrinal problems while maintaining its function carrying out rule of law values.

The semantic rule of lenity should be replaced by what I will call the lenity excuse. There ought to be an affirmative defense available to defendants in those instances in which the defendants’ actions were within the penumbra of an indeterminate criminal statute without changing the meaning of that statute.190Likely, the most straightforward way to replace the semantic rule of lenity with the lenity excuse would require both legislative and judicial support. First, judges must abandon the use of the semantic rule of lenity. Second, there should be new legislation permitting lenity as a general excuse limited to criminal cases. It is unclear whether courts acting alone could accomplish the task. Depending on one’s more foundational jurisprudential views, this is either a refashioning of the rule of lenity or the addition of a novel common law defense. For instance, one might read Justice Stevens’s dissent in Thompson/Center, in which he states that the rule of lenity ought to be cabined to the criminal realm, as consistent with the “refashioning” view as Justice Stevens appears to want to keep the application of the rule of lenity without changing the meaning of the tax statute. See United States v. Thompson/Ctr. Arms Co., 504 U.S. 505, 525–26 (1992) (Stevens, J., dissenting). Alternatively, one may view it as a new common law defense. Although federal criminal law does not allow expansion of criminal liability through common law, there is a history of contraction of criminal liability through the use of common law defenses. See Stephen S. Schwartz, Comment, Is There a Common Law Necessity Defense in Federal Criminal Law?, 75 U. Chi. L. Rev. 1259, 1268 (2008); see also George P. Fletcher, The Nature of Justification, in Action and Value in Criminal Law 175, 180 (Stephen Shute, John Gardner & Jeremy Horder eds., 1993) (“The legislature is supreme in defining offences, but not in specifying the range of possible defences that can negate the inference of wrongdoing from the commission of an offence.”). The issue is that federal courts have portrayed the introduction of such defenses as a matter of statutory construction, see, e.g., Dixon v. United States, 548 U.S. 1, 24–26 (2006) (Breyer, J., dissenting) (detailing the common law defenses available in federal criminal law as a matter of statutory construction); see also Jessica A. Roth, The Anomaly of Entrapment, 91 Wash. U. L. Rev. 979, 993–95 (2014) (detailing the introduction of the entrapment defense in federal courts as an application of their province of statutory construction), whereas the point of my proposed revision to lenity is to avoid the semantic conclusions that come by way of statutory construction. Perhaps courts could construe the excuse of lenity as a matter of constitutional due process under the Fourteenth Amendment. Since not even the rule of lenity is understood to be a constitutional mandate, Cass R. Sunstein, Nondelegation Canons, 67 U. Chi. L. Rev. 315, 332 (2000), such an argument may be a reach. Given its mere tangential relevance, I leave this matter of constitutional law unresolved. Because the new rule would operate as an excusing condition, the mere fact that the law did not unambiguously criminalize a defendant’s conduct would be sufficient to negate any liability for criminal defendants in the same way that duress or insanity would negate liability. In this way, lenity would function like other excuses (such as duress or insanity) that absolve defendants of criminal liability when it would be unfair to punish them.191See Model Penal Code §§ 2.09(1), 4.01 (Am. L. Inst., Proposed Official Draft 1962).

In the remainder of this Part, I will argue that my proposed revision to lenity would not only be pragmatic, solving the three doctrinal problems that plagued the semantic rule of lenity, but also conceptually fruitful, helpfully tying together the purpose of lenity with that of other excuses.

A. The Categorical Unity of Lenity and Excuse

In order to understand the categorical unity between lenity and excuse, one must first understand two foundational concepts and their relation to one another: affirmative defense and excusing condition. In criminal law, the establishment of an affirmative defense will absolve the defendant of criminal liability even if the prosecution has established case that all elements of the offense are present.192Kadish et al., supra note 83, at 817. A paradigmatic example is the excuse of duress. Suppose, for instance, that a defendant has stolen cash from his friend’s wallet because a thug made a credible threat to kill the defendant unless the defendant stole from his friend and gave it to the thug.193Or, if one prefers an example in which the threatened harm is less harmful than the defendant’s evil conduct, one can replace my example with the example of a browbeater who threatens to bust the defendant’s kneecaps unless the defendant aids the browbeater in the browbeater’s killing of the victim. See Moore, supra note 39, at 36 (noting the possibility of “justificatory readings of duress” and using a similar example). Which example we use will make no difference here so long as we focus on the nature of duress as an excuse. Even if the prosecution can establish that all elements of the larceny offense are present, the defendant may appeal to the defense of duress, which absolves a defendant of criminal liability if the defendant was threatened with “unlawful force . . . , which a person of reasonable firmness . . . would have been unable to resist.”194Model Penal Code § 2.09(1) (Am. L. Inst., Proposed Official Draft 1962).

For the second concept, that of excusing conditions, this Article will follow the analysis by H.L.A. Hart. Rather than defining the term, Hart provides a non-exhaustive list of its members: “Mistake, Accident, Provocation, Duress, and Insanity.”195Hart, supra note 33, at 31. Unlike other analyses of excuses, which tend to center their focus around the defendant’s moral responsibility,196Id. at 35. If one prefers the alternative analyses under which excuses are essentially exculpatory, they may also prefer to think of lenity as a public policy defense rather than an excuse. A public policy defense absolves a defendant of punishment for a reason of public policy, the public policy at issue for the rule of lenity being the rule of law value of fair notice. This would align the rule of lenity with doctrines like diplomatic immunity and the rule against double jeopardy. See Paul H. Robinson, Criminal Law Defenses: A Systematic Analysis, 82 Colum. L. Rev. 199, 230–31 (1982) (describing the public policy defense category). I take it that there is no substantive disagreement. Hart’s analysis of excusing conditions focuses on their role in protecting liberty.197Hart, supra note 33, at 44–50. Hart finds excuses to be valuable because they provide for citizens the valuable ability to predict in what instances one will be punished and to avoid such instances through one’s own will.198Id. at 45.

Return to the duress example above. If the defendant stole from the defendant’s friend because a gunman threatened to kill the defendant otherwise, the duress excuse would absolve the defendant of criminal liability.199Model Penal Code § 2.09(1) (Am. L. Inst., Proposed Official Draft 1962). If there was no such excuse available, then it would be very difficult for a citizen to ensure they avoid punishment. In such a system, the citizen cannot guarantee that they will avoid punishment as a result of two factors working in conjunction. Firstly, to the extent one has no control over what a violent gunman will do, one cannot guarantee that one will not be threatened by a gunman. Second, to the extent that it is near impossible to resist the orders of a gunman, one cannot guarantee that one will not commit the crime the gunman demands. Thus, without a duress excuse, whether or not one will go to jail would depend on the unpredictable whims of a gunman. In such a case, it cannot be said that the individual had a genuine choice to avoid the law’s criminal sanctions.200Hart, supra note 33, at 47–48. The duress excuse eliminates this worry by ensuring that, in this unpredictable circumstance, one will be saved from punishment.

As argued in Section I.B, fair notice of the criminal laws is also an essential part of citizens’ having a genuine choice to avoid punishment. Fair notice is essential because it gives citizens an opportunity to figure out which actions are subject to punishment under the law. For instance, if the government chose not to publish the criminal laws but instead keep them private, an ordinary person would not have the ability to figure out which actions will be met with punishment. Although Hart himself did not explicitly consider the question of whether or not fair notice doctrines should be understood as excuses, his theory and its implications are clear. In order to protect the choice to avoid punishment, Hart plainly states that citizens must be given the ability to “find out, in general terms at least, the costs they have to pay if they act in certain ways.”201Id. at 44.

Thus, lenity’s purpose of protecting the choice to avoid punishment aligns the doctrine more closely with the domain of excuse than the domain of statutory interpretation. Given the theoretical unity between lenity and excuse, it may be instructive to look to how other excuses are applied in the criminal law and consider whether lenity should be given the same treatment.

Many excusing conditions, such as duress and insanity, are employed as affirmative defenses under the law.202Model Penal Code §§ 2.09(1), 4.03 (Am. L. Inst., Proposed Official Draft 1962). In these instances, we take it as obvious that if what we want to do is free the defendant, then we should do that directly by permitting a defense, rather than indirectly through changing what a statute means. I propose here that the same treatment be given to lenity, allowing for defendants to simply avoid punishment in instances where the statute was indeterminate with respect to the defendant’s behavior without constraining the meaning of that term as the current semantic rule of lenity does.

Excuses are often understood in contrast to another category of affirmative defense: justifications. In the legal context, both serve as affirmative defenses requiring acquittal even where the prosecution has established the case that all elements of the offense are present.203Kadish et al., supra note 83, at 817. Interestingly, whether an act is justified or excused affects differently the conduct rules of those around the actor. “In sum, when the defendant’s act is justified (worthy of approval), everyone may help him, and no one may hinder him. When the defendant’s act is excused (worthy of sympathy, but not approval), no one may help him and everyone may hinder him.” Leo Katz, Bad Acts and Guilty Minds: Conundrums of the Criminal Law 65 (1987). However, as a moral matter getting at the theoretical grounding of the doctrines, the two categories of defense diverge on the question of why acquittal is required. Justifications defeat what would otherwise be a prohibition against acting in a particular way. It turns what would ordinarily be prohibited into a permissible act. A standard example is self-defense. Killing another is ordinarily impermissible, but not so if done in self-defense.204Judith Jarvis Thomson, Self-Defense, 20 Phil. & Pub. Aff. 283, 283 (1991). Someone who kills another in self-defense has done nothing wrong.205Id. This contrasts sharply with the nature of excuse, which presupposes the wrongfulness of the act done.206Fletcher, supra note 190, at 178. To use the duress example, we would say that the defendant did something wrong by stealing, but only did so because the coercive threat left them no choice otherwise. Both excuse and justification absolve the actor of criminal liability, but justifications do so by negating the impermissibility of the actions whereas excuses affirm that the actions were impermissible but absolve the actor for a reason standing outside the wrongfulness of the act itself.207See H.L.A. Hart, Prolegomenon to the Principles of Punishment, in Punishment and Responsibility: Essays in the Philosophy of Law, supra note 24, at 1, 13–14.

By leveraging the distinction between a justification and an excuse, one can see why the lenity excuse better accords with our intuitions regarding notice as a rule of law value as opposed to the semantic rule of lenity. Like justification, the semantic rule of lenity redraws the lines of what is permissible and what is impermissible behavior. By giving the statute a narrow construction, the semantic rule shrinks what counts as impermissible behavior. But in many cases in which the rule of lenity is applied, we see that the defendants really did do some harmful act, such as disposing of an object that could serve as evidence208Yates v. United States, 574 U.S. 528, 528–30 (2015). and transporting a stolen airplane across state lines.209McBoyle v. United States, 283 U.S. 25, 26–27 (1931). Therefore, the semantic rule of lenity fails to accord with the reason why we acquit defendants who conduct non-prototypical criminal activities. The reason we acquit them is that it would be unfair, from a rule of law perspective, to punish an act without clear notification that such an act would be punished, not that the defendants have clean hands.210See supra Section I.B.

Under the lenity excuse, a criminal defendant would be absolved of liability for their actions if their actions were not clearly within the meaning of the criminal statute without needing to change the meaning of said statute. The underlying conduct rule, for example, the rule prohibiting dangerous driving, does not change. The law can stand both for the proposition that a citizen acted wrongfully and the proposition that punishing the individual, despite their wrongful acts, would violate our rule of law principles. Refashioning lenity from a canon of statutory interpretation to an excusing condition essentially allows us to have our cake and eat it too. The lenity excuse protects the safe path principle while still maintaining the best interpretation of the statute.

B. Solving the Three Doctrinal Problems

Recall the earlier discussion of the hypothetical Bank Safety Act, which criminalizes starting a fire within one hundred feet of a bank.211See supra Section III.A. We stipulated there that “bank” was ambiguous between a river bank and a financial bank and that Defendant A had set fire next to a financial bank and Defendant B had set fire next to a river bank.212See supra Section III.A. The semantic rule of lenity does not resolve such a situation since either reading of “bank” would absolve only one of the two defendants of guilt. However, since neither defendant received fair notice that their act would be criminalized due to the linguistic ambiguity of the Bank Safety Act, the fair notice purpose would require absolving both of guilt.

Under a lenity excuse regime, the situation is neatly resolved. The court, for all defendants, need only ask the question whether either one of their actions were unambiguously criminalized by the law. Ex hypothesi, due to the linguistic ambiguity inherent in the statute, neither defendants’ actions were unambiguously criminalized, so both defendants would be absolved of culpability. The same would go for any case of linguistic ambiguity in criminal statutes, mutatis mutandis. Thus, the outcome of the lenity excuse is consistent with what fair notice demands.

The semantic rule of lenity problematically extended past its purpose of fair notice in punishment by applying in purely civil contexts. With the lenity excuse, a statute used in both criminal and tax law contexts can be given the best interpretation rather than the narrowest construction, so tax law’s civil purposes are protected. Nevertheless, the lenity excuse can still apply in criminal contexts, which is the context in which fair notice is required due to the special status of punishment. Although the meaning of an inscription must, for linguistic reasons, stay constant across contexts,213See supra note 116 and accompanying text. no such principle of consistency applies to affirmative defenses at law. The law can, and does, permit defenses in criminal law that are not available in civil law.214E.g., 18 U.S.C. § 17 (providing an insanity defense for criminal prosecutions). Insofar as we have a principled reason, namely the special status of punishment, a company like Thompson/Center Arms Co. could leverage the excuse of lenity to avoid punishment for the manufacture of Thompson/Center kits, but it should not have such an excuse when courts are determining its civil tax liability, such as whether the firearms excise tax applies to the production of Thompson/Center kits. This solves the too much lenity problem.

The semantic rule of lenity, by assigning meaning to a statute, led to fair notice problems caused by higher-order vagueness. With the excuse of lenity, there is no need to assign any particular narrow construction to the statute itself. Without any new assignment of meaning, studious potential criminals have no reason to have different expectations of what violates the criminal law after reading an opinion employing the lenity excuse. The court instead affirms, for instance, that borderline dangerous driving of 60 mph is still dangerous, though it absolves the defendant who drove at 60 mph of criminal liability since that would not have been the defendant’s expectation from reading the statute. The sovereign command remains don’t drive dangerously rather than changing to do not drive clearly dangerously, and no double rule of lenity is required. Joe, who drove at 60 mph will be able to benefit from the lenity excuse because his actions were not clearly prohibited by the vague safe driving statute; Steve, who drove at 70 mph will not be able to benefit from the excuse. Steve’s driving was clearly dangerous and, since the conduct rule remains do not drive dangerously as opposed to do not drive clearly dangerously, Steve has received fair notice that his actions would be punished. The excuse of lenity thereby solves the higher-order vagueness problem.

The reader may here object that a problem parallel to the higher-order vagueness problem nevertheless remains. Can Steve not say that he thought himself to be following the law given that there is an excuse of lenity that absolves borderline dangerous driving of culpability? To put the force of the counterargument another way, what is the difference between saying, as the semantic rule of lenity does, that clearly dangerous driving is prohibited and saying, as the excuse of lenity does, that dangerous driving is prohibited but borderline dangerous driving is excused?

The key distinction is that although the semantic rule of lenity is directly construing the statutory conduct rule, an excuse is not meant to guide conduct. It would be quite odd to think that the presence of excuses in the criminal law is tantamount to the law’s saying, “If you are planning to commit homicide, please make sure you are insane or under duress.” Excuses are instead best understood as addressing the government on how to adjudicate questions of criminal culpability.215Fletcher, supra note 190; Robinson, supra note 75, at 372. The criminal law permits excuses sotto voce.

This sotto voce feature of excuses may be analogized to the same in statutes of limitations. A five-year statute of limitations on assault, for example, is not to let citizens know that they are permitted to assault others so long as they can lay low for the next five years.216Fletcher, supra note 190, at 184–85. If the government were to amend the statute of limitations to seven years, an assaulter cannot complain of unfairness.217See id. It would be on its face ridiculous for the criminal to complain, “I assaulted someone yesterday thinking I would only have to hide for five years, not seven. You are treating me unfairly!” Likewise, because excuses are not conduct rules, the rule of law principle that citizens be given fair notice of which conduct is prohibited does not apply to excuses.218Gardner, supra note 24, at xlvii; Robinson, supra note 75, at 371–76, 379. Therefore, the excuse of lenity does not require a “second-order” excuse of lenity, thereby resolving the higher-order vagueness problem.

C. Other Justifications for the Semantic Rule of Lenity

I have argued in the previous Section that an excuse of lenity best aligns the doctrine with its rule of law purpose of fair notice while the semantic rule of lenity does not. Though the value of fair notice is the most often cited justification for the rule of lenity,219Kahan, supra note 20, at 349, 349 n.13. it is certainly not the only justification for such a long-standing and august doctrine of criminal law. Ideally, the excuse of lenity would be consistent with these other justifications as well—it would be a shame to throw out any babies with the bathwater. In this Section, I consider the other justifications for the rule of lenity and demonstrate how the excuse of lenity is consistent with such aims.

The first set of reasons significantly different from fair notice for having the rule of lenity includes those that are still closely connected to restricting the scope of criminalization. The rule of lenity “constrains the discretion of law enforcement officials”;220Id. at 345. it is a speed bump against over-criminalization in the United States;221See Stephen F. Smith, Proportionality and Federalization, 91 Va. L. Rev. 879, 939 (2005). and it protects the (relatively) politically powerless citizens who would have a hard time organizing to change the criminal code.222William N. Eskridge, Jr., Philip P. Frickey & Elizabeth Garrett, Legislation and Statutory Interpretation 357 (2d ed. 2006); William N. Eskridge, Jr., Overriding Supreme Court Statutory Interpretation Decisions, 101 Yale L.J. 331, 413 (1991). Since the excuse of lenity likewise works to restrict the scope of criminalization by offering a functional near-equivalent of the rule of lenity in the criminal context,223Although the excuse of lenity, like the semantic rule of lenity, ultimately results in absolving defendants of culpability when statutes did not unambiguously criminalize their behavior, the excuse of lenity is only “near-equivalent” because it actually provides greater protections to criminal defendants when it comes to issues of lexical ambiguity in statutes. See supra Section IV.B. all of these purposes are also carried out by the excuse of lenity.

Another category of reasons in favor of the rule of lenity involves the notion that criminal law is solely the province of the legislature, the non-delegation principle.224Hickman, supra note 10, at 912, 912 n.27. Again, the functional near-equivalence between the semantic rule of lenity and the excuse of lenity within the criminal context will explain why the excuse of lenity can do much of the work that the semantic rule of lenity currently does. Whenever the excuse of lenity applies, because the excuse will be dispositive of the case, the courts need not resolve the indeterminacy of the criminal statute at hand. Courts will need to resolve penumbral issues in dual-purpose statutes, but this will not make a difference for criminal liability since the excuse of lenity will be available when defendants fall into the penumbra. Since defendants can leverage the excuse against vague statutes in court, it will, like the semantic rule of lenity, put the impetus on Congress to draft clearer statutes.225This argument in favor of the rule of lenity is provided in William N. Eskridge, Jr. & Philip P. Frickey, Quasi-Constitutional Law: Clear Statement Rules as Constitutional Lawmaking, 45 Vand. L. Rev. 593, 600 (1992).

There is one reason in favor of the semantic rule of lenity that does not apply to the excuse of lenity: the semantic rule of lenity has a long history.226Scalia, supra note 64. The rule of lenity originated in sixteenth-century England227Livingston Hall, Strict or Liberal Construction of Penal Statutes, 48 Harv. L. Rev. 748, 749–50 (1935). and has survived in application to the present day. This is indeed a value lost if we were to do away with the semantic rule of lenity, but its importance ought to be put in proper perspective. Though the semantic rule of lenity’s history is long, canons of statutory interpretation are not law and do not have precedential effect.228Kisor v. Wilkie, 139 S. Ct. 2400, 2444 (2019) (Gorsuch, J., concurring); Abbe R. Gluck, Intersystemic Statutory Interpretation: Methodology as “Law” and the Erie Doctrine, 120 Yale L.J. 1898, 1902, 1909–10 (2011) (“The U.S. Supreme Court generally does not treat its statements about statutory interpretation methodology as law.”).

V. COUNTERARGUMENTS AND RESPONSES

In this Part, I consider some arguments specifically against the existence of the too much lenity problem, the higher-order vagueness problem, and the excuse of lenity. The central counterarguments are (1) the tax law is best construed narrowly in civil contexts, so the “too much lenity problem” is actually a feature, not a bug, of the semantic rule of lenity, and (2) if the legislature were to announce that there is no rule of lenity, then the higher-order vagueness problem dissipates due to the fact that individuals are now on notice that statutes will be construed according to the intent of the legislature. Both arguments are important because they go to the theoretical foundations of this Article.

A. Tax Law Would Be Better Off If the Rule of Lenity Applied

The first counterargument puts forth that the problem of too much lenity is no problem at all since the tax code ought to be subject to strict construction, resolving any indeterminacy in favor of the taxpayer. On this view, it is unfair to tax a citizen without clear say-so by statute. The application of the rule of lenity to the tax law is to be celebrated, not decried. Some European nations, for instance, have strict-construction tax systems favoring taxpayers.229See Dean et al., supra note 153, at 139, 148–53 (detailing the more taxpayer-friendly approaches taken in the United Kingdom and France). Such a response, I contend, fails to comport with the differential attitude citizens should have with regard to the administration of distributive justice and retributive justice.230There is another, more technical argument against applying the rule of lenity to matters of determining civil tax liability. Namely, the various inclusion, exclusion, deduction, and credit rules that determine tax liability do not have a single interpretation that is uniformly good or bad for all taxpayers. Take as one example what may seem like a clear case: rules assigning taxable income. In most situations, we may think that assigning taxable income is a negative consequence to the taxpayer, so we might think it appropriate to apply the rule of lenity in a way that minimizes a taxpayer’s taxable income. But this can be disadvantageous to a taxpayer who wants to meet an income floor to receive a health insurance premium assistance tax credit under I.R.C. § 36B. In such a situation, additional tax liability can be outweighed by the benefit of meeting the income floor and getting the credit. Grewal, supra note 22 (laying out this technical argument in further detail).

First, tax law is the government’s most important lever in carrying out principles of distributive justice.231See Nam, supra note 43. Distributive justice concerns how institutions should be designed to fairly distribute the benefits and burdens of societal cooperation.232Jeesoo Nam, Biomedical Enhancements as Justice, 29 Bioethics 126, 126 (2015). Our progressive income tax system carries out a democratically determined vision of distributive justice under which tax obligations directly correspond to one’s income earned in the marketplace. The statutes provide the skeletal structure for this vision, the corpus of which is fleshed out by the judicial and administrative authority. To undo the interstitial authority is to partly undo the very aims of the tax code.233See also supra Section III.B.2 (discussing the importance of agency deference in carrying out tax policy aims).

On this picture of tax justice, to deviate from the best interpretation of a statute in favor of a narrow interpretation of a statute not only undoes what distributive justice would require, but thereby also partly undoes the provision of a valuable moral service by the government. The tax system provides valuable coordination between citizens to hire an expert to tally up what justice requires of them and hold each other to that tally. Since taxpayers have moral reason to pay what justice requires, following the best interpretation of the tax law helps their aims rather than impeding them.234For the purpose of this argument, that some individuals do not think that such moral reasons apply to them does not entail that those reasons do not apply to them. What reasons are provided by morality is one question, whether people rightly recognize such reasons is another.

Second, though related to the first point, the purpose of tax law is distinct from the purpose of criminal law in that the imposition of tax does not typically aim to serve a deterrence function. After all, a tax on income is not meant to discourage the earning of income.235Quite the contrary, an important goal of tax law design is to minimize distortions to market behavior because such distortions lead to economic inefficiency. See Jonathan Gruber, Public Finance and Public Policy 620–33 (5th ed. 2016). There is no implicit public moral rebuke attached to civil tax liability as there is for criminal liability.236United States v. Bass, 404 U.S. 336, 348 (1971) (noting the moral condemnation attached to criminal punishment). Although there are exceptions,237For instance, the home mortgage interest deduction of I.R.C. § 163(h) appears to be provided principally to encourage home ownership over renting. the principal purpose of the tax law is to collect (and sometimes distribute) revenue in a just and efficient manner. Notice is most critical when the statutes are intended to guide citizens’ behavior since ambiguously drafted statutes cannot properly serve this guiding function.238Gardner, supra note 24, at xlii–xliii. A critic of this premise may nevertheless contend that even if vague statutes fail to serve as suitable reference points for decision-making, a legal regime of unclear laws still affects the behavior of decision-makers. Anticipating such a response, H.L.A. Hart and John Gardner helpfully distinguish between the law serving merely as a goad and the law serving as a guide. Though unclear laws cannot serve as a guide, they may nevertheless be successful as a goad. “Isn’t it arguable that the most effective legal systems (those most successful in securing their policy objectives) have been those operated as reigns of terror, revelling in arbitrariness, exploiting human weaknesses, and triggering conditioned responses?” Id. at xlii. Presumably, we should prefer that our criminal law system guide rather than terrorize our community, so the premise that notice is most important when statutes intend to guide still stands. After all, a citizen cannot use a statute to guide their behavior if they cannot figure out what the statute means.

To bring out this point, we can think of the perspective of a hypothetical idealized taxpayer with regard to the tax law. The taxpayer understands that they have a moral obligation to contribute a certain amount to the common pool of resources by which we fund the various functions of government. However, it is quite unlikely that the taxpayer could even estimate how much they should contribute if they were to reason purely from philosophical first principles or that they would know much about the content of such first principles. Even if the taxpayer resolves the coarse-grained question regarding their obligation to pay taxes, it is unlikely that they will even be able to approximate an answer to the fine-grained question of how much taxes they are morally obligated to contribute as a matter of justice.239Nam, supra note 43.

One way for the taxpayer to resolve the fine-grained question is to defer the calculations to appointed experts in the legislature and the Treasury. On this account, the taxpayer can carry out their ordinary business without worrying about what constitutes their fair share contribution and, at the end of the year, rely on the tax law and the aid of administrative officials to figure out what that fair share is given the activities they engaged in and their results. Instead of having to think through how the tenets of John Locke and Jean-Jacques Rousseau apply to him as a citizen, the taxpayer can just fill out an IRS Form 1040-EZ. This sort of division of labor is critical due to the difficulty of answering the fine-grained questions of political morality and the limited resources that citizens have to put towards such inquiry.240Id.

For these taxpayers, notice is only relevant insofar as they need the information to pay what they owe. Since the taxpayer is not treating tax liability as a cost or benefit of such-and-such action, clarity in laws is actually more important for the administration of such laws rather than being governed by them.241See Solan, supra note 53, at 134–35. This is also why the tax law can bear such enormous complexity. Whereas conduct rules primarily meant to dictate citizens’ behavior must be drafted simply so that citizens can understand the conduct rules, tax laws can be drafted with greater complexity because they are primarily addressed to administrators and judges who have expertise in tax law.242See Robinson, supra note 75, at 378 (“For example, a high degree of specificity might be desirable even if it created a degree of complexity that would be unreasonable to expect the public to master. The special training of decision makers . . . means that greater complexity can be tolerated.”). Though idealized for purposes of exposition, this sort of narrative is consistent with both the theoretical work in political philosophy243Nam, supra note 43 (detailing the work in political philosophy and the implications for how taxpayers ought to view the tax law). and the empirical research that often, though notably not always, finds that tax rates have no effects or very small effects on taxpayer behavior.244E.g., Gruber, supra note 235, at 688–90, 707; Joel Slemrod & Jon Bakija, Taxing Ourselves 112–13 (2d ed. 2000).

None of this is to deny the proposition that there would be something good provided by having a tax law system where taxpayers can more easily figure out their tax liability. For instance, if a taxpayer does not know how much taxes they will have to pay at the end of the year because the tax laws are too vague, it may lead to the taxpayer over- or under-saving for the forthcoming tax liability. Instead, my argument is merely that the civil tax law lacks many of the features that make fair notice far more important in criminal law. Given these differences—for civil tax law, there is no moral condemnation, no punishment, and no intended deterrence effect—we have good reason to think that tax law ought not follow the stringent fair notice requirements of criminal law.

B. Statutory Notice That Fair Notice Laws Have Been Repealed

The second counterargument contends that if courts were to get rid of the rule of lenity altogether in conjunction with notice of such at the legislative level, then the problem of higher-order vagueness would not arise. Here, the central idea is that the legislation stating that the rule of lenity does not apply to the criminal code would itself stop citizens from forming any expectations about the rule of lenity.

Such an approach has a fundamental problem. A notice that there would be no lenity provided would amount to notice that there is no fair notice. This becomes plain if we recall the fictive story underlying the fair notice doctrine. The reason a citizen needs fair notice is that they may think they are following the law when they are not. To put the point another way, citizens would simply be on notice that they cannot find comfort in their natural understanding of a criminal statute. That fair notice has been abrogated still stands.

The critic might then respond that the legislature ought to impose a statutory single rule of lenity. Given my argument for the higher-order vagueness problem, having just one rule of lenity would be arbitrary—what reason do we have to stop at one rather than two?—but such violations are forgivable. The law is in the business of line drawing and, since the hair-width difference between what is inside the line and outside the line can hardly be a difference-maker,245When this notion is applied to vague predicates, it is referred to as “the principle of tolerance.” Crispin Wright, Language Mastery and the Sorites Paradox, in Truth and Meaning 223, 229 (Gareth Evans & John McDowell eds., 1976). Put more formally, for vague predicates, if there is an object a to which the vague predicate applies and another object b that is qualitatively identical to a but for a miniscule difference, then the vague predicate will also apply to b. line drawing is often an arbitrary matter.

The bigger issue is that having one rule of lenity does not resolve the fair notice problem so long as the rule of lenity remains semantic in nature. The same conclusion about having no rule of lenity applies to the single rule of lenity. So long as individuals understand that the meaning of a statute has changed from an application of the rule of lenity, then to announce by statute that there will be no more higher-order rules of lenity will only violate individuals’ expectations that their actions are within the bounds of legally permissible behavior. The selection of any n-tuple rule of lenity cutoff is arbitrary and will disrupt fair notice for the n+1th order reader of the statute.246For any natural number n.

CONCLUSION

This Article has argued that the semantic nature of the rule of lenity leads to three problems in which the rule breaks away from its purpose of providing fair notice in criminal law. The rule of lenity cannot deal with linguistic ambiguity. Some criminal statutes also play civil functions, thereby transferring the strict construction of the rule of lenity from criminal contexts to the civil context. Once the courts construe the meaning of a statute to include just the clear cases, it then creates a fair notice burden regarding the question of what counts as the clear cases, which is itself a vague matter.

To resolve these issues, we ought to replace the semantic rule of lenity with an excuse of lenity. Excuses and fair notice share the common denominator of providing ordinary citizens the safety of choosing to avoid punishment, so having lenity provided as an excuse would more closely align the rule with its purpose. An excuse of lenity would provide the same benefits as the semantic rule of lenity, restricting the scope of criminalization and maintaining criminal law within the province of the legislature, without the drawbacks of having a semantic rule.

 

96 S. Cal. L. Rev. 397

Download

Assistant Professor of Law and Philosophy, University of Southern California. Thank you to Scott Altman, Jody David Armour, Jordan Barry, Thomas Bennett, Jonathan Choi, Robin Craig, Noël Cunningham, William Eskridge, Felipe Jiménez, Mitchell Kane, Gregory Keating, Adam Kern, Daniel Klerman, Yao Lin, Erin Miller, Michael Moore, Clare Pastore, Marcela Prieto, Robert Rasmussen, Emily Ryo, Daniel Sokol, Kevin Tobia, Gideon Yaffe, Yuan Yuan, Jack Whiteley, participants of the New York University School of Law–Lawyering Scholarship Colloquium, participants of the University of Pittsburgh–Law and Language Group, and participants of the University of Southern California Gould School of Law–Faculty Workshop for their invaluable help. Any errors are mine and mine alone.

Seeing and Serving Students with Substance Use Disorders Through Disability Law

The opioid epidemic has brought the immense harms of substance abuse to the fore of national attention. Despite a growing bipartisan consensus that substance use disorders are best addressed through treatment and community support, rather than punitive deterrence measures, policymakers have yet to allocate the necessary resources for a comprehensive and evidence-based national drug policy. Until that occurs, advocates for individuals with substance use disorders must search for reform opportunities within existing law and policy.

To that end, this Article explores whether, and to what degree, the federal disability statutes that are applicable to public schools—the Individuals with Disabilities Education Act, Section 504 of the Rehabilitation Act of 1973, and the Americans with Disabilities Act—can “see” and serve adolescents with substance use disorders within the public school system. It argues that substance use disorders can be education-impacting disabilities, that the general failure to recognize and address substance use disorders in school settings is due to widespread misperception of substance-involved students, and that a novel-but-reasonable interpretation of existing law could provide a meaningful degree of support for certain students with substance use disorders.

This Article has three objectives: (1) to instigate a debate in an uncharted area of education law and policy; (2) to provide a comprehensive survey of current medical research and special education case law for advocates of students with substance use disorders; and (3) to direct further attention to the broader inadequacies of special education law and policy for students with mental health challenges. The implications of this debate, upon the lives of the estimated 1.6 million adolescents with substance use disorders and upon education policy generally, are profound.

INTRODUCTION

Tom Murphy attended the 2015 Prescription Drug Abuse and Heroin Summit in his professional capacity as a Senior Special Agent in the Virginia State Police.1See Remarks at the Rx Drug Abuse and Heroin Summit in Atlanta, Georgia, 2019 Daily Comp. Pres. Doc. 237 (Apr. 24, 2019) (referring to Tom Murphy, Senior Special Agent in the Virginia State Police). But the topic of the summit was of great personal interest to him: his teenage son Jason was struggling with a substance use disorder.2Id. Jason’s substance abuse, and its attendant consequences, deepened in the months following the conference.3Id.

In 2017, Jason died from an overdose of fentanyl and heroin.4Id.

Addressing the Prescription Drug Abuse and Heroin Summit in 2019, Special Agent Murphy implored those touched by substance use disorders to share their experiences with others in order to fight stigma.5Id. He concluded his remarks by placing his family’s tragedy within the grim national context: “There are 70,000 different stories that happened in 2017. You heard my son’s.”6Id. He paused, choking back tears. “His name was Matthew Jason Murphy.”7Id.

It is difficult to fathom the harms caused by substance abuse. For the past several years, the rate of fatal overdoses has exceeded the highest-ever annual death tolls from car accidents, the AIDS epidemic, and gun violence.8See Josh Katz, Abby Goodnough & Margot Sanger-Katz, In Shadow of Pandemic, U.S. Drug Overdose Deaths Resurge to Record, N.Y. Times (July 15, 2020), https://www.nytimes.com/interactive/
2020/07/15/upshot/drug-overdose-deaths.html [https://perma.cc/W6XR-6RBU]. The total overdose rate includes non-opiate overdoses, but opiates such as fentanyl are involved in the large majority of overdose deaths. See Drug Overdose Death Rate Maps & Graphs, Ctrs. for Disease Control & Prevention (June 2, 2022), https://www.cdc.gov/drugoverdose/deaths/index.html [https://perma.cc/KZ8N-Z2SZ].
There were 70,237 overdose deaths in the United States in 2017,9Holly Hedegaard, Arialdi M. Miniño & Margaret Warner, NCHS Data Brief No. 329: Drug Overdose Deaths in the United States, 1999–2017 (2018), https://www.cdc.gov/
nchs/data/databriefs/db329-h.pdf [https://perma.cc/4ARB-D5DG].
67,367 overdose deaths in 2018,10Holly Hedegaard, Arialdi M. Miniño & Margaret Warner, NCHS Data Brief No. 356: Drug Overdose Deaths in the United States, 1999–2018 (2020), https://www.cdc.gov/
nchs/data/databriefs/db356-h.pdf [https://perma.cc/N7UV-JMVP].
70,630 overdose deaths in 2019,11Holly Hedegaard, Arialdi M. Miniño & Margaret Warner, NCHS Data Brief No. 394: Drug Overdose Deaths in the United States, 1999–2019 (2020), https://www.cdc.gov/
nchs/data/databriefs/db394-H.pdf [https://perma.cc/4REQ-XT63].
91,799 overdose deaths in 2020,12Holly Hedegaard, Arialdi M. Miniño, Merianne Rose Spencer & Margaret Warner, NCHS Data Brief No. 428: Drug Overdose Deaths in the United States, 1999–2020 (2021), https://www.cdc.gov/nchs/data/databriefs/db428.pdf [https://perma.cc/B5C5-RUQN]. and a stunning 107,573 overdose deaths in 2021.13Nat’l Ctr. for Health Stat., Provisional Drug Overdose Death Counts (2021), https://www.cdc.gov/nchs/nvss/vsrr/drug-overdose-data.htm [https://perma.cc/2JF9-PZ7M]. The substantial increase in overdose deaths between 2019 and 2021 was likely fueled in part by the COVID-19 pandemic, which caused widespread misery and inhibited access to treatment.14See William Wan & Heather Long, ‘Cries for Help’: Drug Overdoses Are Soaring During the Coronavirus Pandemic, Wash. Post (July 1, 2020), https://www.washingtonpost.com/health/2020/07/
01/coronavirus-drug-overdose [https://perma.cc/6NLC-PAPU]; Zoe Rohrich, Opioid Deaths Are Surging in the Pandemic. Here’s How Treatment Is Adapting, PBS (Aug. 7, 2020), https://www.pbs.org/
newshour/health/opioid-deaths-are-surging-in-the-pandemic-heres-how-treatment-is-adapting [https://
perma.cc/83AT-JAAY].

For a frame of reference, Special Agent Murphy’s tribute to his son lasted four minutes;15President and First Lady Deliver Remarks at Drug Abuse Summit, C-SPAN (Apr. 24, 2019), https://www.c-span.org/video/?460100-1/president-lady-deliver-remarks-drug-abuse-summit [https://
perma.cc/6FPK-JFF9].
if a family member of every person who died from a drug overdose in 2017 shared their story for four minutes, back to back, it would last over 195 days. If family members of those who lost loved ones to overdoses in 2021 did the same thing, it would last over 298 days.

At the same conference, politicians and policymakers touted their efforts to combat the opioid epidemic, including the designation of a national public health emergency the previous year; the issuing of billions of dollars in state grants “[t]o expand access to treatment, recovery, and other crucial activities and services”; and the signing of the Substance Use-Disorder Prevention that Promotes Opioid Recovery and Treatment (“SUPPORT”) for Patients and Communities Act,16See Remarks at the Rx Drug Abuse and Heroin Summit in Atlanta, Georgia, supra note 1. which reduced regulatory hurdles concerning “opioid use disorder prevention, recovery, and treatment” the previous October.17Substance Use-Disorder Prevention that Promotes Opioid Recovery and Treatment (SUPPORT) for Patients and Communities Act, Pub. L. No. 115-271, 132 Stat. 3894 (2018). These and similar policy responses have received widespread praise.18See What They Are Saying: Support for President Trump’s Initiative to Fight the Opioid Epidemic, Trump White House (Mar. 20, 2018), https://trumpwhitehouse.archives.gov/briefings-statements/saying-support-president-trumps-initiative-fight-opioid-epidemic [https://perma.cc/5767-LBSF].

Unfortunately, such policy responses—while generally welcomed by experts in the field of addiction studies19See German Lopez, Trump Just Signed a Bipartisan Bill to Confront the Opioid Epidemic, Vox (Oct. 24, 2018, 3:13 PM), https://www.vox.com/policy-and-politics/2018/9/28/17913938/trump-opioid-epidemic-congress-support-act-bill-law [https://perma.cc/DAU2-8HJU].—have not come close to creating the “Cascade of Care” required to serve the roughly forty million Americans with substance use disorders.20Arthur Robin Williams, Edward V. Nunes, Adam Bisaga, Frances R. Levin & Mark Olfson, Development of a Cascade of Care for Responding to the Opioid Epidemic, 45 Am. J. Drug & Alcohol Abuse 2 (2019); Substance Abuse & Mental Health Servs. Admin., Key Substance Use and Mental Health Indicators in the United States: Results from the 2020 National Survey on Drug Use and Health 3 (2021) [hereinafter SAMHSA Report], https://www.samhsa.gov/
data/sites/default/files/reports/rpt35325/NSDUHFFRPDFWHTMLFiles2020/2020NSDUHFFR1PDFW102121.pdf [https://perma.cc/KN2S-RYEU].
Until the political will exists for such comprehensive policy initiatives, advocates for individuals with substance use disorders must contemplate ways in which existing law and policy can be marshaled to serve that population.

This Article offers such a solution: using the federal disability-discrimination laws applicable to public schools as a new way to “see” and serve individuals who—like Jason Murphy—develop debilitating substance use disorders as adolescents. This Article proposes that students with substance use disorders who meet the eligibility criteria of federal disability laws should be recognized as individuals with disabilities (and receive appropriate accommodations) from their schools, just as adults with substance use disorders who meet such diagnostic criteria have received appropriate accommodations from their employers since the 1970s.

The argument that substance use disorders should be recognized and addressed as legal disabilities under special education law is a novel one. While several scholars have powerfully addressed the need to recognize mental health conditions under special education law,21See infra note 114. no court opinion or piece of scholarship has yet engaged with the matter of applying such laws to students with substance use disorders specifically. It is not that the matter has been studied and rejected, but rather that this particular conversation has not yet begun.

This Article offers a possible explanation for this silence: that students with substance use disorders are rarely perceived within their schools to be afflicted with “medical” conditions, which is the necessary predicate for recognition of “legal” disabilities. To that end, this Article provides a survey of current medical research regarding substance use disorders and how such disorders affect adolescents’ academic development. It also discusses the power of social perception in this space; the manner in which adolescents face unique barriers to the identification of, and appropriate responses to, substance use disorders; and how students with substance use disorders are therefore largely invisible within schools’ current drug and alcohol policies.

Jason Murphy began “self-medicating” with marijuana while he was in high school and moved out of his parents’ home the day he turned eighteen.22See Remarks at the Rx Drug Abuse and Heroin Summit in Atlanta, Georgia, supra note 1. If his school had recognized substance use disorders not as a propensity towards deviant behavior, but rather an addressable education-impacting disability, perhaps his story would not have been one of the 70,237—every one of whom, to some degree, representing a failure of policy—told in 2017. Radical as it may initially appear, the possibility that an avenue exists by which students like Jason can be seen and served in their schools is worth exploring.

I. SUBSTANCE USE DISORDERS AND CURRENT EDUCATION POLICY

Part I of this Article presents the case for the recognition of substance use disorders in federal special education law: Section I.A examines substance use disorders as “medical” conditions and “legal” disabilities; and Section I.B explores why schools are resistant to interpreting drug abuse by adolescents through these medical and legal constructs.

A. Substance Use Disorders in Medicine and Law

The term “substance use disorder” will be used frequently throughout this Article. This is in part because (as discussed below) the term “substance use disorder” is preferable to terms such as “addiction” and “alcoholism.”23See infra Section I.B.2. But more importantly, using such “medical” terminology when discussing drug abuse by adolescents reinforces a central argument of this Article: that seeing substance-involved adolescents as having medical conditions (as opposed to merely engaging in criminal behaviors) opens the door to recognition of and support for those adolescents under federal disability laws. Accordingly, a brief framing of “substance use disorders” within medicine and law is in order.

1. Substance Use Disorders as Medical Conditions

According to the fifth edition of the Diagnostic and Statistical Manual of Mental Disorders (“DSM-5”), the “essential feature” of substance use disorders—regardless of the particular substance being abused—is a “cluster of cognitive, behavioral, and physiological symptoms indicating that the individual continues using the substance despite significant substance-related problems.”24Am. Psychiatric Ass’n, Diagnostic and Statistical Manual of Mental Disorders 483 (5th ed. 2013) [hereinafter DSM-V]. In other words, individuals with substance use disorders continue to abuse substances despite the consequences stemming from that abuse, even when such individuals no longer desire to use drugs or obtain much pleasure from doing so.25See Gary Wenk, James C. Anthony, Hui Cheng, Brian Fairman & Dan Romer, The Neurobiology, Characteristics, and Prevalence of Substance Use, in Treating and Preventing Adolescent Mental Health Disorders 373, 376 (Dwight L. Evans, Edna B. Foa, Raquel E. Gur, Herbert Hendin, Charles P. O’Brien, Daniel Romer, Martin E. P. Seligman & B. Timothy Walsh eds., 2nd ed. 2017).

Ten separate classes of drugs are discussed in the DSM-5: “alcohol; caffeine; cannabis; hallucinogens (with separate categories for phencyclidine [or similarly acting arylcyclohexylamines] and other hallucinogens); inhalants; opioids; sedatives, hypnotics, and anxiolytics; stimulants (amphetamine-type substances, cocaine, and other stimulants); tobacco; and other (or unknown) substances.”26DSM-V, supra note 24, at 481. Misuse of any of these drugs, with the exception of caffeine,27See id. at 483. can result in an individual meeting the diagnostic criteria of a substance use disorder.28The symptoms that are assessed to determine a use disorder are as follows:

     A problematic pattern of [substance] use leading to clinically significant impairment or distress, as manifested by at least two of the following, occurring within a 12-month period:

[Substance] is often taken in larger amounts or over a longer period than was intended.

There is a persistent desire or unsuccessful efforts to cut down or control [substance] use.

A great deal of time is spent in activities necessary to obtain [substance], use [substance], or recover from its effects.

Craving, or a strong desire or urge to use [substance].

Recurrent [substance] use resulting in a failure to fulfill major role obligations at work, school, or home.

Continued [substance] use despite having persistent or recurrent social or interpersonal problems caused or exacerbated by the effects of [substance].

Important social, occupational, or recreational activities are given up or reduced because of [substance] use.

Recurrent [substance] use in situations in which it is physically hazardous.

[Substance] use is continued despite knowledge of having a persistent or recurrent physical or psychological problem that is likely to have been caused or exacerbated by [substance].

Tolerance, as defined by either of the following:

   A need for markedly increased amounts of [substance] to achieve intoxication or desired effect.

A markedly diminished effect with continued use of the same amount of [substance].

Id. at 490–91. Specific indicators of the remaining criteria, “withdrawal,” differ depending upon the substance involved. See id. at 490–578.

The absence of symptoms (with the exception of cravings) for one year or longer indicates that the substance use disorder is in “sustained remission.”29Id. at 491. Establishing and prolonging remission from an active substance use disorder, which is achieved by preventing relapses of the previously abused substance or the transitioning to another drug, is a primary goal of substance use disorder treatment.30See Tammy Chung & Stephen A. Maisto, Relapse to Alcohol and Other Drug Use in Treated Adolescents: Review and Reconsideration of Relapse as a Change Point in Clinical Course, 26 Clinical Psych. Rev. 149, 149 (2006).

Although relapse is a common part of the recovery process, a variety of therapeutic approaches can be employed to promote relapse prevention and increase the likelihood of long-term remission from substance use disorders.31See Sean Estaban McCabe, Brady T. West, Stephen Strobbe & Carol J. Boyd, Persistence/Recurrence and Remission from DSM-5 Substance Use Disorders in the United States: Substance-Specific and Substance-Aggregated Correlates, 93 J. Substance Abuse Treatment 38, 49 (2018). Compare Lori A. Quigley & G. Alan Marlatt, Relapse Prevention: Maintenance of Change After Initial Treatment, in Addictions: A Comprehensive Guidebook 370, 371 (Barbara S. McCrady & Elizabeth E. Epstein eds., 1999) (describing a relapse prevention model that employs cognitive behavioral therapy), with Brenna L. Greenfield, Corey Roos, Kylee J. Hagler, Elena Stein, Sarah Bowen & Katie A. Witkiewitz, Race/Ethnicity and Racial Group Composition Moderate the Effectiveness of Mindfulness-Based Relapse Prevention for Substance Use Disorder, 81 Addictive Behav. 96, 97 (2018) (describing an application of cognitive-behavioral relapse prevention principles using mindfulness techniques). Critically, all approaches require a degree of intentionality and effort on the part of the individual with the substance use disorder and, ideally, their family or other support network.32See Chung, supra note 30, at 150. See generally Quigley, supra note 31 (describing a relapse prevention model that requires ongoing therapeutic intervention). Establishing sustained remission from a substance use disorder is a long-term process that, for some, involves a personal commitment to lifelong abstinence from all mind-altering substances.33Alcoholics Anonymous 58–60 (4th ed. 2001).

In 2020, the most recent year for which data is available, approximately 40.3 million people aged twelve and older met the diagnostic criteria for a substance use disorder.34SAMHSA Report, supra note 20, at 3. A significant gap exists between the number of individuals who need treatment for substance use disorders and the number of individuals who receive such treatment.35Id. at 4. Many individuals with substance use disorders have co-occurring mental health issues.36Id.

2. Substance Use Disorders as Legal Disabilities

As reflected by policymakers’ remarks to the 2019 Prescription Drug Abuse and Heroin Summit, there has been a notable, if incomplete, movement toward recognizing substance use disorders as “medical” conditions most efficaciously addressed through treatment and community support.37Donald J. Trump, Remarks at the Rx Drug Abuse and Heroin Summit in Atlanta, Georgia, Am. Presidency Project (April 24, 2019), https://www.presidency.ucsb.edu/documents/remarks-the-rx-drug-abuse-and-heroin-summit-atlanta-georgia [https://perma.cc/6HV7-7PAB]. Even now, however, the idea that substance use disorders can be recognized within, and addressed by, federal disability laws may strike some as odd—if not wrongheaded.

Indeed, when presented with the argument that adolescents with substance use disorders should be seen and served by federal disability laws, many will likely find it more difficult to accept the premise that such laws should recognize substance use disorders in the first place than to accept the premise that such recognition should be extended to adolescents. But the first premise above has been in effect since the mid-1970s.38Rehabilitation Act of 1973—Coverage of Alcoholics & Drug Addicts, 43 Op. Att’y Gen. 75, 80 (1977) (“For the foregoing reasons, we believe that alcoholics and drug addicts were within the scope of the definition of ‘handicapped individuals’ in the Rehabilitation Act of 1973 as originally enacted.”).

The first major piece of federal disability-rights legislation was the Rehabilitation Act of 1973. The following language, contained in Section 504 of the Rehabilitation Act (“Section 504”) represents “the first explicit Congressional statement recognizing ‘discrimination’ against people with disabilities.”39Peter Blanck, Eve Hill, Charles D. Siegal & Michael Waterstone, Disability Civil Rights Law and Policy 31 (3d ed. 2004).

No otherwise qualified individual with a disability in the United States, as defined in section 705(20) of this title, shall, solely by reason of her or his disability, be excluded from the participation in, be denied the benefits of, or be subjected to discrimination under any program or activity receiving Federal financial assistance.4029 U.S.C. § 794(a).

The Americans with Disabilities Act (“ADA”), which was passed in 1990, extends the Rehabilitation Act’s discrimination prohibitions to private companies,4142 U.S.C. §§ 12111–12117. local and state governments,42Id. §§ 12131–12165. and public accommodations.43Id. §§ 12181–12189.

During the initial drafting and subsequent revisions of the Rehabilitation Act and the ADA, lawmakers directly confronted the possibility of adults with substance use disorders seeking workplace accommodations.44See Rehabilitation Act of 1973—Coverage of Alcoholics & Drug Addicts, 43 Op. Att’y Gen. 75, 80 (1977). Regulations under both statutes acknowledge addiction and alcoholism as disabilities deserving of certain protections45See 28 C.F.R. § 35.108(b)(2) (2022) (“Physical or mental impairment includes . . . drug addiction and alcoholism.”). and clarify the obligations of employers of individuals with substance use disorders.46See 29 U.S.C. § 705(20)(C)(v) (“For purposes of . . . employment, the term ‘individual with a disability’ does not include any individual who is an alcoholic whose current use of alcohol prevents such individual from performing the duties of the job in question or whose employment, by reason of such current alcohol abuse, would constitute a direct threat to property or the safety of others.”); see also 29 C.F.R. § 1630.3(b)(1)–(3) (2022) (“However, the terms disability and qualified individual with a disability may not exclude an individual who: (1) Has successfully completed a supervised drug rehabilitation program and is no longer engaging in the illegal use of drugs, or has otherwise been rehabilitated successfully and is no longer engaging in the illegal use of drugs; or (2) Is participating in a supervised rehabilitation program and is no longer engaging in such use; or (3) Is erroneously regarded as engaging in such use, but is not engaging in such use.”). A body of case law and legal scholarship further articulates those boundaries.47See generally Dustin Riddle & Richard Bales, Disability Claims for Alcohol-Related Misconduct, 82 St. John L. Rev. 699 (2008) (containing accounts of how workplace-focused nondiscrimination law protects individuals with substance use disorders); Reese John Henderson, Jr., Addiction as Disability: The Protection of Alcoholics and Drug Addicts Under the Americans with Disabilities Act of 1990, 44 Vand. L. Rev. 713 (1991); Judith J. Johnson, Rescue the Americans with Disabilities Act from Restrictive Interpretations: Alcoholism as an Illustration, 27 N. Ill. U. L. Rev. 169 (2007); Renee Parsons & Thomas J. Speiss III, Does the Americans with Disabilities Act Really Protect Alcoholism?, 20 Lab. Law. 17 (2004); Amy L. Hennen, Protecting Addicts in the Employment Arena: Charting a Course Toward Tolerance, 15 L. & Ineq. 157 (1997).

The Rehabilitation Act and ADA’s statutory language, regulatory guidance, and case law uniformly discharge any obligation on the part of employers to accommodate active drug use (or alcohol abuse that interferes with work obligations) by individuals with substance use disorders.48See Henderson, Jr., supra note 47. Employees who are “currently engaging in the illegal use of drugs” are not considered “qualified individual[s] with a disability” under the ADA and are therefore not entitled to workplace accommodations, reasonable or otherwise.4929 C.F.R. § 1630.3(a) (2022). However, employees with substance use disorders who have maintained sobriety beyond a minimum period of abstinence—the necessary length of which is determined on a case-by-case basis50See U.S. Equal Emp. Opportunity Comm’n, EEOCM1A, A Technical Assistance Manual on the Employment Provisions (Title I) of the Americans with Disabilities Act § 8.3 (1992) (“ ‘Current’ drug use means that the illegal use of drugs occurred recently enough to justify an employer’s reasonable belief that involvement with drugs is an on-going problem. It is not limited to the day of use, or recent weeks or days, in terms of an employment action. It is determined on a case-by-case basis.”). This protection for individuals in remission from substance use disorders is sometimes referred to as the ADA’s “safe harbor” provision. See Shafer v. Preston Mem’l Hosp. Corp., 107 F.3d 274, 276 (4th Cir. 1997) (“While expressly excluding current drug users from statutory protection [in the workplace], the statutes provide a ‘safe harbor’ for recovering addicts . . . .”); Mauerhan v. Wagner Corp., 649 F.3d 1180, 1185 (10th Cir. 2011) (“[T]he ADA also creates a ‘safe harbor’ for those who are not currently engaging in the illegal use of drugs.”).—are entitled to reasonable accommodations for their continued recovery-support needs.51See Samuel Brown Petsonk & Anne Marie Lofaso, Working for Recovery: How the Americans with Disabilities Act and State Human Rights Laws Can Facilitate Successful Rehabilitation for Alcoholics and Drug Addicts, 120 W. Va. L. Rev. 891, 906 (2018) (“Reasonable accommodations for recovering addicts may include part-time schedules to support inpatient or outpatient behavioral therapy, and may also include flex time or intermittent leave to participate in random drug screenings, rehab sessions, Alcoholics or Narcotics Anonymous meetings, physical activity, medically-assisted treatment (combining behavioral therapy with medications, such as Suboxone, to treat substance abuse disorders), or other recovery-related appointments.”).

The fact that these regulations cannot be neatly transferred to the school environment—which, obviously, has significant implications for the project of extending coverage under the Rehabilitation Act and ADA to students with substance use disorders—will be discussed in Section II.B.

B. Substance Use Disorders Among Adolescents

Substance use disorders are diagnosable, treatable, “medical” conditions that can, under certain conditions, be recognized as “legal” disabilities. But if an individual’s drug abuse is interpreted not as evidence of such a “medical” condition, and is instead interpreted only as maladaptive, dangerous, and unlawful behavior, then there is no possibility of such drug abuse being recognized and addressed as a “legal” disability. As discussed below, adolescents are especially vulnerable to such incomplete interpretations of their substance abuse, which perhaps is why a discussion regarding the inclusion of substance use disorders within special education laws has not yet occurred. Section B.1 discusses the prevalence of substance use disorders among adolescents and the education-related consequences of such disorders; Section B.2 discusses certain perceptual errors that prevent widespread recognition of substance use disorders among adolescents; and Section B.3 discusses the harms caused by schools’ resistance to recognizing substance use disorders among adolescents.

1. The Educational Impact of Substance Use Disorders

Substance use disorders among adolescents are considered to be a major public health challenge that presents certain difficulties distinct from the challenges presented by substance use disorders among adults.52See Danielle E. Ramo, Mark A. Prince, Scott C. Roesch & Sandra A. Brown, Variation in Substance Use Relapse Episodes Among Adolescents: A Longitudinal Investigation, 43 J. Substance Abuse Treatment 44, 44 (2012); Pilar M. Sanjuan & James W. Langenbucher, Age-Limited Populations: Youth, Adolescents, and Older Adults, in Addictions: A Comprehensive Guidebook 477, 477, 479 (Barbara S. McCrady & Elizabeth E. Epstein eds., 1999). See generally William L. White, Michael Dennis & Frank M. Tims, Adolescent Treatment: Its History and Current Renaissance, 3 Couns. 20 (2002), for a particularly insightful summation of the history of adolescent substance use disorder treatment. According to the most recent Substance Abuse and Mental Health Services Administration (“SAMHSA”) National Survey on Drug Use and Health, an estimated 6.3% of adolescents—1.6 million individuals—met the diagnostic criteria for substance use disorder.53SAMHSA Report, supra note 20, at 28. Given that the average class size in secondary schools is approximately twenty-seven students, one could visualize this prevalence by imagining that every middle and high school class in the country has one or two students with a substance use disorder.54See Nat’l Ctr. for Educ. Stat., U.S. Dep’t of Educ., NCES 2105-011, Digest of Education Statistics 2013, at 138 tbl.209.30 (2015).

The same “treatment gap” that exists for individuals with substance use disorders generally also exists for adolescents with substance use disorders. Only 0.7% of adolescents—169,000 individuals—received any substance abuse treatment in 2020, which is slightly over 10% of the total number of adolescents who needed such treatment.55SAMHSA Report, supra note 20, at 38.

While more research is needed to further understand the nature and mechanisms of substance use disorders among adolescents,56See H.W. Andersson, Merethe Wenaas & Trond Nordfjærn, Relapse After Inpatient Substance Use Treatment: A Prospective Cohort Study Among Users of Illicit Substances, 90 Addictive Behav. 222, 225–26 (2019); David G. Weissman, Roberta A. Schriber, Catherine Fassbender, Olivia Atherton, Cynthia Krafft, Richard W. Robins, Paul D. Hastings & Amanda E. Guyer, Earlier Adolescent Substance Use Onset Predicts Stronger Connectivity Between Reward and Cognitive Control Brain Networks, 16  Dev. Cognitive Neuroscience 121, 127 (2015); Margot Peeters, Tim Janssen, Karin Monshouwer, Wouter Boendermaker, Thomas Pronk, Reinout Wiers & Wilma Vollebergh, Weaknesses in Executive Functioning Predict the Initiating of Adolescents’ Alcohol Use, 16 Dev. Cognitive Neuroscience 139, 144 (2015); Tammy Chung, David J. Paulsen, Charles F. Geier, Beatriz Luna & Duncan B. Clark, Regional Brain Activation Supporting Cognitive Control in the Context of Reward Is Associated with Treated Adolescents’ Marijuana Problem Severity at Follow-Up: A Preliminary Study, 16 Dev. Cognitive Neuroscience 93, 99 (2015). and debates over certain aspects of the condition are ongoing within the medical community,57See Ramo et al., supra note 52, at 46; Hollis C. Karoly, Angela D. Bryan, Barbara J. Weiland, Andrew Mayer, Andrew Dodd & Sarah W. Feldstein Ewing, Does Incentive-Elicited Nucleus Accumbens Activation Differ by Substance of Abuse? An Examination with Adolescents, 16 Dev. Cognitive Neuroscience 5, 13 (2015). the notion that adolescents can and do have substance use disorders is uncontroversial among medical professionals.58See supra notes 52–57 and accompanying text. See generally Monica Luciana & Sarah W. Feldstein Ewing, Introduction to the Special Issue: Substance Use and the Adolescent Brain: Developmental Impacts, Interventions, and Longitudinal Outcomes, 16 Dev. Cognitive Neuroscience 1, 2 (2015) (presenting findings from an array of medical studies and scholarship focused upon substance use disorders among adolescents).

And naturally, because adolescents spend a significant percentage of their waking hours in school,59See Table 5.14: Number of Instructional Days and Hours in the School Year, by State: 2018, Nat’l Ctr. for Educ. Stat., U.S. Dep’t of Educ. (Jun. 6, 2018), https://nces.ed.gov/programs/
statereform/tab5_14.asp [https://perma.cc/HLT7-92UU]; Table 1: Enrollment Status of the Population 3 Years Old and Over, by Sex, Age, Race, Hispanic Origin, Foreign Born, and Foreign-Born Parentage: October 2016, U.S. Census Bureau (Aug. 23, 2017), https://www.census.gov/data/tables/2016/demo/
school-enrollment/2016-cps.html [https://perma.cc/3U7Q-XLSF] (showing that 98% of individuals aged 14 and 15 and 93% of individuals aged 16 and 17 attend school).
many of the harms posed by substance use disorders among adolescents manifest within the school environment. While the concept that substance use disorders are likely to negatively impact school performance is intuitive, the specific manners in which they can do so—and how the effects of substance use disorders may resemble other, recognized disabilities—are deserving of review, if only to clarify the manners in which schools can serve affected students.

The purpose and objectives of schooling extend beyond the academic learning process; schools play a critical role in students’ social development and the fostering of time- and task-management skills critical to future achievement.60See Brown v. Bd. of Educ., 347 U.S. 483, 493 (1954) (“[School] is a principal instrument in awakening the child to cultural values, in preparing him for later professional training, and in helping him to adjust normally to his environment.”); see also Margareth Etienne, Education, Violence, and Re-Wiring Our Schools, 2018 U. Chi. Legal F. 89, 115 (2018) (“Schools can play a formidable role in the academic and social development of children in both positive and negative ways.”). This holds for students receiving special education services as well: the concept of “education” as encompassing more than academic instruction is reinforced by the stated purpose of the Individuals with Disabilities Education Act (“IDEA”),61See 20 U.S.C. § 1400(d)(1)(A) (“The purposes of [the IDEA statute is to] . . . prepare [children with disabilities] for further education, employment, and independent living . . . .”). guidance from the U.S. Department of Education’s Office of Special Education Programs,62See Letter to Pawlisch, 24 Individuals with Disabilities Educ. L. Rep. 959 (Mar. 6, 1996) (“In determining whether a child’s impairment adversely affects educational performance, the multidisciplinary team must consider non-academic as well as academic areas.”); Letter to Lybarger, 17 Individuals with Disabilities Educ. L. Rep. 54 (Sept. 14, 1990) (“[A] child’s educational performance must be determined on an individual basis and should include non-academic as well as academic areas.”). and judicial interpretation of special education law.63See Robert A. Garda, Jr., Untangling Eligibility Requirements Under the Individuals with Disabilities Education Act, 69 Mo. L. Rev. 441, 473 (2004) (“[A] majority of courts hold that a child must progress in more than just graded areas in order to be provided educational benefit and a free appropriate public education.”). For the purposes of this analysis, the objectives of school can be roughly bifurcated into those that are academic—both classroom learning itself and the process of learning how to learn and retain information—and those connected with the socialization process. The emergence of a substance use disorder in adolescence can significantly impede progress in both spheres.

Substance abuse by adolescents has been shown to impair verbal memory, memory retrieval, executive function, and learning performance.64See Michael Takagi, Murat Yücel, Susan M. Cotton, Yasmin Baliz, Alan Tucker, Kathryn Elkins & Dan I. Lubman, Verbal Memory, Learning, and Executive Functioning Among Adolescent Inhalant and Cannabis Users, 72 J. Stud. Alcohol & Drugs 96, 103 (2010). Frequent substance use can cause “measurable and long-lasting cognitive impairments.”65Nora D. Volkow, Ruben D. Baler, Wilson M. Compton & Susan R.B. Weiss, Adverse Health Effects of Marijuana Use, 370 New Eng. J. Med. 2219, 2221 (2014). For additional examples of the manners in which substance abuse can have a lasting impact upon cognition, see also Nat’l Inst. on Drug Abuse, Principles of Adolescent Substance Use Disorder Treatment: A Research-Based Guide 4 (2014), https://nida.nih.gov/sites/default/files/podat-guide-adolescents-508.pdf [https://
perma.cc/SN5N-H7M8]; Weissman et al., supra note 56, at 127; Catherine Orr, Rowen Morioka, Brendan Behan, Sameer Datwani, Marika Doucet, Jelena Ivanovic, Clare Kelly, Karen Weierstall, Richard Watts, Bobby Smyth & Hugh Garavan, Altered Resting-State Connectivity in Adolescent Cannabis Users, 39 Am. J. Drug & Alcohol Abuse 372, 372 (2013); Anita Cservenka, Scott A. Jones & Bonnie J. Nagel, Reduced Cerebellar Brain Activity During Reward Processing in Adolescent Binge Drinkers, 16 Developmental Cognitive Neuroscience 110, 118 (2015); Francesca M. Filbey, Tim McQueeny, Samuel J. DeWitt & Virendra Mishra, Preliminary Findings Demonstrating Latent Effects of Early Adolescent Marijuana Use Onset on Cortical Architecture, 16 Developmental Cognitive Neuroscience 16, 16 (2015); Kerry M. Green, Elaine E. Doherty & Margaret E. Ensminger, Long-Term Consequences of Adolescent Cannabis Use: Examining Intermediary Processes, 43 Am. J. Drug & Alcohol Abuse 561, 567–68 (2017); Luciana & Feldstein Ewing, supra note 58, at 2; Sanjuan & Langenbucher, supra note 52, at 483, 487; Melissa Patricia Lopez-Larson, Jadwiga Rogowska & Deborah Yurgelun-Todd, Aberrant Orbitofrontal Connectivity in Marijuana Smoking Adolescents, 16 Developmental Cognitive Neuroscience 54, 61 (2015).
While the most acute cognitive effects of substance abuse are present during periods of frequent use, the scaffolded nature of secondary education can extend the consequences of failing to learn critical foundational information well into the future.66See Volkow et al., supra note 65. The academic impact of substance abuse is also reflected in the lower grades consistently found among students with substance use disorders.67See Ana I. Balsa, Laura M. Giuliano & Michael T. French, The Effects of Alcohol Use on Academic Achievement in High School, 30 Econ. Educ. Rev. 1, 8–10 (2010); Brittany A. Bugbee, Kenneth H. Beck, Craig S. Fryer & Amelia M. Arria, Substance Use, Academic Performance, and Academic Engagement Among High School Seniors, 89 J. Sch. Health 145, 145–46 (2019); Michael L. Dennis, Westley Clark & Larke N. Huang, The Need and Opportunity to Expand Substance Use Disorder Treatment in School-Based Settings, 7 Advances Sch. Mental Health Promotion 75, 82 (2014); Andrew J. Finch, Emily Tanner-Smith, Emily Hennessy & D. Paul Moberg, Recovery High Schools: Effect of Schools Supporting Recovery from Substance Use Disorders, 44 Am. J. Drug & Alcohol Abuse 175, 175 (2018); see also Robert L. DuPont, Kimberly M. Caldeira, Helen S. DuPont, Kathryn B. Vincent, Corinne L. Shea & Amelia M. Arria, Inst. for Behav. & Health, Inc., America’s Dropout Crisis: The Unrecognized Connection to Adolescent Substance Use 26–29 (2013), https://www.cls.umd.edu/docs/AmerDropoutCrisis.pdf [https://perma.cc/2Q28-NWC8].

Substance use disorders also hinder adolescents’ social development. The illegality of substance use can result in criminal charges and involvement with the juvenile justice system; a significant percentage of adolescents in juvenile detention meet the criteria for substance use disorders.68See Laurie Chassin, Juvenile Justice and Substance Use, Future Child., Fall 2008, at 165, 167. Adolescents who abuse substances are also at a higher risk of dropping out of school, which in turn can produce a myriad of social and economic harms.69See generally DuPont et al., supra note 67 (providing a thorough review of studies indicating an increased dropout risk for substance-involved students).

Given the prevalence of substance use disorders among adolescents, and the fact that substance use disorders are highly likely to negatively impact the education of adolescents, it is striking that the argument that substance use disorders should be recognized under special education law remains a novel one. A reason for this, as argued below, is that substance abuse by adolescents is too infrequently assessed through a clinical, “medicalized” lens and is instead too frequently assessed through a punitive, “disciplinary” lens.

2. Perceptual Barriers for Adolescents with Substance Use Disorders

Sociologists have long examined the role played by social constructs and labeling in assigning meaning to human behavior.70See Allan V. Hortitz, Creating Mental Illness 7–8 (2002). These methods of categorization are not “constant, but [instead] change according to the dominant modes of thinking.”71Id.; see also Paul S. Graubard, Children with Behavioral Disabilities, in Exceptional Children in the Schools: Special Education in Transition 245–46 (2d ed. 1973). These processes have had a significant impact upon drug policy insofar as they heavily influence the manners in which individuals with substance use disorders are perceived by society.72See infra notes 73–75 and accompanying text.

Indeed, over a century of policy responses to drug and alcohol abuse have been significantly influenced by the dominant social constructions of substance-abusing individuals.73See Lisa N. Sacco, Cong. Rsch. Serv., R43749, Drug Enforcement in the United States: History, Policy, and Trends 2–10 (2014); Katharine A. Neill, Tough on Drugs: Law and Order Dominance and the Neglect of Public Health in U.S. Drug Policy, 6 World Med. & Health Pol’y 375, 379–81 (2012). For example, an “alcoholic-as-sinner” construct undergirded the temperance movement;74Geoffrey R. Stone, The Second Great Awakening: A Christian Nation, 26 Ga. St. U. L. Rev. 1305, 1322 (2010). an “addict-as-criminal” construct inspired the “War on Drugs.”75See Neill, supra note 73, at 383–84. The degree to which racial animus contributed to the addict-as-criminal construct cannot be overstated. See John P. Hoffmann, The Historical Shift in the Perception of Opiates: From Medicine to Social Menace, 22 J. Psychoactive Drugs 53, 57 (1990); Kenneth B. Nunn, Race, Crime and the Pool of Surplus Criminality: Or Why the “War on Drugs” Was a “War on Blacks,” 6 J. Gender Race & Just. 381, 383 (2002); Michael L. Rosino & Matthew W. Hughey, Speaking Through Silence: Racial Discourse and Identity Construction in Mass-Mediated Debates on the “War on Drugs,” 4 Soc. Currents 246, 246 (2017); Michael Tonry, Race and the War on Drugs, 1994 U. Chi. Legal F. 25, 27 (1994).

The heavy influence of social constructs in this space is apparent when considering one’s own responses to various labels associated with substance-involved individuals. Terms such as “addict” and “alcoholic” are not used in the DSM-V to describe individuals with substance use disorders, nor is the term “addiction.”76See DSM-V, supra note 24, at 485. While some individuals, such as those participating in Twelve Step recovery fellowships, choose to use such language—or more-graphic terms such as “junkie” and “dope fiend”—to describe themselves, the use of such terms to describe individuals with substance use disorders is discouraged due to their negative connotations.77See Alcoholics Anonymous, supra note 33, at 60; Narcotics Anonymous 3 (6th ed. 2008). See generally Lauren M. Broyles, Confronting Inadvertent Stigma and Pejorative Language in Addiction Scholarship: A Recognition and Response, 35 Substance Abuse 217 (2014) (describing the potential negative ramifications of the use of various colloquialisms relating to substance use disorders).

Such terminology has the secondary effect of erasing substance-involved adolescents altogether. To give an example, while a twelve-year-old child can meet the diagnostic criteria for an alcohol use disorder—the adolescents with alcohol use disorders in the aforementioned SAMHSA study included twelve-year-old respondents78See SAMHSA Report, supra note 20, at 36.—describing a twelve-year-old child as an “alcoholic” reflexively appears to be misguided, if not outright impossible. Such dissonance surely stems from the notion that “alcoholism” requires years of problematic drinking to develop. But, as discussed in Section I.A. above, the diagnostic criteria for substance use disorders does not require a minimum age of onset or duration of symptoms. The notion that it requires years of drinking “alcoholically” before an individual can meet the criteria of an alcohol use disorder is simply wrong.

The logical extension of that incorrect belief is that problematic, harm-causing substance abuse by an adolescent is attributable not to a substance use disorder but rather to less-sympathetic causes such as youthful experimentation or simple defiance. Such a belief can prevent adolescents with substance use disorders from obtaining needed medical intervention and likely provides a tacit justification for punitive disciplinary policies.

Describing adolescents as “addicts” is perhaps easier to accept, insofar as a certain percentage of adolescents abuse drugs such as opiates that create an obvious physical dependence and precipitate rapid physical withdrawal symptoms.79An estimated 0.3% of all adolescents aged 12 to 17—approximately 80,000 individuals—had an opioid use disorder in 2020. See SAMHSA Report, supra note 20, at 30. In other words, the abuse of certain drugs can cause symptoms that do align with our conceptions of “addiction” regardless of the age of the drug abuser.

But other drugs, such as marijuana, can cause physical withdrawal symptoms that last for weeks and are often mistaken for general irritability or depression.80Elena M. Kouri & Harrison G. Pope Jr., Abstinence Symptoms During Withdrawal from Chronic Marijuana Use, 8 Experimental & Clinical Psychopharmacology 483, 484 (2000). So despite the fact that marijuana use disorder is the most prevalent of all substance use disorders among adolescents,81See SAMHSA Report, supra note 20, at 30. marijuana abuse does not align as neatly within the social construct of “addiction,” which requires physical tolerance to and withdrawal from a drug. When drug and alcohol abuse by adolescents often does not align with our constructs of “addiction” or “alcoholism,” such behavior is vastly more likely to be addressed within a punitive, “disciplinary” framework. Nowhere is this more evident than in schools.

3. Schools’ Outdated and Ineffective Responses to Substance-Involved Students

While a particular construct can achieve a measure of dominance on a societal level, various entities within society operate under their own dominant modes of thinking.82See Graubard, supra note 71, at 245–46. See generally Lois A. Weithorn, Envisioning Second-Order Change in America’s Responses to Troubled and Troublesome Youth, 33 Hofstra L. Rev. 1305 (2005) (providing a brilliant and thorough analysis of the various intervention systems for youth exhibiting maladaptive behavior). Imagine, for example, a father who finds illicitly obtained opiate painkillers in his teenage daughter’s room and decides to take bold action in response. The nature of the response will depend significantly, if not entirely, upon the entity he contacts; the local police would likely address the situation differently from a substance abuse treatment center or a priest. If the painkillers were discovered in the girl’s school locker, however, the available responses would be limited by district-level or statewide disciplinary policies.83See School Discipline Laws & Regulations by Category, Nat’l Ctr. on Safe & Supportive Learning Env’ts, https://safesupportivelearning.ed.gov/discipline-compendium/choose=state?field_
sub_category_value=Substance+use [https://perma.cc/4F5B-35Y2] (select “Discipline Addressing Specific Code of Conduct Violations,” then “Substance Use,” then each state individually and click “Apply”).

In a 2012 study of the drug- and alcohol-related policies of the one hundred largest school districts in the country, disciplinary responses to incidents of drug possession, use, sales, and distribution (including referral to law enforcement) were far more prevalent than interventions intended to detect and address possible substance use disorders.84See Nat’l Ctr. on Educ. & Econ., What Are Districts’ Written Policies Regarding Student Substance-Related Incidents? 1–2 (2012), https://ies.ed.gov/ncee/pubs/20124022/pdf/
20124022.pdf [https://perma.cc/BH2Y-5JBU].
Though only 15% of districts’ policies referenced obtaining written assessments for potential substance dependence and 55% allowed for referrals to substance abuse counseling, intervention, and treatment programs following possession or use offenses, 98% referenced the imposition of principal-determined suspensions, 90% recommended expulsion hearings, 86% allowed for reporting to law enforcement, and 80% referenced placement in alternative schools or programs.85Id. at 3. Only 26% of districts referenced prevention education in their drug or alcohol policies, and only 44% referenced school-based interventions or remediations.86Id. at 4.

So-called “zero tolerance” policies towards drug- and alcohol-related infractions have been criticized for being ineffective,87See S. Patrick Wynne, Zero-Tolerance Policies in U.S. Schools Are Ineffective and Unaffordable, Juv. Just. Info. Exch. (Jan. 14, 2013), https://jjie.org/2013/01/14/zerotolerance-policies-schools-ineffective-unaffordable-2 [https://perma.cc/K45L-MEY5]; Catherine Winter, Spare the Rod: Amid Evidence Zero Tolerance Doesn’t Work, Schools Reverse Themselves, APM Reps. (Aug. 25, 2016), https://www.apmreports.org/episode/2016/08/25/reforming-school-discipline [https://perma.cc/WV89-5HPS]. punitive,88See Christopher Boccanfuso & Megan Kuhfeld, Multiple Responses, Promising Results: Evidence-Based, Nonpunitive Alternatives to Zero Tolerance 2–3
(2011), http://www.childtrends.org/wp-content/uploads/2011/03/Child_Trends-2011_03_01_RB_AltTo
ZeroTolerance.pdf [https://perma.cc/N3CT-LL2F]; Karen Dolan, Punitive, Zero Tolerance Policies
Are Endangering Our Students, Inst. Pol’y Stud. (Aug. 31, 2018), https://ips-dc.org/punitive-zero-tolerance-policies-are-endangering-our-students [https://perma.cc/ZRU9-KKVA].
and overbroad.89See Kevin P. Brady, Zero Tolerance or (In)Tolerance Policies? Weaponless School Violence, Due Process, and the Law of Student Suspensions and Expulsions: An Examination of Fuller v. Decatur Public School Board of Education School District, 2002 BYU Educ. & L.J. 159, 177 (2002); Kathy Koch, Zero Tolerance for School Violence: Is Mandatory Punishment in Schools Unfair?, 10 CQ Researcher 185, 191 (2000). There is also a degree to which such policies are too narrow, insofar as their focus—and therefore utility—extends only to the boundaries of active drug possession and use. Put another way, current methods of addressing adolescent drug abuse in schools focus more on the drugs being used than on the adolescents using them. When drugs are removed from a situation, through successful policy initiatives or carceral force, the particular “drug problem” ceases to exist: no laws are broken, and the threat to school safety disappears.

This framing of the problem of student drug use fails to recognize the fundamental nature of substance use disorders insofar as it presumes that the unwanted behavior of student drug use can be deterred through consequences, when continued use in spite of consequences is one of the indicators of substance use disorders.90See supra notes 20–23 and accompanying text. Furthermore, achievement of such policies’ primary objective—the cessation of drug possession and use—would not fully address students’ substance use disorders, as achieving long-term recovery is an active endeavor that persists far beyond the cessation of substance use.91See supra notes 41–47 and accompanying text.

Recognizing substance use disorders as diagnosable and treatable medical conditions, as well as education-impacting disabilities, provides a clearer lens through which to view adolescent substance abuse, albeit one with profoundly complicated implications. What were once considered merely to be willful acts of defiance could instead be interpreted to be ineffective and destructive attempts of self-medication.92Rudolf H. Moos, Theory-Based Processes That Promote the Remission of Substance Use Disorders, 27 Clinical Psych. Rev. 537, 539 (2007). The value of deterrence mechanisms, absent attempts to address the underlying motivations for substance abuse, diminishes if not vanishes.93See supra notes 87–90 and accompanying text. In short, when substance use disorders are cognizable conditions in schools, the problematic activity of adolescent drug abuse necessitates a far greater degree of interpretative complexity.

The challenge this presents, its implications on the allocation of limited resources such as time and funding, and a reasonable desire to avoid controversial decision-making all serve as likely resistance points to the recognition of substance use disorders under special education law. That is why this Article seeks to instigate a new conversation among educators, policymakers, and scholars regarding how to best see and serve substance-involved students. To that end, Part II below will place substance use disorders within the two spheres of special education laws under which public schools operate, which will highlight current impediments to the recognition of substance-involved students and the areas of the law where recognition and accommodations could plausibly be obtained.

II. SITUATING SUBSTANCE USE DISORDERS WITHIN SPECIAL EDUCATION LAW

Two of the three major disability-rights statutes under which public schools operate—Section 5049429 U.S.C. § 794; 34 C.F.R. pt. 104 (2022). and Title II of the ADA9542 U.S.C. §§ 12131–12134; 28 C.F.R. pt. 35 (2022). —currently offer sufficient tools to procure recognition of, and a degree of support for, certain students with substance use disorders. As discussed below, however, such students can neither be seen nor served under the other statute—the IDEA.9620 U.S.C. §§ 1400–1420; 34 C.F.R. §§ 300.1–304.32 (2022).

A. The Individuals with Disabilities Education Act

      The first federal law to mandate that states receiving federal education funding provide “all handicapped children [with] a free appropriate public education” was the Education for All Handicapped Children Act of 1975 (“EAHCA”).97Education for All Handicapped Children Act of 1975, Pub. L. No. 94–142, 89 Stat. 775 (1975) (codified as amended at 20 U.S.C. § 1232). The EAHCA was the product of many years of congressional lobbying from parents and advocates for children with disabilities.98See Wendy F. Hensel, Symposium, Sharing the Short Bus: Eligibility and Identity Under the IDEA, 58 Hastings L.J. 1147, 1148 (2006); Mark C. Weber, The IDEA Eligibility Mess, 57 Buff. L. Rev. 83, 88 (2009). It was also influenced by two federal cases that upheld procedural due process and equal protection claims in favor of students with disabilities who had been excluded or otherwise denied services from their public schools.99See id.; Mills v. Bd. of Educ., 348 F. Supp. 866, 878–79 (D.D.C. 1972); Pa. Ass’n for Retarded Child. v. Pennsylvania, 343 F. Supp. 279, 302–03 (E.D. Pa. 1972). The EAHCA was reauthorized in 1990, at which time its name was changed to the Individuals with Disabilities Education Act.100See Weber, supra note 98, at 88.

The IDEA seeks to ensure that all students with qualifying disabilities and corresponding educational needs receive a “free appropriate public education.”10120 U.S.C. § 1401(9); 34 C.F.R. § 300.17 (2022). The manners in which schools provide a free appropriate public education to IDEA-qualified students are articulated in students’ Individualized Education Programs (“IEPs”).10220 U.S.C. § 1401(4); 34 C.F.R. § 300.22 (2022). Each student’s IEP must articulate which “special education and related services” the child is entitled to receive in order to meet their specific educational goals.10320 U.S.C. § 1414(d)(1)(A)(i); 34 C.F.R. § 300.320(a)–(c) (2022). Furthermore, the free appropriate public education offered to each child, codified by their IEP, must be provided in the least restrictive environment in which they can attain their individualized educational objectives.10420 U.S.C. § 1412(a)(5)(A); see 34 C.F.R. § 300.114(a)(2) (2022). These entitlements are provided to children who meet the IDEA’s disability criteria105See infra notes 115–17 and accompanying text. and whose disability also “adversely affects [the] child’s educational performance”106See infra notes 141, 143–44 and accompanying text. in a manner that creates the need for “special education and related services.”10734 C.F.R. § 300.8(a)(1) (2022). Each of these eligibility prongs will be analyzed in more detail below.108See infra Section II.A.1.

In addition to the substantive right to a free appropriate public education, the IDEA provides certain procedural rights in disputes between parents or otherwise interested third parties and schools.10920 U.S.C. § 1415(a); 34 C.F.R. § 300.500 (2022). Parents or public agencies may file a “due process complaint” on any matter relating to the “identification, evaluation or educational placement” of a child with a disability.11020 U.S.C. § 1415(b)(6)(A); 34 C.F.R. § 300.507(a)(1) (2022). Parties to disputes are afforded access to a timely mediation process conducted by a “qualified and impartial mediator.”11120 U.S.C. § 1415(e)(2)(A)–(E); 34 C.F.R. § 300.506(b)(1)–(5)) (2022). The mediator’s decision in due process disputes can subsequently be challenged in a civil court action; in certain cases, such an action can be filed prior to full exhaustion of the administrative process.112See 20 U.S.C. § 1415(i)(2)(A); 34 C.F.R. § 300.516 (2022); see also Lewis M. Wasserman, Delineating Administrative Exhaustion Requirements and Establishing Federal Courts’ Jurisdiction Under the Individuals with Disabilities Education Act: Lessons from the Case Law and Proposals for Congressional Action, 29 J. Nat’l Ass’n Admin. L. Judiciary 349, 384–412 (2009).

The IDEA, and the EAHCA before it, have made a positive impact on public education and the lives of millions of children with disabilities.113See Office Special Educ. Programs, U.S. Dep’t of Educ., History: Twenty-Five Years of Progress in Educating Children With Disabilities Through IDEA 2–4 (2007), https://files.
eric.ed.gov/fulltext/ED556111.pdf [https://perma.cc/9HXT-CUJF].
That said, many scholars have noted that the IDEA’s overly restrictive eligibility criteria appear to conflict with its stated objective of “ensur[ing] that all children with disabilities have available to them a free appropriate public education.”11420 U.S.C. § 1400(d) (emphasis added); 34 C.F.R. § 300.1 (2022) (emphasis added); see also Ellen A. Callegary, The IDEA’s Promise Unfulfilled: A Second Look at Special Education & Related Services for Children with Mental Health Needs After Garret F., 5 J. Health Care L. & Pol’y 164, 183–87 (2002); Kevin Golembiewski, Disparate Treatment and Lost Opportunity: Courts’ Approach to Students with Mental Health Disabilities Under the IDEA, 88 Temp. L. Rev. 473, 484–92 (2016); L. Kate Mitchell, “We Can’t Tolerate That Behavior in This School!”: The Consequences of Excluding Children with Behavioral Health Conditions and the Limits of the Law, 41 N.Y.U. Rev. L. & Soc. Change 407, 415–16 (2017); Weber, supra note 98, at 89–102; Julia C. Dimoff, The Inadequacy of the IDEA in Assessing Mental Health for Adolescents: A Call for School-Based Mental Health, 6 DePaul J. Health Care L. 319, 330–32 (2003). Indeed, students with substance use disorders are functionally invisible under the IDEA.

1. Eligibility Barriers for Students with Substance Use Disorders

a. “Child with a Disability”

In order to receive services under the IDEA, a student must first qualify as a “child with a disability.”11520 U.S.C. § 1401(3)(A)(i); 34 C.F.R. § 300.8(a)(1) (2022). The following disabilities—and only the following disabilities—are recognized under the IDEA: “intellectual disabilities, hearing impairments (including deafness), speech or language impairments, visual impairments (including blindness), serious emotional disturbance, . . . orthopedic impairments, autism, traumatic brain injury, other health impairments, or specific learning disabilities.”116Id. The IDEA’s regulatory guidance provides further clarification regarding the requisite components of each disability.117See 34 C.F.R. § 300.8(c) (2022). Failure to meet the criteria for a “child with a disability” precludes a student from receiving services under the IDEA.11820 U.S.C. § 1401(3)(A)(i); 34 C.F.R. § 300.8(a)(1) (2022). While two of the IDEA’s qualifying disabilities—emotional disturbance11920 U.S.C. § 1401(3)(A)(i); 34 C.F.R. § 300.8(c)(4) (2022). and other health impairments12020 U.S.C. § 1401(3)(A)(i); 34 C.F.R. § 300.8(c)(9) (2022).—may initially appear to encompass students with substance use disorders, the conditions for the disabilities that are articulated in the IDEA’s regulations would make such recognition difficult to obtain.

i. Emotional Disturbance

In order to obtain recognition under the IDEA as a child with an emotional disturbance, a student must, “over a long period of time and to a marked degree that adversely affects [the student’s] educational performance,”12134 C.F.R. § 300.8(c)(4)(i) (2022). exhibit one or more of the following characteristics:

(A) An inability to learn that cannot be explained by intellectual, sensory, or health factors.

(B) An inability to build or maintain satisfactory interpersonal relationships with peers and teachers.

(C) Inappropriate types of behavior or feelings under normal circumstances.

(D) A general pervasive mood of unhappiness or depression.

(E) A tendency to develop physical symptoms or fears associated with personal or school problems.122Id.

The regulations further state that emotional disturbance “includes schizophrenia” but does not apply to children who are “socially maladjusted” unless they exhibit one of the criteria provided above.123Id. § 300.8(c)(4)(ii) (2022).

The general deficiencies in this regulatory language have been catalogued at length.12420 U.S.C. § 1400(d); 34 C.F.R. § 300.1 (2022); see also Callegary, supra note 114, at 183–87; Golembiewski, supra note 114, at 484–92; Mitchell, supra note 114, at 415–16; Weber, supra note 98, at 89–102; Dimoff, supra note 114, at 330–32. Insofar as students with substance use disorders are concerned, it should be noted that the criteria for emotional disturbances do not align with the DSM-V’s criteria for substance use disorders;125See supra note 28. some students would exhibit sufficient criteria under both conditions to obtain classification as a student with a substance use disorder and an emotional disturbance, but other students with substance use disorders would fail to meet the emotional disturbance criteria entirely. Nor does the requirement that qualifying behavior be exhibited “over a long period of time”12634 C.F.R. § 300.8(c)(4)(i) (2022). reflect the DSM-V’s relative lack of emphasis upon the amount of time symptoms of substance use disorders must be present prior to a diagnosis.127See supra note 28. Furthermore, the clause referencing “socially maladjusted”12834 C.F.R. § 300.8(c)(4)(ii) (2022). students—a term used to describe juvenile delinquency at the time the IDEA’s precursor was drafted129See Weber, supra note 98, at 111.—appears to serve little purpose other than to bias decisionmakers against classifying certain types of maladaptive behavior as evidence of emotional disturbance.130This assessment of the social maladjustment clause is informed by its curious placement in the regulatory definition of emotional disturbance: evidence of social maladjustment does not disqualify a student from receiving a disability classification of emotionally disturbed if one or more of the other five factors are present. 34 C.F.R. § 300.8(c)(4)(ii) (2022). As this would be a requirement regardless of the social maladjustment clause, the clause’s purpose—other than to present the false implication that emotional disturbance criteria resulting from social maladjustment does not qualify a student for an emotional disturbance classification—is unclear. For scholarly critiques of the social maladjustment clause, see Virginia Costenbader & Roberta Buntaine, Diagnostic Discrimination Between Social Maladjustment and Emotional Disturbance: An Empirical Study, 7 J. Emotional & Behav. Disorders 2, 3–4 (1999); Callegary, supra note 114, at 189; Cynthia A. Dieterich, Nicole D. Snyder & Christine J. Villane, A Legal Study of Children with Emotional Disturbance and Mental Health Needs and Implications for Practice, 45 J.L. & Educ. 39, 46–48 (2016); Weithorn, supra note 82, at 1357–59; Lucy W. Shum, Note, Educationally Related Mental Health Services for Children with Serious Emotional Disturbance: Addressing Barriers to Access Through the IDEA, 5 J. Health Care L. & Pol’y 233, 244–46 (2002); and Felicia Winder, Note, Childhood Trauma and Special Education: Why the “IDEA” Is Failing Today’s Impacted Youth, 44 Hofstra L. Rev. 601, 623–24 (2015).

Evidence of student drug use in the records of IDEA-based civil actions appears to dissuade reviewing judges from attributing student behavior to an underlying emotional disturbance in IDEA-based appeals.131This matter was directly addressed in Springer v. Fairfax City School Board, 134 F.3d 659 (4th Cir. 1998), wherein a student’s “use of illegal substances . . . and reckless and risk-taking acts” was attributed to a “conduct disorder” that triggered the social maladjustment clause and precluded a finding of emotional disturbance under the IDEA. Id. at 664 (internal quotation marks omitted). For additional examples of the social maladjustment clause precluding a finding of emotional disturbance for students with a history of substance abuse, see Dale M. v. Bd. of Educ. of Bradley-Bourbonnais High Sch. Dist. No. 307, 237 F.3d 813, 817 (7th Cir. 2001); and Tracy v. Beaufort Cty. Bd. of Educ., 335 F. Supp. 2d 675, 688–89 (D.S.C. 2004). In denying a student eligibility under the emotionally disturbed category, Judge Richard Posner attributed the child’s drug use and criminal record to “a lack of proper socialization” and noted that while the child’s substance abuse “interferes with his schooling . . . it interferes with much else besides, such as [his] ability to conform to the law and avoid jail.”132Dale M., 237 F.3d at 817. A district court opinion, also denying eligibility, noted that “[t]eenagers . . . can be a wild and unruly bunch. Adolescence is, almost by definition, a time of social maladjustment for many people.”133Springer, 134 F.3d at 664. Some courts consider substance abuse to be a de facto indicator of social maladjustment.134W.G. v. N.Y.C. Dep’t of Educ., 801 F. Supp. 2d 142, 155 (S.D.N.Y. 2011) (supporting a denial of emotional disturbance finding with a student’s psychologist’s assessment that “under the IDEA students who are socially maladjusted or have a history of substance abuse [do] not qualify for the disability classification of emotional disturbance”). The student under review had a diagnosis of cannabis dependence under DSM-IV criteria, which delineated between “abuse” and “dependence” in a manner that was not continued in the DSM-V. See id. at 153; see also Substance Abuse & Mental Health Servs. Admin., Impact of the DSM-IV to DSM-V Changes on the National Survey on Drug Use and Health 10 (2016), https://www.samhsa.gov/data/sites/default/files/NSDUH-DSM5Impact
AdultMI-2016.pdf [https://perma.cc/D5BM-HYHT].
Given the barriers to receiving an emotional disturbance disability classification faced by all students with maladaptive school behaviors, and the particular barrier of the social maladjustment clause for students with a history of substance abuse, widespread acknowledgement of student substance use disorders via the emotionally disturbed category of IDEA-eligible disabilities is unlikely.

ii. Other Health Impairment

To qualify for IDEA services under the “other health impairment” category, a student must have “limited strength, vitality, or alertness, including a heightened alertness to environmental stimuli, that results in limited alertness with respect to the educational environment” due to a “chronic or acute health problem[] such as asthma, attention deficit disorder or attention deficit hyperactivity disorder, diabetes, epilepsy, a heart condition, hemophilia, lead poisoning, leukemia, nephritis, rheumatic fever, sickle cell anemia, and Tourette syndrome [that] adversely affects a child’s educational performance.”13534 C.F.R. §  300.8(c)(9) (2022). The requirement that the health condition create a “limited alertness with respect to the educational environment” is far easier to demonstrate than the emotional disturbance criteria;136See Hensel, supra note 98, at 1164. it also more closely aligns with the typical manifestations of substance use disorders.137See supra Section I.A. Other obstacles exist, however, for individuals seeking recognition of substance use disorders as an “other health impairment.”

While the category’s list of “chronic or acute health problems” that are considered “other health impairments” is non-exhaustive,13834 C.F.R. § 300.8(c)(9) (2022) (stating “chronic or acute health problems such as [list of OHI-recognized disabilities]” (emphasis added)). the absence of substance use disorders leaves the decision of whether to acknowledge a particular student’s disorder to the special education team at the student’s school, subject to review of a mediator and, if appealed, a state or federal judge.139The overwhelming majority of IDEA-based causes of action are filed in federal courts. See James R. Newcomer & Perry A. Zirkel, An Analysis of Judicial Outcomes of Special Education Cases, 65 Exceptional Child. 469, 474 (1999) (finding that 85% of IDEA-based civil suits were filed in federal court as of 1999). This figure has remained consistent twenty years after this initial finding. See Laura J. Granelli & Beth L. Sims, Special Education Disputes: Litigate or Settle: That Is the Question 6–7 (2018), https://www.nyssba.org/clientuploads/nyssba_pdf/Events/precon-law-2018/06-special-ed-disputes-outline.pdf [https://perma.cc/RR3H-Y57D]. Though such a finding would not be outside the realm of possibility, two factors diminish its likelihood. One, the lack of precedent for a substance use disorder being classified as an “other health impairment” compromises advocates’ ability to effectively argue for such a classification and would likely give reviewing authorities pause before making such a determination. Furthermore, the fact that other conditions (such as attention-deficit/hyperactivity disorder and Tourette syndrome) have been added to the original list of “other health impairments” in formal amendments to the IDEA’s regulations140 See IDEA-Part B Final Regulations: Children With ADD/ADHD—Topic Brief, U.S. Dep’t
of Educ., https://fbaum.unc.edu/lobby/063_IDEA/Agency_Activities/Education/ED_Children_With_
ADD_ADHAD_0399.htm (Apr. 14, 1999) [https://perma.cc/6SHH-9JSD]; Dixie Snow Huefner, Commentary, The Final Regulations for the Individuals with Disabilities Education Improvement Act (IDEA ‘04), 217 Educ. L. Rep. 1, 3 (2007). For additional analysis regarding attention-deficit hyperactivity disorder, see infra Section II.A.1.c.
might dissuade school personnel or reviewing authorities from recognizing a condition absent from the regulations that has not been subjected to a similar degree of review and approval.

Another significant barrier impeding the classification of substance use disorders as an “other health impairment” is the secondary requirement—which also applies to findings of emotional disturbance and all other qualifying disabilities under the IDEA—that the impairment “adversely affects a child’s educational performance.”14134 C.F.R. § 300.8(c)(1)–(9) (2022). Indeed, most disputes over whether a student should receive IDEA services under the “other health impairment” classification focus not upon the existence of a disability but rather the degree to which that disability adversely affects the child’s educational performance.142See Hensel, supra note 98, at 1164, 1170.

b. “Adversely Affects Educational Performance”

Neither the IDEA statutory text nor its regulations clearly articulate the type and extent of adverse effect a disability must have upon a student’s educational performance in order for the child to qualify for IDEA services.143See 20 U.S.C. §§ 1401–1482; 34 C.F.R. § 300.8 (2022); see also Callegary, supra note 114, at 186. This element of the IDEA’s eligibility criteria has been a longstanding focus of scholarly critique.144See Garda, supra note 63, at 481–86 (2004); Theresa Glennon, Disabling Ambiguities: Confronting Barriers to the Education of Students with Emotional Disabilities, 60 Tenn. L. Rev. 295, 355–56 (1993); Jamie Lynne Thomas, Comment, Decoding Eligibility Under the IDEA: Interpretations of “Adversely Affect Educational Performance,” 38 Campbell L. Rev. 73, 97–104 (2016); Weber, supra note 98, at 116–18. For an astute critique of the most-common judicial interpretations of this clause, see generally Golembiewski, supra note 114. While students with substance use disorders who receive recognition of their disability as an “other health impairment” would face much of the same difficulty as other students with disabilities in demonstrating the adverse effect of their disability (and corresponding need for special education and related services), the unique nature of substance use disorders poses particular challenges in this space.

These challenges can be distinguished between those that would likely be faced by students who are actively using substances at the time of an eligibility determination or IEP meeting and those likely to be faced by students in remission145Here, “remission” does not need to align with the DSM-V’s requirement of a minimum of three months free from symptoms of substance use disorders; students who are not currently engaging in drug or alcohol use and are not in need of intensive medical intervention fall within this category. from a substance use disorder. While students in remission would likely face fewer barriers in this space than substance-involved students, demonstrating sufficient adverse effects upon their educational performance that can be attributable to their substance use disorder might nevertheless be difficult. For one, the educators and reviewing entities making the eligibility determination may not fully understand the unique profile of substance use disorders and the manner in which they can continue to symptomatically manifest—and, possibly, adversely affect the student’s educational performance—even when a student is in remission from active drug use.146See supra Section I.A. Additionally, the existence of alternative vehicles of support for students with disabilities that do not feature as-stringent eligibility criteria—Section 504 and the ADA—might diminish the perceived significance of recognizing a substance use disorder in remission under the IDEA.147See infra Section II.B. Finally, the delicate balance of being in remission from a substance use disorder, and that disorder concurrently being recognized as adversely affecting the student’s educational performance to a degree that warrants special education and related services, is ever-vulnerable to disruption by the common occurrence of relapses.148See supra note 31 and accompanying text.

Students who do not use or possess drugs at school but instead manifest the adverse effects of substance use disorders primarily at home can also “fall[] without” the “outer boundaries of IDEA eligibility.”149R.C. v. York Sch. Dep’t, Civil No. 07-177-P-S, 2008 U.S. Dist. LEXIS 75538, at *72 (D. Me. Sept. 25, 2008) (finding that a “deeply troubled young woman who suffered serious adverse effects from [depression], but displayed virtually none in school” was ineligible for IDEA services), aff’d, 2008 U.S. Dist. LEXIS 98762 (D. Me. Dec. 5, 2008). The “inappropriate behaviors” catalogued by the court, however, included “being drunk or high . . . in school.” Id. According to the Department of Education’s Office of Special Education Programs, because the IDEA’s provisions “relate to the educational environment . . . for eligibility purposes, the student must meet the [adverse effect requirement] within the educational environment.”150Letter to Anonymous, 213 Educ. for Handicapped L. Rep. 247, 249 (Aug. 11, 1989); see also Garda, supra note 63, at 479. Unfortunately, such policies fail to acknowledge the degrees to which the consequences of substance use disorders extend beyond periods of active drug use.151See supra Section I.A.

The remaining category of students, those with substance use disorders who commit drug-related offenses at school, would likely have the most-obvious claim that their disability is adversely affecting their educational performance. The significance of this finding, however, would be diminished by the disciplinary (and possibly legal) consequences the students would face following the infraction. Furthermore, the discovery of active substance abuse either at home or at school can result in parents seeking a degree of support for their students that schools are typically unwilling to fund.152See infra Section.II.A.2.b. While these particular elements are distinct from the inquiry concerning IDEA eligibility for students with substance use disorders, they would factor significantly into the manner in which such students would be served under the IDEA were they to meet the initial eligibility criteria.

c. “Needs Special Education”

One eligibility prong remains: students who meet the aforementioned criteria must also “need[] special education and related services.”15320 U.S.C. § 1401(3)(A)(ii); 34 C.F.R. § 300.8(c) (2022). The IDEA defines special education as “specially designed instruction, at no cost to parents, to meet the unique needs of a child with a disability.”15420 U.S.C. § 1401(29). The term “specially designed instruction” is defined as “adapting, as appropriate to the needs of an eligible child . . . the content, methodology, or delivery of instruction.”15534 C.F.R. § 300.39 (2022). The IDEA provides a list of “related services” that “may be required to assist a child with a disability to benefit from special education,” including “psychological services, . . . social work services, . . . counseling services, including rehabilitation counseling . . . and medical services . . . for diagnostic or evaluation purposes.”15620 U.S.C. § 1401(26)(A); see also 34 C.F.R. § 300.24(a) (2022). Whether a student needs “special education,” as opposed to accommodations such as preferential seating or mobility assistance, is often a determining factor in whether a student meets IDEA eligibility or the more-expansive Section 504 eligibility criteria.157See Garda, supra note 63, at 487 (“Section 504’s coverage is broader than IDEA’s because it does not consider the child’s need for special education.”). For an analysis of Section 504’s eligibility criteria as applied to students with substance use disorders, see infra Section II.B.

While the eligibility requirement that a student must “need[] special education” is logically “intertwined” with the requirement that a student’s IDEA-recognized disability “adversely affects” their educational performance, they are distinct inquires.158See Garda, supra note 63, at 490. Complicating this analysis is the fact that the statutory and regulatory language of the IDEA does not clarify (beyond the aforementioned definitions) which modifications constitute “special education” and which are simply best practices that address individual student needs.159See Hensel, supra note 98, at 1174 (“Despite the statute’s thirty[-]year existence, there is little agreement among courts and scholars as to what type of services fall under this umbrella.”); Weber, supra note 98, at 120 (“The reality is that there exists no precise definition for ‘need special education’ beyond the meaning of the words themselves.”); see also Garda, supra note 63, at 486–90.

The eligibility barriers discussed above are likely sufficient to preclude recognition of students with substance use disorders under the IDEA, rendering the discussion of whether such students need “special education” primarily theoretical at present. Nevertheless, advocates seeking IDEA reform must clearly establish that—if the statute were amended to recognize substance use disorders as education-impacting disabilities—there are available special education practices that could serve such students. Two foundations for this argument exist. One can first analogize the manner in which the students with substance use disorders could be served under the IDEA to the manner in which students with attention-deficit hyperactivity disorder (“ADHD”) are currently being served under the IDEA.160See infra notes 162–66 and accompanying text. One can then glean examples of “specially designed instruction” from school-based programs that currently serve students with substance use disorders, such as recovery schools.161See infra notes 167–70 and accompanying text.

ADHD and substance use disorders are “inextricably intertwined.”162Elizabeth Harstad, Sharon Levy & Committee on Substance Abuse, Attention-Deficit/Hyperactivity Disorder and Substance Abuse, 134 Pediatrics e293, e293 (2014). Children with ADHD are at a significantly higher risk of developing substance dependence than children without ADHD,163See Steve S. Lee, Kathryn L. Humphreys, Kate Flory, Rebecca Liu & Kerrie Colass, Prospective Association of Childhood Attention-Deficit/Hyperactivity Disorder (ADHD) and Substance Use and Abuse/Dependence: A Meta-Analytic Review, 31 Clinical Psych. Rev. 328, 338 (2011). The same study also found that “early ADHD strongly predicts future substance abuse/dependence in adolescence/adulthood and that this association is largely impervious to demographic and methodological factors that varied across each study.” Id. at 337. and rates of ADHD among adolescents receiving treatment for substance use disorders are significantly higher than among the general population of their peers.164See Susan Merle Gordon, Frank Tulak & Joseph Troncale, Prevalence and Characteristics of Adolescent Patients with Co-Occurring ADHD and Substance Dependence, 23 J. Addictive Diseases 31, 31–32 (2004) (“Estimates of co-morbidity of SUD [substance use disorders] and ADHD in addiction treatment settings range from 30% to 50%, while community prevalence rates are approximately 3% to 5%.” (citations omitted)); Katelijne van Emmerik-van Oortmerssen, Geurt van de Glind, Wim van den Brink, Filip Smit, Cleo L. Crunelle, Marije Swets & Robert A. Schoerers, Prevalence of Attention-Deficit Hyperactivity Disorder in Substance Use Disorder Patients: A Meta-Analysis and Meta-Regression Analysis, 122 Drug & Alcohol Dependence 11, 12–13 (2012) (showing an overall ADHD prevalence of 25% among adolescent subjects of the meta-analysis, compared to 5% of the general child population). Like substance use disorders, ADHD is correlated with poor academic performance, higher risk of dropout, and an increased risk of involvement with the juvenile justice system.165See William J. Barbaresi, Slavica K. Katusic, Robert C. Colligan, Amy L. Weaver & Steven J. Jacobsen, Long-Term School Outcomes for Children with Attention-Deficit/Hyperactivity Disorder: A Population-Based Perspective, 28 J. Dev. & Behav. Pediatrics 265, 270 (2007); Regina Bussing, Dana M. Mason, Lindsay Bell, Phillip Porter & Cynthia Garvan, Adolescent Outcomes of Childhood Attention-Deficit/Hyperactivity Disorder in a Diverse Community Sample, 49 J. Am. Acad. of Child & Adolescent Psychiatry 595, 596, 601 (2010).

ADHD was not included in the examples of “other health impairments” in the IDEA’s original regulations; the condition was added following the IDEA Amendments of 1997.166See Paolo G. Annino, The New IDEA Regulations: The Next Step in Improving the Quality of Special Education, 23 Mental & Physical Disability L. Rep. 439, 439 (1999). In seeking similar recognition of substance use disorders, advocates need not entirely conflate such disorders with ADHD to nevertheless draw valid analogies between the two conditions. Both concern a medically grounded reassessment of maladaptive school behavior that, if left unaddressed, leaves students vulnerable to a higher risk of failure. Furthermore, to whatever degree the common symptoms of ADHD mirror the school performance of students with substance use disorders, similar special education and related services can be provided to the latter population.

Advocates can also look to programs that currently serve students with substance use disorders for examples of academic modifications and supportive services that allow such students to fully access their educational opportunities. Recovery schools, which provide integrated therapeutic support for students in remission from substance use disorders, are a valuable source of such knowledge and experience.167See D. Paul Moberg, Andrew J. Finch & Stephanie M. Lindsley, Recovery High Schools: Students and Responsive Academic and Therapeutic Services, 89 Peabody J. Educ. 165, 165 (2014) (“RHS [Recovery High School] programs are designed to meet both academic and therapeutic needs of adolescents who have received treatment for substance use disorders.”). See generally Approaches to Substance Abuse and Addiction in Education Communities: A Guide to Practices that Support Recovery in Adolescents and Young Adults (Jeffery D. Roth & Andrew J. Finch, eds., 2010) (describing recovery high schools from the perspectives of students, teachers, and administrators). Recovery schools’ academic programming is typically more flexible than traditional schools, both to provide students the opportunity to learn foundational material that was not obtained prior to entering treatment and to allow time for supportive services throughout the day.168See Moberg, supra note 167, at 174. Recovery schools also have small class sizes, which allow for a greater amount of individual student attention.169See Andrew J. Finch, D. Paul Moberg & Amanda Lawton Krupp, Continuing Care in High Schools: A Descriptive Study of Recovery High School Programs, 23 J. Child & Adolescent Substance Abuse 116, 123 (2014). Incorporating the principles and practices of recovery schools into public school systems would substantially alleviate the most pressing challenges of recovery schools—maintaining sustainability and offering a diverse suite of academic and elective courses170See Moberg, supra note 167, at 172–80.—by leveraging economies of scale.

Despite the valuable insight recovery schools can provide, the manners in which “the content, methodology, or delivery of instruction”17134 C.F.R. § 300.39(b)(3) (2022). can most-efficaciously be adapted for students with substance use disorders remains a significant opportunity for further study and innovation. Much more is known regarding the “related services”17220 U.S.C. § 1401(26)(A). schools can provide—and in some cases are already providing—to support this population. Approaches such as resilience theory,173See Rebecca Kate Hodder, Megan Freund, Luke Wolfenden, Jenny Bowman, Smriti Nepal, Julia Dray, Melanie Kingsland, Sze Lin Yoong & John Wiggers, Systematic Review of Universal School-Based ‘Resilience’ Interventions Targeting Adolescent Tobacco, Alcohol or Illicit Substance Use: A Meta-Analysis, 100 Preventative Med. 248, 257 (2017) (“[U]niversal school-based interventions that address adolescent ‘resilience’ protective factors as part of any intervention approach are effective in reducing adolescent illicit substance use, supporting the implementation of such universal school-based interventions to reduce illicit substance use by adolescents.”). peer network counseling,174See Michael J. Mason, Nikola M. Zaharakis, Michael Russell & Victoria Childress, A Pilot Trial of Text-Delivered Peer Network Counseling to Treat Young Adults with Cannabis Use Disorder, J. Substance Abuse Treatment, June 2018, at 1, 8 (finding that, while the study’s sample size was small, Peer Network Counseling interventions using text messages “may be efficacious in reducing cannabis related problems for those with moderate and high levels of CUD [Cannabis Use Disorder] severity, in reducing cannabis craving, and in reducing positive cannabis metabolites specimen results among young adults”). motivational interviewing,175See Elizabeth Barnett, Steve Sussman, Caitlin Smith, Louise A. Rohrbach & Donna Sprujit-Metz, Motivational Interviewing for Adolescent Substance Use: A Review of the Literature, 37 Addictive Behavs. 1325, 1327 (2012) (“Twenty-six trials (67%) showed significant reductions in some type of substance use.”). and cognitive-behavioral therapy176See Susan G. Forman & Laura Sharp, Substance-Abuse Prevention: School-Based Cognitive-Behavioral Approaches, in Cognitive-Behavioral Interventions in Educational Settings: A Handbook for Practice 557, 567–75 (Rosemary B. Mennuti, Ray W. Christner & Arthur Freeman eds., 2d ed. 2012) (offering examples of studies supporting the assertion that “cognitive-behavioral school-based prevention programs can have a positive impact on the prevalence of substance abuse among youth”). have all been demonstrated to improve outcomes for adolescents with substance use disorders.177For additional examples of adolescent substance abuse treatment practices, see Nat’l Inst. On Drug Abuse, supra note 65, at 22–29. School-based interventions can strengthen “social resistance skills,”178See Kenneth W. Griffin & Gilbert J. Botvin, Evidence-Based Interventions for Preventing Substance Use Disorders in Adolescents, 19 Child & Adolescent Psychiatric Clinics N. Am. 505, 510 (2010). provide “normative education” regarding the dangers of substance abuse,179See id. and focus on “competence-enhancement” that addresses other social needs.180See id. at 511.

It should also be noted that the provision of “special education and related services” to students with substance use disorders aligns with the value of inclusion underlying the policy that students are to be educated in the “least restrictive environment” in which their needs can be met.18120 U.S.C. § 1414(d)(1)(A)(i); 34 C.F.R. § 300.320(a)–(c) (2022). This is especially the case if such interventions can be performed at the outset of the disorder’s manifestation. Providing early, effective, and evidence-based interventions can allow students to remain integrated in their schools and home environments and forestall, or ideally preempt altogether, a need for residential placement or the threat of juvenile justice involvement.

In summary, the IDEA’s eligibility criteria currently present barriers to the recognition of students with substance use disorders that would likely require statutory or regulatory amendments to overcome. In addition to amending the IDEA’s eligibility criteria, there are two important policy considerations that are deserving of attention, debate, and a similarly tailored response: (1) the balance between schools’ non-negotiable need to maintain safe and drug-free campuses and students’ protections against disciplinary actions that are “manifestations” of their disabilities;18220 U.S.C. § 1415(k)(1)(F); 34 C.F.R. § 300.530(f) (2022). and (2) schools’ obligations to provide tuition reimbursement for residential treatment programs.

2. Further Policy Considerations: Manifestation Determinations and Residential Placements

a. Manifestation Determinations

If a student’s IEP team determines that a particular incident of school misbehavior is a manifestation of the child’s disability, the school, rather than levying punitive discipline, will conduct a “functional behavioral assessment, . . . implement a behavioral intervention plan[, and] . . . return the child to the placement from which the child was removed.”183Id. However, drug-related offenses trigger an exception to the IDEA’s standard protocol of determining whether a student’s misbehavior can be considered a “manifestation” of the student’s disability.18420 U.S.C. § 1415(k)(1)(G); 34 C.F.R. § 300.530(g) (2022). Students who are caught using or possessing drugs at school are thus subject to disciplinary action, referral to law enforcement, and removal to an alternative educational setting for up to forty-five days “without regard to whether the behavior is determined to be a manifestation of the child’s disability.”185Id. Under the IDEA, schools still have the discretion to hold manifestation determination hearings following drug-related infractions by students with disabilities, but they are not required to do so as they are with other infractions.186Id.

The fact that substance use disorders are functionally invisible within special education law has resulted in inconsistent outcomes of manifestation determinations involving drug-related offenses for students with IDEA-recognized disabilities.187For examples of findings that drug-related offenses were manifestations of IDEA-recognized disabilities, see School Bd. of Prince William Cnty., Va. v. Malone, 762 F.2d 1210, 1212 (4th Cir. 1985) (applying similar criteria from IDEA’s precursor, the Education for All Handicapped Children Act); Maple Heights City Sch. Bd. of Educ. v. A.C., No. 14CV1033, 2016 U.S. Dist. LEXIS 83100, at *25 (N.D. Ohio June 27, 2016); Edwin K. v. Jackson, No. 01 C 7115, 2002 U.S. Dist. LEXIS 11913, at *13–14 (N.D. Ill. July 1, 2002); Breen ex rel. Breen v. St. Charles R-IV Sch. Dist., 2 F. Supp. 2d 1214, 1218 (E.D. Mo. 1997), aff’d, No. 97-2788, 1998 U.S. App. LEXIS 7504 (8th Cir. Apr.15, 1998). For examples of findings that drug-related offenses were not manifestations of IDEA-recognized disabilities, see Bd. of Educ of Oak Park v. Nathan R. ex rel. Richard R., 199 F.3d 377, 379 (7th Cir. 2000); Bd. of Educ. of Vandalia Cmty. Unit Sch. Dist. No. 203 v. K.S., No. 15-CV-1048-DGW, 2016 U.S. Dist. LEXIS 131466, at *2 (S.D. Ill. Sept. 23, 2016); Fisher v. Friendship Pub. Charter Sch., 10-cv-886, 2012 U.S. Dist. LEXIS 59510, at *6–7 (D.D.C. Jan. 26, 2012); S.C. ex rel. Poland v. Union Twp. Sch. Corp., No. 09-CV-167, 2010 U.S. Dist. LEXIS 53562, at *2 (N.D. Ind. June 1, 2010); Y.B. ex rel. A.B. v. Williamson Cnty. Bd. of Educ., No. 08-0999, 2009 U.S. Dist. LEXIS 108701, at *3–4 (M.D. Tenn. Nov. 20, 2009); Gutin v. Wash. Twp. Bd. of Educ., 467 F. Supp. 2d 414, 420 (D.N.J. 2006); A.P. v. Pemberton Twp. Bd. of Educ., No. 05-3780, 2006 U.S. Dist. LEXIS 32542, at *2 (D.N.J. May 15, 2006); Farrin v. Me. Sch. Admin. Dist. No. 59, 165 F. Supp. 2d 37, 47 (D. Me. 2001). If a student with a substance disorder who is deemed to have met the aforementioned IDEA eligibility criteria—by, again, meeting the criteria for emotional disturbance or being recognized as having an “other health impairment,” either of which must adversely affect the student’s educational performance to a degree that requires special education—committed a drug-related offense at school, it would almost certainly be considered a manifestation of their disability. Nevertheless, absent an amendment to the current guidelines, the aforementioned exception would apply, and the student’s school district would still have the ability to discipline the student, refer the student to law enforcement, and remove the student to an alternative placement for up to forty-five days.18820 U.S.C. § 1415(k)(1)(G); 34 C.F.R. § 300.530(g) (2022).

Any legislative response that adheres to the value of school safety would ensure that schools maintain the flexibility to adequately respond to all drug-related offenses, including, if necessary, the temporary removal of students from campus. Such responses, however, should initiate, rather than foreclose, a dialogue regarding the “special education and related services” students are to be provided in their new placement. Without such support, the value of safety—both for the individual student and the school community—would be compromised upon the student’s return.

b. Residential Placements

The school’s response could be rendered moot, however, if the student is first withdrawn from the district by their parents and placed in a residential drug-treatment program. Under the IDEA’s regulations, parents of children who have previously received special education services and are dissatisfied by a school’s current provision of services can enroll their child in a private school program and file a due process action seeking reimbursement.18920 U.S.C. § 1412(a)(10)(C)(ii); 34 C.F.R. § 300.148(c) (2022). Such reimbursement is justified only if “the public placement violated IDEA and the private [here, residential] school placement was proper under the Act.”190Forest Grove Sch. Dist. v. T.A., 557 U.S. 230, 246 (2009) (citing Florence Cnty. Sch. Dist. Four v. Carter ex rel. Carter, 510 U.S. 7, 15 (1993)).

Federal circuits employ different tests for determining whether a particular student’s residential placement is proper (and therefore reimbursable).191See Ralph D. Mawdsley, Commentary, Applying the Forest Grove Balancing Test to Parent Reimbursement for Placement in Residential Medical Facilities, 253 West’s Educ. L. Rep. 521, 528–32 (2010). Under the oldest, most lenient, and most widely employed standard, courts assess “whether full-time placement may be considered necessary for educational purposes, or whether the residential placement is a response to medical, social or emotional problems that are segregable from the learning process.”192Kruelle v. New Castle Cnty. Sch. Dist., 642 F.2d 687, 693 (3d. Cir. 1981); see Mawdsley, supra note 191, at 528. The Seventh Circuit modified the above test to focus more directly upon the primary purpose of the chosen residential facility; reimbursement is not provided for placements that are “oriented more toward enabling the child to engage in noneducational activities.”193Dale M. ex rel. Alice M. v. Bd. of Educ. of Bradley-Bourbonnais High Sch. Dist., 237 F.3d 813, 817; see Mawdsley, supra note 191, at 528–29. The Fifth Circuit adopted elements of the aforementioned tests to create a two-part standard for proper residential placements: such placements “must be 1) essential in order for the disabled child to receive a meaningful educational benefit, and 2) primarily oriented toward enabling the child to obtain an education.”194Richardson Indep. Sch. Dist. v. Michael Z. ex rel. Leah Z., 580 F.3d 286, 299 (5th Cir. 2009); see Mawdsley, supra note 191, at 529–30. The second prong of this test is a “fact-intensive inquiry” that involves “weed[ing] out inappropriate treatments from the appropriate (and therefore reimbursable) ones.”195Michael Z., 580 F.3d at 301.

In practice, courts are reluctant to order reimbursement for programs designed to address substance use disorders for students with disabilities currently recognized under the IDEA.196For examples of courts denying parental reimbursement for substance use disorder or other mental health treatment programs in cases that involved students with IDEA-recognized disabilities and at least some history of substance use, see Fort Bend Indep. School Dist. v. Douglas A. ex rel. Z.A., 601 F. App’x 250, 253 (5th Cir. 2015); Forest Grove Sch. Dist. v. T.A., 638 F.3d 1234, 1239–40 (9th Cir. 2011); Mary T. v. Sch. Dist. of Phila., 575 F.3d 235, 248–49 (3d. Cir. 2009); C.T. ex rel. M.T. v. Croton-Harmon Union Free Sch. Dist., 812 F. Supp. 2d 420, 423 (S.D.N.Y. 2011); P.C. ex rel. K.C. v. Oceanside Union Free Sch. Dist., 818 F. Supp. 2d 516, 531–32 (E.D.N.Y. 2011); J.P. v. Enid Pub. Schs., No. CIV-08-0937-HE, 2009 U.S. Dist. LEXIS 87813, at *2 (W.D. Okla. Sept. 23, 2009); Rodriguez v. San Mateo Union High Sch. Dist., No. C 07-2360, 2008 U.S. Dist. LEXIS 111376, at *43 (N.D. Cal. July 3, 2008), aff’d, 357 F. App’x 752 (9th Cir. 2009); Green v. N.Y.C. Dep’t of Educ., No. 07 Civ. 1259 2008 U.S. Dist. LEXIS 32118, at *23 (S.D.N.Y. Mar. 31, 2008); Lauren V. v. Colonial Sch. Dist., No. 07-308, 2007 U.S. Dist. LEXIS 7836, at *34–35 (E.D. Pa. Oct. 22, 2007); Windsor-Plainsboro Reg’l Sch. Dist. Bd. of Educ. v. J.S. ex rel. M.S., No. 04-3459, 2005 U.S. Dist. LEXIS 25855, at *70–71 (D.N.J. Oct. 28, 2005); J.S. v. Shoreline Sch. Dist., 220 F. Supp. 2d 1175, 1191 (W.D. Wash. 2002). For examples of courts approving some degree of reimbursement for substance use disorder or other mental health treatment programs in cases that involved students with IDEA-recognized disabilities and at least some history of substance use, see Edmonds Sch. Dist. v. A.T., 299 F. Supp. 3d 1135, 1144 (W.D. Wash. 2017), aff’d, 780 F. App’x 491 (9th Cir. 2019) (approving reimbursement for a highly structured residential program for a student with a diagnosis of prodromal schizophrenia and a history of substance abuse); Sacramento City Unified Sch. Dist. v. R.H. ex rel J.H., No. 14-cv-01549-DB, 2016 U.S. Dist. LEXIS 140065, at *62–63 (E.D. Cal. Oct. 6, 2016) (holding a district responsible for placement that provided academic and mental health support due to the district’s failure to provide a free appropriate public education); Bd. of Educ. of City of Chi. v. Ill. Bd. of Educ., 13 C 2782, 2013 U.S. Dist. LEXIS 142134, at *12 (N.D. Ill. Oct. 1, 2013) (upholding reimbursement for a residential program that provided “drug treatment services [that were] incidental to, and enabled [the student] to benefit from, their academic programs”); Lauren G. ex rel. Scott G. v. W. Chester Area Sch. Dist., 906 F. Supp. 2d 375, 395 (E.D. Pa. 2012) (finding a placement that included substance abuse treatment services provided a “necessary ingredient for learning”); J.S. ex rel. R.S. v. S. Orange/Maplewood Bd. of Educ., No. 06-3494, 2008 U.S. Dist. LEXIS 24031, at *26–27 (D.N.J. Mar. 15, 2008) (awarding partial tuition reimbursement for a student with a history of substance abuse due to procedural violations on the part of the school district); New Paltz Cent. Sch. Dist. v. St. Pierre ex rel. M.S., 307 F. Supp. 2d 394, 401–02 (N.D.N.Y. 2004) (finding that a residential program, “[i]n addition to providing a drug-free environment . . . offered group and individual psychological counseling and cognitive-behavioral and confrontational therapies necessary to attain social and emotional stability”); Dep’t of Educ. v. Cari Rae S., 158 F. Supp. 2d 1190, 1200 (D. Haw. 2001) (classifying a short-term stay in a psychiatric hospital for a student with a history of substance abuse and mental health disabilities as falling within the IDEA’s “diagnostic and evaluative” exception to the general prohibition against reimbursement for medical services). A valid concern exists, however, that broadening the IDEA’s eligibility standards to encompass more students with mental health disorders would place a significant burden upon school districts to underwrite treatment costs.197Compare David S. Doty, A Desperate Grab for Free Rehab: Unilateral Placements Under IDEA for Students with Drug and Alcohol Addictions, 2004 BYU Educ. & L.J. 249, 267 (2004) (arguing that using the IDEA as a vehicle for providing substance use disorder treatment for students with other recognized disabilities would represent a waste of taxpayer dollars), with Erin M. Heidrich, Note, Expanding Access to Residential Treatments for Mentally Ill Youth Through the Individuals with Disabilities Education Act, 41 N. Ky. L. Rev. 295, 311 (2014) (arguing that early investment in mental health support ultimately saves taxpayer dollars).

In conclusion, students with substance use disorders will likely remain invisible under the IDEA. Fortunately, another mechanism exists by which this population can be seen and served in their schools: the prohibitions against disability-based discrimination by public entities contained in Section 504 and further contextualized by the ADA.

B. Section 504 and the Americans with Disabilities Act

Compared to the IDEA, Section 504 and the ADA appear to impose altogether different obligations upon schools: the IDEA imposes an affirmative duty to provide students that meet its exclusive eligibility criteria with a free appropriate public education198See 20 U.S.C. § 1401(9); 34 C.F.R. § 300.17 (2022). while Section 504 and the ADA contain strong prohibitions against disability-based discrimination.199See 34 C.F.R. § 104.1 (2022) (“The purpose of this part [of the Code of Federal Regulations] is to effectuate section 504 of the Rehabilitation Act of 1973, which is designed to eliminate discrimination on the basis of handicap in any program or activity receiving Federal financial assistance.”); 42 U.S.C. § 12101(b)(2) (“It is the purpose of [the ADA statute] . . . to provide clear, strong, consistent, enforceable standards addressing discrimination against individuals with disabilities.”). Functionally, however, Section 504 and the ADA stand alongside the IDEA as powerful mechanisms by which students with disabilities can be seen and served in their schools.200See Christopher J. Walker, Note, Adequate Access or Equal Treatment: Looking Beyond the IDEA to Section 504 in a Post-Schaffer Public School, 58 Stan. L. Rev. 1563, 1588 (2006) (“Once [its] regulations were in place, Section 504 became a powerful tool for combating disability discrimination in employment, as well as in preschool, elementary, secondary, and postsecondary education.”).

1. Section 504 and the ADA’s Eligibility Requirements and Protections

Unlike the IDEA, Section 504 and the ADA’s current eligibility standards provide sufficient opportunity to recognize and serve certain students with substance use disorders. While the IDEA only recognizes particular enumerated disabilities,201See supra Section II.A.1.a. Section 504 and the ADA prohibit discrimination against—and provide needed protections for—all students for whom “a physical or mental impairment . . . substantially limits one or more major life activities.”20234 C.F.R. § 104.3 (2022); 42 U.S.C. § 12102(1)(A).

According to Section 504’s school-specific regulations, schools must provide each “qualified handicapped person” with a “free appropriate public education.”20334 C.F.R. § 104.33 (2022). To be considered “handicapped” under Section 504 and the ADA, students must have “any physiological disorder or condition . . . affecting one or more” of a broad list of bodily systems,204Of the bodily systems listed, “neurological” is most directly relevant to substance use disorders. See 34 C.F.R. § 103 (2022); 29 U.S.C. § 794. or “any mental or psychological impairment,”20534 C.F.R. § 104.3(j)(2)(i)(B) (2021). that substantially limits one or more “major life activities,” including learning, reading, and concentrating.20642 U.S.C. § 12102(2). The non-exhaustive list of major life activities also includes “caring for oneself, performing manual tasks, seeing, hearing, eating, sleeping, walking, standing, lifting, bending, speaking, breathing . . . , thinking, communicating, and working.” Id. An individual who exhibits a sufficient number of DSM-V symptoms to qualify for diagnosis of a substance use disorder would likely meet the initial eligibility criteria for Section 504 protections. Subsequent ADA regulatory language—particularly its requirement that “the definition of disability . . . shall be construed in favor of broad coverage of individuals”20742 U.S.C. § 12102(4)(A); see also 29 C.F.R. § 1630.2(j)(1)(i) (2022) (explaining that the ADA Amendments Act of 2008 is to be “construed broadly in favor of expansive coverage”).—informs the manner in which Section 504 eligibility is to be determined by educators.

Because Section 504 and the ADA’s disability-based eligibility criteria are far less restrictive than the IDEA’s,208See supra notes 112–15. virtually all students with IDEA-recognized disabilities receive concurrent recognition and protections under Section 504 and the ADA, while some students who are eligible for “Section 504 Plans”209Although Section 504 and the ADA operate in tandem in defining eligibility and general protections for students with disabilities, Section 504 alone articulates the substantive right to a “free appropriate public education.” 34 C.F.R. § 104.33 (2022); see also K.M. ex rel. Bright v. Tustin Unified Sch. Dist., 725 F.3d 1088, 1099 (9th Cir. 2013) (“[I]ndeed, Title II does not impose any FAPE [free appropriate public education] requirement.”). Accordingly, individual student accommodations are generally referred to as “Section 504 Plans” or simply “504 Plans.” See Perry A. Zirkel, Comparison of IDEA IEP’s and Sec. 504 Accommodations Plans, 191 Educ. L. Rep. 563, 563 (2004). are ineligible for accommodations under the IDEA.210See Perry A. Zirkel & Tiedan Huang, State Rates of 504-Only Students in K-12 Public Schools: An Update, 354 Educ. L. Rep. 621, 624–25 (2018) (discussing prevalence rates of students receiving Section 504, but not IDEA, accommodations). Nevertheless, the IDEA and Section 504 both require the provision of a “free appropriate public education” to students who meet their separate eligibility criteria.211Courts have held that there are “few differences, if any” between the IDEA’s “free appropriate public education” (“FAPE”) standard and the same standard under Section 504. See Lauren G. ex rel. Scott G. v. W. Chester Area Sch. Dist., 906 F. Supp. 2d. 375, 377 (E.D. Pa. 2012) (quoting Ridgewood Bd. of Educ. v. N.E. ex rel. M.E., 172 F.3d 238, 253 (3d Cir. 1999)). The Ninth Circuit described the “overlapping but different” standards as follows:

In sum, the IDEA contains a statutory FAPE provision and allows private causes of action only for prospective relief. Section 504 contains a broadly-worded prohibition on discrimination against, exclusion of and denial of benefits for disabled individuals, under which the U.S. DOE has promulgated regulations containing a FAPE requirement worded somewhat differently from the IDEA FAPE requirement.

Mark H. v. Lemahieu, 513 F.3d 922, 925, 930 (9th Cir. 2008). Scholars have argued that Section 504’s FAPE standard, while technically different, is equally robust to the IDEA’s FAPE standard. See Walker, supra note 200, at 1598–1603.

Providing a “free appropriate public education” under Section 504 requires the “provision of regular or special education and related aids and services that . . . are designed to meet individual educational needs of handicapped persons as adequately as the needs of nonhandicapped persons are met.”21234 C.F.R. § 104.33(b)(1) (2022). While it is “noncontroversial” that Section 504 and the ADA prohibit certain actions such as “unnecessary segregation, unjustified disparate-impact discrimination, refusal to furnish comparable academic and nonacademic facilities and settings, and failure to provide reasonable accommodation,”213Mark C. Weber, A New Look at Section 504 and the ADA in Special Education Cases, 16 Tex. J. on C.L. & C.R. 1, 10–11 (2010). the nature of services schools must provide in order to meet the needs of students with disabilities “as adequately” as their nonhandicapped peers (and thus provide an “appropriate” education) is a matter of debate.214See id. at 11; Walker, supra note 200, at 1593 (“The final condition—that of the level of accommodation required—is perhaps the most controversial and widely debated Section 504 concept among practitioners, policymakers, and academics.”). If a disagreement occurs as to whether students with disabilities are receiving a free appropriate public education, Section 504 regulations provide for “a system of procedural safeguards that includes notice, an opportunity . . . to examine relevant records, an impartial hearing with opportunity for participation by the person’s parents or guardian and representation by counsel, and a review procedure.”21534 C.F.R. § 104.36 (2022).

With the growing recognition of substance use disorders as complex, “biopsychosocial” conditions that often begin in adolescence, advocates for substance-involved students are better positioned than ever to seek Section 504 accommodations for students with substance use disorders. While this project would break new ground in the education context, a separate area of disability-nondiscrimination doctrine under Section 504 and the ADA can provide an initial (though incomplete) framework for such advocacy—the manners in which qualifying adults with substance use disorders have, for decades, been accommodated in their workplaces.216See Benedict v. Cent. Cath. High Sch., 511 F. Supp. 2d 854, 858 (N.D. Ohio 2007) (“[T]he decisional principles of the disability discrimination in employment cases are analogous to those in education cases, and much of the support for education cases will come from employment cases.”); see also supra Section I.A.2.

2. The Challenge of Substance-Involved Students

As discussed in Section 1.A.2, substance use disorders have long been considered “impairment[s] [that] substantially limit[] one or more major life activities” under Section 504 and the ADA.21734 C.F.R. § 104.3 (2022); 42 U.S.C. § 12102(1)(a); see supra note 44. Advocates for students with substance use disorders can thus stand upon decades of scholarship and case law addressing the recognition and protection of employees with substance use disorders in their workplaces. But the school context presents a challenging issue that is not present in the workplace context: what, if any, obligations are owed to substance-involved students.

The standards for qualifying for Section 504 and ADA protections differ in key ways within the employment and education contexts. In the employment context, “qualified” individuals are only those who, “with or without reasonable accommodation, can perform the essential functions of the employment position that such individual holds or desires.”21842 U.S.C. § 12111(8). Those who are “currently engaging in the illegal use of drugs” are not considered “qualified individual[s] with a disability.”21929 C.F.R. § 1630.3 (2022). In the school context, however, students can only lose their status as a “qualified individual” for Section 504 accommodations in schools by no longer being “of an age during which nonhandicapped persons are provided” public education.22034 C.F.R. § 104.3(l)(2) (2022). Students even maintain their status as a “qualified individual” following disciplinary proceedings for drug infractions that require the student to leave a particular campus. See Benedict, 511 F. Supp. 2d at 859 (finding that, even following a drug-related infraction resulting in a school expulsion, “[t]he parties do not dispute that the identification of [the student’s] specific learning disability qualifies him as a disabled person under section 504 of the Rehabilitation Act [and that the student] . . . is an ‘otherwise qualified’ handicapped person”).

Furthermore, while Section 504’s procedural protections do not preclude schools from issuing “legitimate, non-discriminatory”221Benedict, 511 F. Supp. 2d at 859. consequences for school misbehavior,222See 29 U.S.C. § 705(20)(C)(iv) (“For purposes of programs and activities providing educational services, local educational agencies may take disciplinary action pertaining to the use or possession of illegal drugs or alcohol against any student who is an individual with a disability and who currently is engaging in the illegal use of drugs or in the use of alcohol to the same extent that such disciplinary action is taken against students who are not individuals with disabilities.”). and some students can therefore face expulsion following drug-related offenses, the student’s relationship with the public school system—through an alternative school within their local school district, or perhaps a school program within a treatment or carceral setting—typically continues after such disciplinary measures have been taken.223See U.S. Dep’t of Educ. Off. Civ. Rts, Civil Rights Data Collection, Data Snapshot: School Discipline 2 (2014), https://ocrdata.ed.gov/assets/downloads/CRDC-School-Discipline-Snapshot.pdf [https://perma.cc/B4BQ-D8HY] (finding that 130,000 students, out of 49 million nationwide, were expelled in 2011–2012—the last school year for which national data is available). Put another way, while adults with substance use disorders are not entitled to any particular job (or to employment generally), students with substance use disorders are entitled, and indeed obligated, to attend school in some capacity.224The free, appropriate public education afforded to students with Section 504-recognized disabilities alone provides this entitlement, in addition to the general entitlement to a public education afforded to all children. The obligation to attend school derives from individual states’ truancy laws. Furthermore, every state constitution contains an affirmative right to an education. See Eric Blumenson & Eva S. Nilsen, One Strike and You’re Out? Constitutional Constraints on Zero Tolerance in Public Education, 81 Wash. U. L.Q. 65, 103 n.161 (2003) (cataloging the state constitutional provisions that provide a right to an education).

These distinctions expose a critical gap in special education law. Students with substance use disorders can meet the eligibility criteria of Section 504 and the ADA,225See supra note 206. and, unlike adult employees, students (1) cannot lose their status as a “qualified individual” deserving of Section 504 accommodations on the basis of active substance abuse,226See supra notes 218–20 and accompanying text. and (2) typically remain in an ongoing relationship with the public school system following the discovery of active substance abuse. What, then, are schools’ “free appropriate public education” duties under Section 504 to these students? Specifically, what manner and extent of academic and behavioral supports are legally necessary to provide an appropriate education for students in all stages of substance use disorders?227The analysis regarding the “special education and related services” that could be provided to students with substance use disorders (were such disorders to be recognized under the IDEA) can contribute to this important conversation. See supra Section II.A.1.c.

For now, these queries must be addressed on an individual basis and subjected to review by individuals who may be unaware of, or actively resistant to, the needs of adolescents with substance use disorders.228See supra Section I.A. Until the public school system’s obligations to students with substance use disorders are determined and articulated to schools, obtaining services or accommodations for substance-involved students will be challenging.229The analysis regarding the challenges substance-involved students would face with regards to meeting the IDEA’s “adversely affects educational performance” eligibility prong broadly applies here as well. See supra notes 149–52 and accompanying text. But even if formal Section 504 accommodations are not obtained, there may nevertheless be value in recognizing a substance-involved student as a child with a disability. The simple act of incorporating the vocabulary of a medicalized construct of substance use disorders into schools could have a significant effect on parents, educators, and the students themselves. A meeting convened to discuss a student’s “relapse,” for example, would likely have a different tone, and possibly outcome, than one discussing a student’s continued rule- and law-violations. And ideally all parties would recognize that any mandatory punitive responses on the part of the school will, absent concurrent therapeutic support, almost certainly fail to incentivize the student to cease drug use.230See supra notes 87–91 and accompanying text.

Furthermore, if such district-level drug policies prove to be consistently illogical and counterproductive over time, perhaps district-level policymakers would then be motivated to reform their policies in a manner that acknowledges the complexity of substance use disorders. Section 504’s affirmative obligation for schools to identify disabled students might also impact district-level behavior through increased screening for substance use disorders and timely communication with parents regarding warning signs and symptoms.231See 34 C.F.R. § 104.35(b) (2022) (“[Public schools] shall establish standards and procedures for the evaluation and placement of persons who, because of handicap, need or are believed to need special education or related services . . . .”). In fulfilling their evaluative obligations under Section 504, schools can play an invaluable role in the education of families and implementation of proactive responses to budding substance use disorders. Meeting this requirement might also entail increasing training opportunities for teachers and counselors to identify and initially address evidence of substance abuse.

3. Existing Space to Serve Students with Substance Use Disorders

There is, however, a class of individuals who could immediately obtain formal recognition and accommodations under Section 504: non-using students in recovery from substance use disorders. This population stands to benefit from decades of precedent on the matter without triggering the complex questions raised by substance-involved students.

Though the effort-intensive nature of maintaining remission from substance use disorders arguably justifies a robust provision of “regular or special education and related aids and services”23234 C.F.R. § 104.33 (2022). for students in recovery, there are also several practical—and relatively easy to provide—accommodations that advocates can and should seek for that population. Examples of such accommodations include the coordination of communication between school personnel, parents, and, upon consent, outside treatment providers to ensure that aberrations in students’ academic performance or behavior are addressed quickly and strategically; giving students the opportunity to call their sponsors or therapists during school hours without judgment or consequence; and excusing absences to attend outpatient treatment programs. Schools can also be more sensitive to the scheduling needs of students in recovery and, where possible, provide opportunities to transfer out of classes containing students from whom they should maintain distance.

Such interventions, if proven effective for a particular student, should remain available as long as the student attends school. This argument finds support in the ADA Amendments Act of 2008, which proscribes factoring the “ameliorative effects of mitigating measures” when assessing an individual’s impairment.23342 U.S.C. § 12101; 28 C.F.R. § 36.105 (2022). In other words, the impact of a student’s disability must be assessed insofar as how it would manifest absent any mitigating measures (such as school-based recovery supports).234See 42 U.S.C. § 12101; 28 C.F.R. § 36.105 (2022). Students in recovery are entitled to support under Section 504 and the ADA regardless of the length of their sobriety.

In any event, advocates for students with substance use disorders can and should initiate this conversation by seeking support and protections for this population under Section 504 and the ADA. Reasonable applications of the statutes as they currently stand can make a significant impact upon the lives of students in various stages of substance use disorders, as well as upon the school systems that serve them.

Nevertheless, the primary impact of broadly acknowledging and addressing substance use disorders in schools may lay outside the strict bounds of statutory obligations. Important as specific accommodations are, the greatest value in extending Section 504 protections to students in recovery may be simple recognition: for them to be seen, celebrated, and supported in their schools.

CONCLUSION

Substance use disorders are incredibly challenging to address. Initial instincts, on a personal and policy level, are often to mistake substance use disorders for problems that are seemingly easier to solve, if not to ignore them altogether. It is no surprise, then, that the primary policy framework for serving students with disabilities—the IDEA—fails to acknowledge and address students with substance use disorders. That said, certain students (particularly students in recovery) are entitled to recognition and accommodations under Section 504 and the ADA. Seeing and serving students with substance use disorders would be a complex and controversial project, but such students—like all other students with disabilities—are deserving of support.

 

96 S. Cal. L. Rev. 355

Download

* Judicial Law Clerk, Southern District of Texas. All views expressed are my own. I could not have written this Article without the education, encouragement, and feedback I received from Tara Ford, Bill Koski, Ticien Sassoubre, Rabia Belt, Jasmine Miller, Thomas Butterfoss, and Max Etchemendy. I am also grateful beyond words to my family, my friends, and the recovery school community for their support over the years. Finally, I am indebted to the kind and thoughtful editors of the Southern California Law Review who worked on this article: Daniel Willey, Celine Ang, Jessica Block, and Christopher LoCascio.

Analyzing the Circuit Split Over CDA Section 230(E)(2): Whether State Protections for the Right of Publicity Should be Barred

INTRODUCTION

In 2018, coworkers notified Karen Hepp, a newscaster and co-anchor for the local Fox affiliate’s morning news program Good Day Philadelphia, that a screenshot of her smiling at a hidden security camera taken about fifteen years ago was being used in various online advertisements for erectile dysfunction and dating apps.1Victor Fiorillo, Now It Can Be Told: Karen Hepp Opens Up About Her Battle With Facebook, Phila. Mag. (Sept. 24, 2021), https://www.phillymag.com/news/2021/09/24/karen-hepp-facebook-lawsuit/ [https://perma.cc/QC8Q-DQ7X]; Eriq Gardner, Is a Famous Face a Form of Intellectual Property?, Hollywood Rep. (June 18, 2021, 8:15 PM), https://www.hollywoodreporter.com/
business/business-news/news-anchors-fight-facebook-sag-aftra-1234968110/ [https://perma.cc/R6AN-VUU6].
Hepp was not previously aware that her photo had been taken or that her photo was posted and shared online on platforms such as Facebook, Reddit and Imgur.2Hepp v. Facebook, 465 F.Supp.3d 491, 495 (E.D. Pa. 2020), aff’d in part, 14 F.4th 204 (3d Cir. 2021). Hepp’s photo was used to solicit Facebook users to “meet and chat with single women.”3Id. The photo was also featured on Imgur under the heading “milf,” a derogatory and degrading term that refers to a sexually attractive woman with young children, and a Reddit user under the handle “pepsi_next” posted Hepp’s photo to a Reddit subgroup “r/obsf,” which is a repository for risqué photos of older women.4Id. Though Hepp did not allege that Facebook, Imgur, or Reddit had any role in creating or directly publishing this content, she argued that the platforms’ actions have caused “serious, permanent and irreparable harm” to her reputation brand, and image.5Id. Hepp filed claims against Facebook, Imgur, and Reddit for violations of a Pennsylvania state statute that codifies a right of publicity through causes of action for an unauthorized use of one’s name or likeness and the Pennsylvania common law right of publicity.6Id. at 495–96; see also 42 Pa. Cons. Stat. § 8316 (2022).

Because there is no federal law protecting a right of publicity, states that have adopted the right of publicity have done so by statute, judicial decision, or both.7Dustin Marlon, Unmasking the Right of Publicity, 71 Hastings L.J. 419, 426 (2020). The right of publicity is the right to control the commercial use and value of one’s persona, but the right significantly varies from state to state. Generally, a claim “requires three elements to be actionable: (1) the use of an individual’s persona; (2) for commercial purposes; and (3) without plaintiff’s consent.”8Id. The Pennsylvania statute creates a cause of action for “any natural person whose name or likeness has commercial value and is used for any commercial or advertising purpose without the written consent of such natural person.”942 Pa. Cons. Stat. § 8316(a) (2022). In Hepp v. Facebook, Facebook, Reddit, and Imgur filed a motion to dismiss the suit under Federal Rule of Civil Procedure 12(b)(6), and the District Court for the Eastern District of Pennsylvania granted the motion, holding that Hepp’s statutory and common law right of publicity claims were barred by the section 230(c) of the Communications Decency Act (“CDA”).10Hepp, 465 F.Supp.3d at 496, 501.

Section 230 of the CDA states, “[n]o provider or user of an interactive computer service shall be treated as the publisher or speaker of any information provided by another information content provider,” and it expressly preempts any state law to the contrary.1147 U.S.C. § 230(c)(1), (e)(3). Essentially, this means interactive service providers are generally immune from liability related to content posted or shared by third parties. Therefore, section 230 “creates a federal immunity to any cause of action that would make service providers liable for information originating with a third-party user of the service.”12Zeran v. Am. Online, Inc., 129 F.3d 327, 330 (4th Cir. 1997); see also Green v. Am. Online, 318 F.3d 465, 470–71 (3d Cir. 2003). However, there are some exceptions to this immunity, including causes of action under “any law pertaining to intellectual property.”1347 U.S.C. § 230(e)(2).

In granting the motion to dismiss, the district court in Hepp followed the Ninth Circuit’s interpretation of section 230(e)(2) in holding that only federal intellectual property claims are excluded from the scope of CDA section 230 immunity, so state rights of publicity claims are barred by section 230(c).14Hepp, 465 F. Supp. 3d. at 501; see Perfect 10, Inc. v. CCBill LLC, 488 F.3d 1102, 1118–19 (9th Cir. 2007). Hepp argued that the District Court for the Eastern District of Pennsylvania should instead follow other district courts in New Hampshire and New York in holding that section 230(e)(2) includes state intellectual property claims, such as a state right of publicity claim.15      Hepp, 465 F. Supp. 3d. at 497; see Doe v. Friendfinder Network, Inc., 540 F. Supp. 2d 288, 302 (D.N.H. 2008) (holding that the CDA did not preempt plaintiff’s right of publicity claim); Atl. Recording Corp. v. Project Playlist, Inc., 603 F. Supp. 2d 690, 704 (S.D.N.Y. 2009) (“Section 230(c)(1) does not provide immunity for either federal or state intellectual property claims.”). Hepp subsequently appealed her case to the Third Circuit, which held that it would “adhere to the most natural reading of section 230(e)(2)’s text” so that “a state law can be a ‘law pertaining to intellectual property,’ too.”16Hepp v. Facebook, 14 F.4th 204, 209–12 (3d Cir. 2021). This created a circuit split between the Ninth and Third Circuits with regard to subsection (e)(2).

The argument to limit interpretation of the intellectual property exemption in section 230(e)(2) to federal law is most strongly supported by the broader congressional intent to create a broad liability shield for interactive computer service users and providers in enacting section 230. Federal laws are well-established with clear scopes of application, but state laws protecting intellectual property are far from uniform, so state laws are much less predictable and could lead to significantly different outcomes between jurisdictions. State laws may cover different causes of action rooted in different legal theories, have varying purposes and policy goals, and provide for different remedies, all assuming that a state legislature has decided to enact a law at all. Consequently, if varying state laws are exempt from section 230 immunity, the integrity of section 230 would be severely compromised. Policy-wise, the Ninth Circuit’s interpretation to exclude state intellectual property laws could very likely prevent individuals harmed by infringing content from having any redress as third-party users who typically post such content are generally very difficult to identify. On the other hand, we must also defer to Congress’s express statutory language and assume that the literal language of the statute accurately reflects Congress’s intent. Therefore, because subsection (e)(2) states that “any law pertaining to intellectual property” is not included within section 230 immunity, and Congress explicitly named federal and state law multiple times within subsection (e), we must conclude that Congress did not actually intend to limit the intellectual property exemption, which should apply its most literal meaning.

There is also a potential argument that the right of publicity is not even an intellectual property right at all, in which case the exception to immunity for claims related to intellectual property laws would not even apply. Furthermore, there has been intense controversy over section 230 coming from both sides of the political aisle, so the statute may be amended or even repealed entirely in the relatively near future. Also, the entire landscape of the internet and the public’s relationship with the internet have been shaped by section 230, so changes to the law or differing applications of the law could compromise our understanding of the right and the internet.17See Casey Newton, Everything You Need to Know About Section 230, Verge (Dec. 29, 2020, 1:50 PM), https://www.theverge.com/21273768/section-230-explained-internet-speech-law-definition-guide-free-moderation [https://perma.cc/LUD9-9RZR]. However, the current proposed changes to section 230 do not directly address the issue over how subsection (e)(2) should be interpreted with regard to state intellectual property laws, so the circuit split described in this Note will likely still be relevant.18Meghan Anand, Kiran Jeevanjee, Daniel Johnson, Quinta Jurecic, Brian Lim, Irene Ly, Matt Perault, Etta Reed, Jenna Ruddock, Tim Schmeling, Niharika Vattikonda, Brady Worthington, Noelle Wilson & Joyce Zhou, All the Ways Congress Wants to Change Section 230, Slate (Mar. 23, 2021,
5:45 AM), https://slate.com/technology/2021/03/section-230-reform-legislative-tracker.html [https:// perma.cc/2wff-mtxm].

In the absence of federal protection, whether claims regarding the right of publicity are actionable is increasingly important with the growth and expansion of the internet, particularly social media. The rise of “influencers,” individuals who essentially monetize and make a career out of their personas and relationships with branded or commercial content, and the rise of “deep fakes,” which utilize technology to synthesize fake pictures or videos that convincingly appear to depict specific individuals or celebrities, are deeply linked to the interest in one’s self, which is protected by the right of publicity. Though lawsuits over right of publicity have historically been brought almost exclusively by celebrities, the age of social media has created many more opportunities for members of the general public to have a commercial interest in their name, image, likeness, or persona. Social media has proven to be extremely lucrative, and content can generate significant value from a business standpoint. Companies and internet platforms will presumably adapt their own policies regarding allegedly infringing content depending on the prevailing interpretation of section 230(e)(2) in order to avoid as much liability as possible, which would create consequences for the millions of users that access those sites and platforms every day.

Part I of this Note will provide background and context to the right of publicity and how it developed in common law to provide a remedy
for individuals, typically celebrities, whose likenesses have been misappropriated without their consent. I will analyze the right of publicity as codified in individual state statutes with an emphasis on how these often significantly different statutes create unpredictability in enforcement and litigation. I will also distinguish the right of publicity from causes of action regarding copyright and compare the Digital Millennium Copyright Act (“DMCA”) to section 230 of the CDA. Part II will provide an overview of section 230 and discuss legislative history and intent. Within Part II, I will also examine both sides of the circuit split from the Ninth and Third Circuits’ contrary interpretations of CDA section 230(e)(2) and each side’s underlying tradeoffs involving the lack of uniformity in state intellectual property laws and the potential effects of precluding claims from such laws as compared to potentially refraining from expanding beyond the congressional purpose of CDA section 230. Finally, I will address the arguments over whether section 230 should still exist in its current form and proposed reforms.

I. THE RIGHT OF PUBLICITY

Hepp sued Facebook, Reddit, and Imgur for allegedly violating her right of publicity as granted by Pennsylvania state statute and common law. The right of publicity is defined as “the inherent right of every human being to control the commercial use of his or her identity.”19J. Thomas McCarthy, The Rights of Publicity and Privacy § 1:3 (2d ed. 2009). The right of publicity is generally regarded as an intellectual property right, though interactive service providers such as the defendants in Hepp have argued to the contrary. The right of publicity allows all individuals, celebrity or not, to recover for unpermitted uses of their likeness or persona for commercial gain.20Id. §§ 1:3, 4:16. The right is valuable in that it provides individuals the opportunity to protect the commercial use of their identities as many people, especially celebrities, generate significant income by authorizing others to use their identities in exchange for payment.21Dylan M. Spaduzzi, Note, Publicity Enemy Number One: Federal Immunity for a Virtual World, 40 U. Mem. L. Rev. 603, 612 (2010).

 A. Origins and Development

The right of publicity originally developed as the other side of the coin of the laws and theories surrounding the right of privacy.22McCarthy, supra note 19, at §§ 1:3–4. Samuel D. Warren and Louis Brandeis first recognized the right of privacy as a right potentially rooted in common law in a law review article in 1890.23Samuel D. Warren & Louis D. Brandeis, The Right to Privacy, 4 Harv. L. Rev. 193, 194–197 (1890). The right of privacy was designed to protect people from uninvited public attention and create a cause of action for people who suffered emotional harm from unwanted publicity.24William L. Prosser, Privacy, 48 Cal. L. Rev. 383, 384 (1960). However, because the right to privacy is generally a right “to be let alone,”25Id. at 389. there was difficulty in enforcing and applying this right to cases involving individuals who were already in the public spotlight. Some courts held that individuals who sought out publicity through their career choices must waive any right of privacy, while others concluded that while celebrities may be the subject of news stories, they maintain a right of privacy that allows them a shield from unwanted, non-newsworthy publicity.26Compare O’Brien v. Pabst Sales Co., 124 F.2d 167, 170 (5th Cir. 1941) (holding that a football player who sought out publicity had no right of privacy to allow him to recover for an unauthorized publication of his name and likeness in a brewing company’s football calendar), and Martin v. F.I.Y. Theatre Co., 10 Ohio Op. 338, 338–39 (Cuyahoga Cnty. Ct. C.P. 1938) (holding that an individual’s choice to pursue acting as a career and her pursuit for publicity deprived her of a right of privacy), with Birmingham Broadcasting Co. v. Bell, 68 So. 2d 314, 319 (Ala. 1953) (holding that a picture may be published for legitimate news-related purposes but not for commercial purposes), and Wilk v. Andrea Radio Corp., 200 N.Y.S.2d 522, 524 (Sup. Ct. 1960) (concluding that waiver applies only to newsworthy stories and not to advertising), modified on other grounds, 216 N.Y.S.2d 662 (App. Div. 1961). Therefore, the term “right of publicity” was first coined by Judge Jerome Frank in Haelan Laboratories, Inc. v. Topps Chewing Gum, Inc. in order to address “the economic potential of a celebrity’s identity.”27Linda J. Stack, White v. Samsung Electronics America, Inc.’s Expansion of the Right of Publicity: Enriching Celebrities at the Expense of Free Speech, 89 Nw. U.L. Rev. 1189, 1193 (1995); Haelan Lab’ys, Inc. v. Topps Chewing Gum, Inc., 202 F.2d 866, 868 (2d Cir. 1953).

In Haelan, a chewing-gum manufacturer made a contract with a baseball player for the exclusive right to use the player’s photograph in connection with sales of its gum.28Haelan, 202 F.2d at 867. However, a rival chewing gum manufacturer knew of the contract and deliberately induced the player to authorize the rival manufacturer to use the player’s photograph in connection with the sales of the rival’s gum.29Id. The Second Circuit held that an individual has a right in the publicity value of their photograph—that is, the right to grant the exclusive privilege of publishing their picture—and that this right, which is particularly relevant for prominent figures in the public eye, “might be called a ‘right of publicity.’ ”30Id. at 868. Haelan distinguished itself from prior case law due to the opinion’s emphasis on the economic interest at stake as the plaintiffs, rather than not wanting their photographs to be withheld from public viewing entirely, simply did not want their photographs to be sold for profit by third parties.31Id. The right of publicity is significantly distinct from the right of privacy because the right of privacy is not assignable, and as such, the two rights are independent from each other.32Id. at 867.

Though the right of publicity has clear roots in privacy rights, Professor Melville Nimmer associated the right of publicity with unfair competition and property law.33Melville B. Nimmer, The Right of Publicity, 19 Law & Contemp. Probs. 203, 203–04 (1954) (“[The right of privacy] is not adequate to meet the demands of the second half of the twentieth Century . . . . Public personality has found that the use of his name, photograph, and likeness has taken on a pecuniary value undreamt of at the turn of the century.”). Nimmer stated an individual is entitled to “the fruit of his labors unless there are important countervailing public policy considerations . . . [and] persons who have long and laboriously nurtured the fruit of publicity values may be deprived of them, unless judicial recognition is given to what is here referred to as the right of publicity.”34Id. at 215–16. Thus, Nimmer linked the right of publicity to the commercial aspects of a public figure’s “personality.”35Id. In contrast, in 1960, William Prosser advocated for the recognition of the tort of privacy appropriation and suggested that an individual could have the right to control the use of their identity from the appropriation of others.36Prosser, supra note 24, at 389. Unlike Nimmer’s analysis of the right of publicity, the tort of misappropriation is not a property right. This analysis lends support to the minority view that the right of publicity should not be considered an intellectual property right.

B. Federal Influences

In 1977, the U.S. Supreme Court recognized the right of publicity in Zacchini v. Scripps-Howard Broadcasting Co.,37Zacchini v. Scripps-Howard Broad. Co., 433 U.S. 562, 573–78 (1977). the first and only Supreme Court case to address the right of publicity. In Zacchini, an entertainer performed a fifteen-second human cannonball act and sued a local television station after the station taped and broadcast the entire act on the news without the entertainer’s permission.38Id. at 563–64. The Court considered whether the station was immunized from damages by the First and Fourteenth Amendments of the Constitution and ultimately held that Ohio could grant a state law remedy against the station or give immunity to the press but was not required to do so either by the First and Fourteenth Amendments.39See id. The rationale behind protecting the right of publicity, according to the Court, was simple: to prevent unjust enrichment by the theft of good will as broadcasting an entertainer’s entire act “poses a substantial threat to the economic value of that performance.”40Id. at 575. The Court clearly emphasized the proprietary interest of the individual and compared the purpose of the right of publicity to the economic philosophy behind granting patent and copyright ownership, which is to encourage individuals to produce inventions and creative works in order to foster innovation and benefit the public.41See Andrew Beckerman-Rodau, Toward a Limited Right of Publicity: An Argument for the Convergence of the Right of Publicity, Unfair Competition and Trademark Law, 23 Fordham Intell. Prop. Media & Ent. L.J. 132, 151 (2012); U.S. Const. art. 1, § 8, cl. 8.

Zacchini, as the only Supreme Court case on the subject, provided some federal influence on the right of publicity, and another source of federal influence is section 43(a)(1) of the Lanham Act, the federal unfair competition act.42Stack, supra note 27, at 1196–97; 15 U.S.C. § 1125 (2021). Rights of publicity cases also regularly implicate the Lanham Act because unauthorized appropriations of celebrities’ identities often involve issues of confusion over sponsorship. Congress amended the Lanham Act in 1988 to codify judicial decisions that had interpreted the Act to allow for false endorsement claims, and the amendment provides celebrities with a clearer statutory foundation for alleging the applicability of the Lanham Act in right of publicity actions.43Stack, supra note 27, at 1197; S. Rep. No. 515, 100th Cong., 2d Sess., at 40, reprinted in 1988 U.S.C.C.A.N. 5577, 5603. The relevant section of the Act now reads:

Any person who, on or in connection with any goods or services . . . uses in commerce any word, term, name, symbol, or device, or any combination thereof, or any false designation of origin, false or misleading description of fact, or false or misleading representation of fact, which—(A) is likely to cause confusion, or to cause mistake, or to deceive as to the affiliation, connection, or association of such person with another person, or as to the origin, sponsorship, or approval of his or her goods, services, or commercial activities by another person . . . shall be liable in a civil action by any person who believes that he or she is likely to be damaged by such act.

15 U.S.C. § 1125(a) (1988 & Supp. IV 1992). The Lanham Act is also important as a federal law because the CDA has a preemption clause that preserves federal claims of false endorsement, so hypothetical plaintiffs could potentially still have an avenue for redress under this federal law even if causes of action under the state right of publicity are barred. Thus, in order to attempt to avoid being barred, some types of cases are now more likely to also be rooted in the Lanham Act for the nationwide coverage, clear remedy, and wide scope of damages.44Stack, supra note 27, at 1197; see also infra, Section II.B (discussing White v. Samsung Elecs. Am. Inc., 971 F.2d 1395, 1399–1401 (9th Cir. 1992) and the difference between confusion-based and association-based relationships).

C. Incongruence Among States

Currently, twenty-five states have passed statutory protections for the right of publicity, and many other states have held that their respective common law would protect the right.45Jennifer E. Rothman, Rothman’s Roadmap to the Right of Publicity, Right of Publicity State-by-State, Univ. of Pa. L. Sch., https://rightofpublicityroadmap.com/ [https://perma.cc/4NZS-UJZF]; see Ala. Code §§ 6-5-771, 6-5-772 (2022); Ariz. Rev. Stat. §§ 12-761, 13-3726 (2022); Ark. Code Ann. §§ 4-75-1101 to 4-75-1113 (2022); Cal. Civ. Code § 3344 (West 2022); Fla. Stat. Ann. § 540.08 (2022); Haw. Rev. Stat. Ann. §§ 482P-1 to 482P-8 (2022); 765 Ill. Comp. Stat. Ann. §§ 1075/1–1075/60 (2022); Ind. Code Ann. §§ 32-36-1-1 to 32-36-1-20 (2022); Ky. Rev. Stat. Ann. § 391.170 (West 2022); La. Rev. Stat. Ann. § 14:102.21 (2022); Mass. Ann. Laws ch. 214, § 3A (2022); Neb. Rev. Stat. Ann. §§ 20-201 to 20-211 (2022); Nev. Rev. Stat. Ann. §§ 597.770–597.810 (2022); N.Y. Civ. Rights Law §§ 50–51 (McKinney 2022); Ohio Rev. Code §§ 2741.01–2741.99 (West 2022); Okla. Stat. tit. 12 §§ 1448–1449 (2022); 42 Pa. Cons. Stat. § 8316 (2022); 9 R.I. Gen. Laws § 9-1-28 (2022); S.D. Codified Laws §§ 21-64-1 to 21-64-12 (2022); Tenn. Code Ann. §§ 47-25-1101 to 47-25-1108 (2022); Tex. Prop. Code Ann. §§ 26.001–26.015 (West 2021); Utah Code Ann. §§ 45-3-1 to 45-3-6, 76-9-407 (LexisNexis 2022); Va. Code Ann. §§ 8.01-40, 18.2-216.1 (2022); Wash. Rev. Code Ann. §§ 63.60.010–63.60.080 (2022); Wis. Stat. § 995.50 (2022). Of the courts that have directly addressed the issue, some states have recognized a distinct right of publicity and distinguished it from a right from misappropriation while others treat the right of publicity as synonymous with the tort of appropriation as expressed in the Restatement (Second) of Torts.46See, e.g., N.Y. Civ. Rights Law § 51 (McKinney 2022). Compare Allison v. Vintage Sports Plaques, 136 F.3d 1443, 1446–57 (11th Cir. 1998), with Jackson v. Roberts (In re Jackson), 972 F.3d 25, 38 (2d Cir. 2020), and Restatement (Second) of Torts § 652C (Am. L. Inst. 1977). As previously stated, Prosser wrote about the right to privacy in 1960 and argued it is composed of four subparts, one of which being the right to protection against misappropriation of one’s likeness.47Prosser, supra note 24, at 389. Later, the American Law Institute adopted these four subcategories in the Restatement (Second) of Torts.48Restatement (Second) of Torts § 652(B)–(E) (Am. L. Inst. 1977). The tort of misappropriation protects the “interest of the individual in the exclusive use of his own identity, in so far as it is represented by his name or likeness, and in so far as the use may be of benefit to him or to others.”49Restatement (Second) of Torts § 652C cmt. a (Am. L. Inst. 1977). Though this right is in the nature of a property right, misappropriation is rooted in the common form of invasion of privacy.50Id. § 652C cmts. a–b.

The scope of the right of publicity varies from state to state, so the extent that a right of publicity might be protected greatly depends on the state in which a person is attempting to assert that right, as well as the state in which that person is domiciled. For example, individuals in most states, such as California,51See, e.g., Cal. Civ. Code § 3344 (West 2022). can assert their right of publicity during their lifetimes, while individuals in Texas cannot, as their statutory right is protected only post-mortem.52Tex. Prop. Code Ann. § 26.002 (West 2022). Further, there is a wide range of duration periods within state statutes that provide for the right of publicity to survive after an individual’s death.53See Ark. Code Ann. § 4-75-1107 (2022) (50 years); Fla. Stat. § 540.08 (2022) (40 years); Ind. Code § 32-36-1-8 (2022) (100 years); 42 Pa. Cons. Stat. § 8316 (2022) (30 years); S.D. Codified Laws § 21-64-2 (2022) (70 years); Tenn. Code Ann. § 47-25-1104 (2022) (10 years); VA. Code Ann. § 8.01-40 (2022) (20 years). Moreover, some state statutes only protect the right of publicity for certain types of people, such as soldiers54Ariz. Rev. Stat. §§ 12-761, 13-3716 (2022); La. Rev. Stat. Ann. § 14:102.21 (2022). or “public figures.”55     Ky. Rev. Stat. § 391.170 (West 2022). Perhaps most notably, what exactly is protected under the right of publicity varies greatly among states. For example, in Virginia, the statutory right of publicity is limited to a person’s name, portrait, or picture,56Va. Code Ann. § 8.01-40 (2022). whereas in New York, the right protects a person’s “name, portrait, picture or voice”57N.Y. Civ. Rights Law §§ 50, 51 (McKinney 2022). but does not extend to that person’s likeness. In contrast, the California statute is much broader and protects against unauthorized use of an individual’s “name, voice, signature, photograph, or likeness, in any manner.”58Cal. Civ. Code § 3344 (West 2022). Even further, Indiana’s statute grants a property interest in an individual’s name, voice, signature, photograph, image, likeness, distinctive appearance, gestures, or mannerisms.59Ind. Code § 2-36-1-6 (2022).

Significantly, there is currently no federal statute to protect the right of publicity or otherwise provide a uniform approach to the right. Due to the reach of social media and other technological advancements, and the fact that those most likely to assert their right of publicity are celebrities and public figures whose identities could be recognized across the entire country, litigation is unpredictable.60Brittany A. Adkins, Comment, Crying Out for Uniformity: Eliminating State Inconsistencies in Right of Publicity Protection Through a Uniform Right of Publicity Act, 40 Cumb. L. Rev. 499, 501–02 (2009–2010). Generally speaking, a right of publicity claim involves (1) the use of an individual’s “persona,” (2) for commercial purposes, and (3) without the individual’s consent.61Marlin, supra note 7, at 426. For the purposes of this Note and to determine whether a state’s right of publicity statute might fit into the carve-out of CDA section 230 immunity, I will largely limit the discussion to statutory protections and judicial applications of such protections.

D. Application

Each state’s statutory differences in turn lead to very different judicial outcomes in application that do not necessarily strictly adhere to the statutory language. In Midler v. Ford Motor Co., for example, actress and singer Bette Midler sued for an alleged violation of her right of publicity when Ford and its advertising agency used a sound-alike of Midler, but neither her name nor her picture, in a television commercial.62Midler v. Ford Motor Co., 849 F.2d 460, 461–62 (9th Cir. 1988). Though the advertising agency had properly licensed Midler’s song from the copyright holder, the sound-alike was directed to “sound as much as possible like the Bette Midler record” after Midler herself refused the gig.63Id. at 461. Thus, the only issue in the case was whether Midler’s voice was protected.64Id. The lower court granted summary judgment in favor of the agency due to the fact that although California’s statute would have protected Midler’s voice if it were used without her consent, the audio in the commercial was not actually Midler’s voice.65Id. at 462–63. Ultimately, the Ninth Circuit reversed and held that “to impersonate her voice is to pirate her identity,” so the defendants committed a tort of misappropriation by intentionally seeking an attribute of Midler’s identity, valued at what the market would have paid for Midler to have actually sung the commercial.66Id. at 463. However, there was no statutory violation of Midler’s right of publicity as the term “likeness” refers to a visual image rather than a vocal imitation.67Id.

Similarly, in White v. Samsung Electronics, plaintiff Vanna White, the co-host of Wheel of Fortune—“one of the most popular game shows in television history” to which an estimated forty million people tune in daily— sued after Samsung ran an advertisement without consent from or payment to White.68White v. Samsung Elecs. Am., Inc., 971 F.2d 1395, 1396 (9th Cir. 1992). Samsung referred to the advertisement as the “Vanna White” advertisement, which depicted a robot outfitted to specifically resemble White in her famed stance next to the “instantly recognizable” Wheel of Fortune game show set.69Id. White argued that the advertisement used her “likeness” in violation of section 3344 of the California Civil Code, but because the advertisement featured a robot with mechanical features and not White’s “precise features,” the Ninth Circuit held that the robot did not constitute White’s “likeness” within the statutory meaning and affirmed the dismissal of White’s claim.70Id. at 1397. However, the common law right of publicity has a broader umbrella of protection as it “does not require that appropriations of identity be accomplished through particular means to be actionable,” and in this case, the aspects of the advertisement leave “little doubt about the celebrity the ad is meant to depict,” so the district court erred in rejecting White’s common law right of publicity claim on summary judgment.71Id. at 1398–99.

Furthermore, White also brought a claim under the Lanham Act, for which she was required to show that the defendants created a likelihood of confusion as to whether White was endorsing the products in the advertisement.72Id. at 1399–1400. The Ninth Circuit applied an eight-factor test from the trademark case AMF Inc. v. Sleekcraft Boats.73Id. at 1400. The eight factors are as follows: “(1) strength of the plaintiff’s mark; (2) relatedness of the goods; (3) similarity of the marks; (4) evidence of actual confusion; (5) marketing channels used; (6) likely degree of purchaser care; (7) defendant’s intent in selecting the mark; [and] (8) likelihood of expansion of the product lines.”74Id. Based on the evidence White provided, the first, second, fifth, sixth, and seventh factors supported finding that there was a likelihood of confusion.75Id. at 1400–01. The Ninth Circuit found that a jury could reasonably conclude that there was an underlying intent to persuade consumers that White was endorsing the products, so White properly raised a genuine issue of material fact and the lower court erred in rejecting her claim at the summary judgment stage.76Id. at 1401. Thus, even if state right of publicity claims are barred by section 230, a potential plaintiff may be able to assert a similar but distinct claim.

E. Distinguishing the Right of Publicity from Causes of Action for Copyright Infringement

Copyright law and the right of publicity, though seemingly similar, are very different rights that are rooted in different textual and theoretical foundations, especially regarding copyright law’s constitutional basis.77Adkins, supra note 60, at 539–40. The federal Copyright Act grants authors of original works the exclusive rights to reproduce, distribute, display, and perform their work.78See 17 U.S.C. §§ 102, 106. Copyright law also gives rights to the public, such as the right to use ideas and the right to resell lawfully purchased works.79See 17 U.S.C. §§ 102(b), 109(a)–(c). In contrast, as previously stated in this section, the right of publicity is protected by state statutes and common law and allows an individual to recover for unauthorized use of a person’s name or likeness for a commercial purpose. The right of publicity is most often asserted by celebrities, but most state statutes grant all individuals this right. There is also “a critical distinction between a commercial transaction for a photograph, itself, and a commercial transaction where a photograph is used to promote or sell another product or service.”80Scott J. Sholder, Copyright Trumps Right of Publicity—Permitting Display and Download of Basketball Photographs (Maloney v. T3Media, Inc.), Cowan, DeBaets, Abrahams & Sheppard LLP (April 8, 2015), https://cdas.com/right-of-publicity/ [https://perma.cc/5EX6-CHW3].

Jennifer Rothman, a leading scholar on the right of publicity, has argued that though copyright and the right of publicity both strive to protect creative artists and to incentivize them to create works, the two rights seriously conflict.81Jennifer E. Rothman, Copyright Preemption and the Right of Publicity, 36 U.C. Davis L. Rev. 199, 204 (2002). Rothman argued that the right of publicity “conflicts not only with explicit provisions of the Copyright Act, but also with the implicit grant of affirmative rights to copyright holders and the public,” particularly because the right of publicity has grown to cover “persona,” so the scope of the right has expanded beyond just an individual’s name or likeness.82Id. at 204–205. One’s “persona” could be implicated in a use where a viewing audience is simply reminded of the person even when neither the person’s name nor likeness is used.83Id. at 205.

Section 301 of the Copyright Act sets out a test to determine whether copyright law preempts a state law claim, such as a right of publicity claim: the content of the protected right must fall within the subject matter of copyright as specified by sections 102 and 103 of the Copyright Act, and the right asserted under state law must be “equivalent to any of the exclusive rights within the general scope of copyright” as specified by section 106 of the Copyright Act.8417 U.S.C. § 301(a). Therefore, while causes of action under a state’s right of publicity can be brought concurrently with a cause of action for copyright infringement, copyright law does not necessarily preempt right of publicity claims. Though an argument that the state-based right of publicity is preempted by federal copyright law exists, most judicial decisions have rejected it.85McCarthy, supra note 19, § 11:50 (noting the majority rule is that federal copyright law does not preempt state-based right of publicity); see also Rothman, supra note 81, at 225–26 (noting that few courts have found right of publicity preempted by copyright law). The Sixth and Ninth Circuits, as well as some district courts, have concluded that section 301, the Copyright Act’s explicit preemption clause, never preempts the right of publicity because the right of publicity is generally not equivalent to the rights protected by the Copyright Act.86Rothman, supra note 81, at 225–29; see, e.g., Downing v. Abercrombie & Fitch, 265 F.3d 994, 1003–05 (9th Cir. 2001); Landham v. Lewis Galoob Toys, Inc., 227 F.3d 619, 623–24 (6th Cir. 2000); Wendt v. Host Int’l, 125 F.3d 806, 809 (9th Cir. 1997); Bi-Rite Enters., Inc. v. Button Master, 555 F. Supp. 1188, 1201 (S.D.N.Y. 1983); see also 1 Melville B. Nimmer & David Nimmer, Nimmer on Copyright § 1.13[B] (2021) (stating that courts should primarily focus on a 301 analysis of preemption). While there is no categorical preemption of right of publicity claims, there have been individual cases in which the right was preempted; for example, in Fleet v. CBS, Inc., an actor in a movie attempted to use his right of publicity to thwart a copyright owner from exploiting its property.87Fleet v. CBS, Inc., 58 Cal. Rptr. 2d 645, 646 (Ct. App. 1996); Nimmer & Nimmer, supra note 86, at § 1.17[A]. In Fleet, because the individuals only sought to block CBS from reproducing and distributing their performances in a film, their claims were preempted by federal copyright law since the film came within the subject matter of copyright protection and their claim was equivalent to an exclusive right within the general scope of copyright.88Fleet, 58 Cal. Rptr. 2d at 646, 650–51.

Many rights of publicity cases also involve causes of action for copyright infringement under the DMCA,89Digital Millennium Copyright Act of 1998, 112 Stat. 2860, Pub. L. 105-304. which is analogous to the CDA in that the DMCA immunizes providers from some lawsuits involving third-party content. In passing the DMCA, Congress “sought to provide a safe harbor against copyright liability for the normal operations of online service providers.”90Ryan Gerdes, Scaling Back Section 230 Immunity: Why the Communications Decency Act Should Take a Page from the Digital Millennium Copyright Act’s Service Provider Immunity Playbook, 60 Drake L. Rev. 653, 668 (2012). The DMCA established certain safe harbors to “provide protection from liability for: (1) transitory digital network communications; (2) system caching; (3) information residing on systems or networks at the direction of users; and (4) information location tools.”91Ellison v. Robertson, 357 F.3d 1072, 1076–77 (9th Cir. 2004) (citing 17 U.S.C. § 512(a)–(d)). The DMCA provides “safe harbors” to covered providers who remove content after being notified that the content may violate federal copyright law.92See generally Kevin J. Hickey, Cong. Rsch. Serv., IF11478, Digital Millennium Copyright Act (DMCA) Safe Harbor Provisions for Online Service Providers: A Legal Overview (2020), https://crsreports.congress.gov/product/pdf/IF/IF11478 [https://web.archive.org/web/
20220417000239/https://crsreports.congress.gov/product/pdf/IF/IF11478]; U.S. Copyright Off., Section 512 of Title 17: A Report of the Register of Copyrights (2020), https://
http://www.copyright.gov/policy/section512/section-512-full-report.pdf [https://perma.cc/2ZC6-B9W5].
To be protected from lawsuits premised on hosting potentially infringing content, the DMCA requires the person notifying a service provider of copyright infringement to submit a statement “under penalty of perjury identifying the allegedly infringing material and providing a good-faith assertion that the use of the material is unlawful.”93Valerie C. Brannon & Eric N. Holmes, Cong. Rsch. Serv., R46751, Section 230: An Overview 32 (2021), https://crsreports.congress.gov/product/pdf/R/R46751 [https://web.archive.org/
web/20230322175737/https://crsreports.congress.gov/product/pdf/R/R46751]; 17 U.S.C. § 512(c)(3).
Then, the provider hosting the allegedly infringing content must decide whether to accept the notice and remove the material or ignore the notice and risk liability.94Brannon & Holmes, supra note 93, at 32–33. The DMCA both incentivizes the provider to take down the material by granting immunity to providers that do so, which creates a risk that providers will take down lawful material in order to avoid liability, as well as provides a process for the user who posted the allegedly infringing content to challenge the initial notice, in which case the provider may be able to replace the initial post and retain immunity if there is sufficient “counter notification.”95Id. at 33; 17 U.S.C. § 512(g)(1)–(4); see also, e.g., Wendy Seltzer, Free Speech Unmoored in Copyright’s Safe Harbor: Chilling Effects of the DMCA on the First Amendment, 24 Harv. J. L. & Tech. 171, 175 (2010) (discussing the incentive structure and arguing that the DMCA results in removal of constitutionally protected speech).

The most significant difference between the CDA and the DMCA is the DMCA’s requirement that the provider lack knowledge of the infringing material to be protected. Section 230 of the CDA immunizes providers for hosting both lawful and unlawful third-party content regardless of whether the provider has notice of allegedly unlawful user-generated content.96Brannon & Holmes, supra note 93, at 33 (citing Barrett v. Rosenthal, 146 P.3d 510, 520 (Cal. 2006) (comparing the DMCA’s “limited liability” scheme to section 230 and concluding “that Congress did not intend to permit notice liability under the CDA”)). Nonetheless, section 230 immunity contains exceptions allowing liability for hosting certain types of unlawful content, including if a site violates federal criminal law. For example, Perfect 10, Inc. v. CCBill LLC, in which the owner of a subscription website for adult entertainment alleged interactive service providers violated copyright and right of publicity laws among others by providing services to websites that posted stolen images, involved both the safe harbors from DMCA and the question of whether a claim under the right of publicity was barred by section 230 of the CDA.97Perfect 10, Inc. v. CCBill LLC, 488 F.3d 1102, 1108 (9th Cir. 2007). Because the DMCA sets a significantly higher threshold for providers to qualify for immunity, the Ninth Circuit first analyzed whether the providers met the threshold conditions set out in section 512(i) and then determined whether the providers could qualify for any of the safe harbors established in subsections (a) through (d).98Id. at 1109–18. Ultimately, the provider was not eligible for immunity because it was not enforcing its DMCA policy.99Id. at 1120–21. The district court stated that “the DMCA’s protection of an innocent service provider disappears at the moment the service provider loses its innocence, i.e., at the moment it becomes aware that a third party is using its system to infringe.”100Perfect 10, Inc. v. CCBill, LLC, 340 F. Supp. 2d 1077, 1086 (C.D. Cal. 2004).

In contrast to the complex and thorough examination of the requirements set forth by the DMCA to be shielded from liability, the question of whether the right of publicity claim was barred by section 230 of the CDA was quickly and succinctly handled on its face as the Ninth Circuit held the claim did not fit within the intellectual property carve-out.101Perfect 10, 488 F.3d at 1118–19; see infra Section II.B. Perfect 10 clearly illustrates some of the significant differences in the liability shields granted through the DMCA and the CDA. Though the statutes are entirely distinct from one another, scholars have advocated for CDA reform, in part because as it is, section 230 allows interactive service providers to avoid liability even if they are aware of and profit from illegal content, so long as the provider itself is not the author of the material.102See John E.D. Larkin, Criminal and Civil Liability for User Generated Content: Craigslist, a Case Study, 15 J. Tech. L. & Pol’y 85, 105 (2010) (discussing Lerman v. Flynt Distrib. Co., 745 F.2d 123, 139 (2d Cir. 1984)); Gregory M. Dickinson, An Interpretive Framework for Narrower Immunity Under Section 230 of the Communications Decency Act, 33 Harv. J.L. & Pub. Pol’y 863, 868 (2010).

II. SECTION 230 OF THE COMMUNICATIONS DECENCY ACT

A. Legislative History and Intent

Before Congress enacted section 230 of the Communications Decency Act,10347 U.S.C. § 230. it enacted a subsection of the Telecommunications Act of 1996104Telecommunications Act of 1996, Pub. L. No. 104-104, 110 Stat. 56 (codified as amended in scattered sections of 47 U.S.C.). in order to protect internet platforms from liability for third-party content. Common law had created a much different legal standard. In Stratton Oakmont v. Prodigy Services, a defamation case involving the “Wolf of Wall Street,” Jordan Belfort, in which an anonymous user wrote on Prodigy’s online message board that Belfort’s brokerage had engaged in criminal and fraudulent acts, the New York Supreme Court held that the message board was a “publisher” and moderating some posts and establishing guidelines for impermissible content meant that the message board was liable.105Stratton Oakmont v. Prodigy Servs. Co., No. 31063/94, 1995 N.Y. Misc. LEXIS 229 (Sup. Ct. May 24, 1995); Daisuke Wakabayashi, Legal Shield for Social Media Is Targeted by Lawmakers, N.Y. Times (Dec. 15, 2020), https://www.nytimes.com/2020/05/28/business/section-230-internet-speech.html [https://perma.cc/F2K5-74JR]. Thus, an internet platform would bear no liability for illegal context created by its users, but this protection did not extend to a platform that moderated user-created content.106 Christopher Cox, The Origins and Original Intent of Section 230 of the Communications Decency Act, U. Richmond J.L. & Tech. (Aug. 27, 2020), https://jolt.richmond.edu/2020/08/27/the-origins-and-original-intent-of-section-230-of-the-communications-decency-act/ [https://perma.cc/KW22-
VQQZ].
This created a policy-poor incentive in that platforms could adopt an “anything goes” model for user-created content to avoid open-ended liability.107Id. In response, then-Representatives Ron Wyden, a Democrat from Oregon, and Christopher Cox, a Republican from California, were concerned that this precedent would disincentivize websites to block obscene content.108Wakabayashi, supra note 105.

Representatives Wyden and Cox were also concerned about another extreme: then-Senator James Exon (Democrat from Nebraska) proposed a bill in the summer of 1996 to ban “anything unsuitable for minors from the internet.”109Cox, supra note 106. Senator Exon’s bill, which passed in the Senate with eighty-four votes in favor and sixteen votes opposed, cast an extremely wide net as “anyone who posted any ‘indecent’ communication, including any ‘comment, request, suggestion, proposal [or] image’ that was viewable by ‘any person under 18 years of age,’ would become criminally liable, facing both jail and fines.”110Id. Moreover, the bill went so far as to criminalize the mere transmission of such content.111Id. Representatives Wyden and Cox responded and proposed their own bill that was intended to protect speech and privacy on the internet from government regulation and “incentivize blocking and filtering technologies that individuals could use to become their own censors in their own households.”112Id. Representative Wyden emphasized that “parents and families are better suited to guard the portals of cyberspace and protect our children than our Government bureaucrats,” and argued against federal censorship of the internet.113141 Cong. Rec. H8470 (daily ed. Aug. 4, 1995) (statement of Rep. Ron Wyden). This way, content creators would be liable for compliance with all civil and criminal laws relating to their content, but this responsibility would not shift to internet platforms, “for whom the burden of screening billions of digital messages, documents, images, and sounds would be unreasonable—not to mention a potential invasion of privacy.” Instead, platforms are permitted to review and moderate some content in the course of enforcing rules against obscene content while still maintaining a broad liability shield.114Cox, supra note 106. This measure received 420 yeas and four nays in the House of Representatives, and Congress ultimately passed its version of the Telecommunications Act—with both the contradicting Cox-Wyden amendment and Exon amendment.115Id. However, within a year of the statute’s enactment, the Exon amendment was struck down by the Supreme Court, which unanimously held that the Exon amendment created an unacceptable burden on adult speech because “[i]n order to deny minors access to potentially harmful speech, the CDA effectively suppresses a large amount of speech that adults have a constitutional right to receive and to address to one another.”116Reno v. ACLU, 521 U.S. 844, 874 (1997). Ironically, because Exon’s legislation and Cox-Wyden’s legislation were merged into the same legislative title, after Exon was declared unconstitutional, the Cox-Wyden amendment became section 230 of the Communications Decency Act, the exact name of the legislation that it was designed to rebuke.117Cox, supra note 106. When section 230 was enacted in 1996, less than half of Senators and only a quarter of House Representatives even had email addresses.118Id. Though people likely generally understood the burgeoning significance of the internet, it was probably hard to foresee exactly how important user-generated content would become to everyday lives and activities or even the sheer volume of internet traffic.119Id. As of 2020, more than 85% of American businesses with websites rely on user-generated content, making section 230 essential to ordinary commerce. Id. User-generated content has saved lives by allowing people to locate loved ones during deadly tornados, user-generated content is vital to law enforcement and social services, and during the COVID-19 crisis, online access to user-created educational resources was crucial to countless families. Id.

Overall, section 230 serves three core purposes. First, it “maintain[s] the robust nature of internet communication and, accordingly . . . keep[s] government interference in the medium to a minimum.”120Jones v. Dirty World Ent. Recordings LLC, 755 F.3d 398, 407 (6th Cir. 2014) (quoting Zeran v. AOL, 129 F.3d 327, 330 (4th Cir. 1997)). Second, the immunity provided by section 230 “protects against the ‘heckler’s veto’ that would chill free speech,” as without section 230, individuals could threaten litigation against interactive computer service providers, which would be forced to choose to either remove the content or face litigation costs and potential liability.121Id. at 407–08. Third, section 230 encourages interactive computer service providers to self-regulate “offensive” material as a response to the holding in Stratton Oakmont, in which the provider of an electronic message-board service was “potentially liable for its user’s defamatory message because it had engaged in voluntary self-policing of the third-party content.”122Id. at 408. However, the broad immunity shield granted to providers has arguably led to disincentivize providers from self-regulating.123See Doe v. GTE Corp., 347 F.3d 655, 660 (7th Cir. 2003) (discussing the inconsistency between section 230’s caption and its judicial interpretation); see also Andrew J. Crossett, Unfair Housing on the Internet: The Effect of the Communications Decency Act on the Fair Housing Act, 73 Mo. L. Rev. 195, 202 (2008) (“The title makes little sense when the effect of the section is ‘to induce ISPs to do nothing about the distribution of indecent and offensive materials via their services.’ ” (quoting GTE, 347 F.3d at 660)).

Judicial interpretation of section 230 is crucial to determine whether platforms such as Facebook, Reddit, Imgur, and others could be liable for the infringing actions of third-party users. Section 230 unambiguously provides immunity to providers and users of interactive computer services from liability for subject matter generated by third parties as (c)(1) states, “No provider or user of an interactive computer service shall be treated as the publisher or speaker of any information provided by another information content provider.”12447 U.S.C. § 230(c)(1). Though “immunity” or a synonym is not explicitly included in section 230(c)(1), reviewing courts have recognized the provision to protect interactive service providers for the display of content created by someone else.125Seaton v. TripAdvisor LLC, 728 F.3d 592, 599 n.8 (6th Cir. 2013) (recognizing that section 230(c)(1) provides immunity); see also Almeida v. Amazon.com, Inc., 456 F.3d 1316, 1321 (11th Cir. 2006) (“The majority of federal circuits have interpreted the CDA to establish broad federal immunity to any cause of action that would make service providers liable for information originating with a third-party user of the service.” (citations and quotations omitted)). The main purpose of section 230 is to bar “lawsuits seeking to hold a service provider liable for its exercise of a publisher’s traditional editorial functions—such as deciding whether to publish, withdraw, postpone, or alter content.”126Zeran v. AOL, 129 F.3d 327, 330 (4th Cir. 1997). In Zeran v. AOL, which was decided shortly after the CDA was enacted, the Fourth Circuit heard a defamation claim against America Online (“AOL”) alleging “that AOL unreasonably delayed in removing defamatory messages posted by an unidentified third party, refused to post retractions of those messages, and failed to screen for similar postings thereafter,” but held that the CDA squarely barred the claim.127Id. at 328; see id. at 330–35.

 Section 230 defines an “interactive computer service” as “any information service, system, or access software provider that provides or enables computer access by multiple users to a computer server, including specifically a service or system that provides access to the Internet.”12847 U.S.C. § 230(f)(2). This broad definition covers many entities operating online, including broadband internet access providers (such as Verizon FIOS and Comcast Xfinity), internet hosting companies (such as DreamHost and GoDaddy), search engines (such as Google and Yahoo!), online messaging boards, and many varieties of online platforms.129Kathleen Ann Ruane, Cong. Rsch. Serv., LSB10082, How Broad A Shield? A Brief Overview of Section 230 of the Communications Decency Act 2 (2018), https://
digital.library.unt.edu/ark:/67531/metadc1156941/m2/1/high_res_d/LSB10082_2018Feb21.pdf [https://
perma.cc/KBF6-B8UX].
An “information content provider” is “any person or entity that is responsible, in whole or in part, for the creation or development of information provided through the Internet or any other interactive computer service.”13047 U.S.C. § 230(f)(3). Thus, section 230 distinguishes those who create content from those who provide access to that content, providing a broad liability shield to the latter group.131Brannon & Holmes, supra note 93, at 3.

It is undisputed that section 230(c)(1) of the CDA is limited by section 230(e)(2), which requires courts to “construe Section 230(c)(1) in a manner that would neither ‘limit or expand any law pertaining to intellectual property.’ ”132Perfect 10, Inc. v. CCBill LLC, 488 F.3d 1102, 1118 (9th Cir. 2007) (quoting Gucci Am., Inc. v. Hall & Assocs., 135 F. Supp. 2d 409, 413 (S.D.N.Y. 2001) and 47 U.S.C. § 230(e)(2)). However, there are conflicting interpretations of section 230(e)(2) of the CDA. This discrepancy is the focus in many rights of publicity cases and other cases rooted in state causes of action, such as Hepp v. Facebook. In determining whether section 230(e)(2) applies, courts have sometimes looked not only to whether the plaintiff is suing under a law that generally involves intellectual property issues, but more specifically, whether the plaintiff’s claim actually involves an intellectual property right.133Enigma Software Grp. USA, LLC v. Malwarebytes, Inc., 946 F.3d 1040, 1052–53 (9th Cir. 2019); see also, e.g., Corker v. Costco Wholesale Corp., No. C19-0290RSL, 2019 U.S. Dist. LEXIS 51933, at *6 (W.D. Wash. Mar. 25, 2020) (concluding section 230(e)(2) did not apply to a false association claim because the claim did “not involve an intellectual property right or trademark”); Doe v. Friendfinder Network, Inc., 540 F. Supp. 2d 288, 302–03 (D.N.H. 2008) (holding that section 230(e)(2) did not apply to state right of privacy claims that involved personal rights). It is significant to note that protection of intellectual property rights on internet platforms is limited by federal protections, such as the safe harbor provisions of section 512 of the DMCA.134See supra Section I.E; 17 U.S.C. § 512; 47 U.S.C. § 230. These safe harbors give providers a broad liability shield from indirect liability for copyright infringement by third-party users, which is relevant here as these safe harbors could potentially be interpreted to indicate congressional intent to protect platforms against liability for intellectual property infringement by third-parties. Because both statutes were enacted in the late 1990s, there has been debate over whether they should still exist in their current form, as the internet is nearly unrecognizable as compared to the late 1990s.

B. Arguments that Subsection (E)(2) Should Be Interpreted to Be Limited to Federal Intellectual Property Laws

In three relatively short paragraphs, the Ninth Circuit directly addressed in 2009 whether the intellectual property carve-out in section 230(e)(2) should open up interactive computer service providers to liability for claims under state right of publicity statutes in Perfect 10, Inc. v. CCBill LLC and ultimately held that it should not.135Perfect 10, 488 F.3d at 1118. The Ninth Circuit revisited the issue in 2019 in Enigma Software Group USA, LLC v. Malwarebytes, Inc., and affirmed its prior conclusion.136Enigma Software, Grp. USA, LLC v. Malwarebytes, Inc., 946 F.3d 1040, 1053 (9th Cir. 2019) (“We have observed before that because Congress did not define the term ‘intellectual property law,’ it should be construed narrowly to advance the CDA’s express policy of providing broad immunity.”).

Section 230 states that “[n]o provider or user of an interactive computer service shall be treated as the publisher or speaker of any information provided by another information content provider,” and expressly preempts any state law to the contrary,13747 U.S.C. § 230(c)(1), (e)(3). so the majority of federal circuits have interpreted section 230 to establish “broad ‘federal immunity to any cause of action that would make service providers liable for information originating with a third-party user of the service.’ ”138Almeida v. Amazon.com, Inc., 456 F.3d 1316, 1321 (11th Cir. 2006) (quoting Zeran v. Am. Online, Inc., 129 F.3d 327, 330 (4th Cir. 1997)); see also Carafano v. Metrosplash.com, Inc., 339 F.3d 1119, 1122 (9th Cir. 2003) (citing Batzel v. Smith, 333 F.3d 1018, 1026-27 (9th Cir. 2003)). There is no express definition of “intellectual property” in the CDA, and there are many types of laws that could arguably be characterized as intellectual property claims.139Perfect 10, 448 F.3d at 1118. The Ninth Circuit reasoned that while the scope of federal intellectual property laws is “relatively well-established,” state laws governing intellectual property claims significantly differ and do not provide analogous uniformity. Therefore, construing “any law pertaining to intellectual property” in subsection (e)(2) to literally mean any intellectual property law, including state laws, would open up interactive computer service providers to a massive amount of liability with extremely unpredictable litigation.140Id. To avoid this, the Ninth Circuit held that the term “intellectual property” should instead mean “federal intellectual property” in order to protect Congress’s “expressed goal of insulating the development of the Internet from the various state-law regimes.”141Id.; see 47 U.S.C. § 230(a)–(b); see also Batzel v. Smith, 333 F.3d 1018, 1027 (9th Cir. 2003) (noting that “courts construing § 230 have recognized as critical in applying the statute the concern that lawsuits could threaten the freedom of speech in the new and burgeoning Internet medium” (citations omitted)). Furthermore, regarding the right of publicity specifically, there is an argument that the publicity rights do not constitute intellectual property rights for the purposes of the liability carve-out, in which case subsection (e)(2) would be irrelevant and right of publicity suits would be barred by section 230(c)(1).

This concern is certainly valid with respect to right of publicity claims; as noted above, of the twenty-five states that have actually granted statutory protection to an individual’s right of publicity, there are vast discrepancies between state statutes, including the scope of the right, who may assert a claim, and the duration of the right.142See supra Part I. However, websites and their respective contents are accessible in all fifty states at any given time. Thus, if section 230 does not immunize interactive computer service providers from causes of action stemming from right of publicity statutes, each state with relevant legislation could potentially have a different outcome. For example, if potentially infringing content used only an individual’s voice for a commercial purpose without the individual’s consent, it would be actionable only in Alabama, California, Hawaii, Indiana, Illinois, Nevada, New York, Ohio, Oklahoma, South Dakota, Texas, and Washington, but likely not in Florida, Kentucky, Massachusetts, Nebraska, Pennsylvania, Rhode Island, Tennessee, Utah, Virginia, or Wisconsin.143See Ala. Code §§ 6-5-771, 6-5-772 (2022); Cal. Civ. Code § 3344 (West 2022); Haw. Rev. Stat. Ann. § 482P-2 (2022); 765 Ill. Comp. Stat. Ann. §§ 1075/1–1075/60 (2022); Ind. Code Ann. §§ 32-36-1.02 to 32-36-1-20 (2022); Nev. Rev. Stat. Ann. §§ 597.770–597.810 (2022); N.Y. Civ. Rights Law § 51 (McKinney 2022); Ohio Rev. Code §§ 2741.01–2741.99 (West 2022); Okla. Stat. tit. 12 §§ 1448–1449 (2022); S.D. Codified Laws §§ 21-62-1 to 21-64-12 (2022); Tex. Prop. Code Ann. §§ 26.001–26.015 (West 2021); Wash. Rev. Code Ann. § 63.60.080 et seq. (2022). But see Fla. Stat. Ann. § 540.08 (2022); Ky. Rev. Stat. Ann. § 391.170 (West 2022); Mass. Ann. Laws ch. 214, § 3A (2022); Neb. Rev. Stat. Ann. §§ 20-201 to 20-211 (2022); 42 Pa. Cons. Stat. § 8316 (2022); 9 R.I. Gen. Laws § 9-1-28 (2022); Tenn. Code Ann. §§ 47-25-1101 to 47-25-1108 (2022); Utah Code Ann. §§ 45-3-1 to 45-3-6 (LexisNexis 2022); Va. Code Ann. §§ 8.01-40, 18.2-216.1 (2022); Wis. Stat. § 995.50 (2022). Even states that would allow this claim to proceed have different required elements regarding who may bring the claim and the duration of the right, among others, and even if an individual were able to successfully assert their right and win their case, these statutes grant different remedies.144See supra note 143. Overall, allowing section 230(e)(2) to include the state right of publicity laws within the intellectual property exception could open up interactive computer service providers to a massive amount of unpredictable liability. Again, because the internet is accessible throughout the country, these providers would be required to comply with the most restrictive state’s standards to avoid liability.

Additionally, states also have different choices of law and jurisdictional reaches that could lead to forum-shopping. For instance, the broad choice of law and jurisdictional reach of the Indiana statute, collectively with the statute’s “expansive scope of protection and purported applicability to non-domiciliaries and deceased individuals, opens up Indiana courts for suits brought by many individuals who might not have a cause of action in their home states.”145Adkins, supra note 60, at 524; see Ind. Code Ann. § 32-36-1-1.02 to 32-36-1-20 (2022). Though forum-shopping would likely not pose a significant risk if the relevant statute requires that an individual seeking to assert a claim be domiciled in that state as an individual can only be domiciled in one state, it is still a possibility, particularly if the statute does not limit who may assert a claim in the state or if the allegedly infringing content in question involves multiple individuals. For example, because the Indiana right of publicity statute specifies that it “applies to an act or event that occurs within Indiana, regardless of a personality’s domicile, residence, or citizenship,” an individual who may not meet the required elements of another state’s statute could be incentivized to assert their right in Indiana instead.146Ind. Code Ann. § 32-36-1-1(a) (2022) (emphasis added). This possibility could force entities that utilize others’ personality rights to comply with Indiana’s statute over others.

Alternatively, individuals who split their time between different states may raise a question of domicile. For example, in a series of cases involving who could control the commercial use of the iconic photograph of Marilyn Monroe standing over a subway grate with her white skirt blowing up around her from the film The Seven Year Itch, because Monroe split her time, work, and property ownership between New York and California, the significant differences in the state law made the question of domicile critical.147Adkins, supra note 60, at 499, 526; see e.g., Shaw Family Archives Ltd. v. CMG Worldwide, Inc., No. 05 Civ. 3939 (CM), 2008 WL U.S. Dist. LEXIS, at *1 (S.D.N.Y. Sept. 2, 2008). Eventually, the Monroe estate lost its rights in Monroe’s identity because the court determined that Monroe’s domicile resulted in the application of New York law.148Milton H. Greene Archives, Inc. v. CGM Worldwide, Inc., 568 F. Supp. 2d 1152, 1198–99 (C.D. Cal. 2008).

In Perfect 10, the Ninth Circuit most likely implicitly categorized the right of publicity as intellectual property because it considered whether the California statute protecting the right of publicity should be included in the Section(e)(2) exception and concluded it should not.149Perfect 10, Inc. v. CCBill LLC, 448 F.3d 1102, 1118, 1121 n.5 (9th Cir. 2007). The Ninth Circuit stated that “[s]tates have any number of laws that could be characterized as intellectual property laws: trademark, unfair competition, dilution, right of publicity and trade defamation.”150Id. Due to the nature of the inconsistency of state laws, “no litigant will know if he is entitled to immunity for a state claim until a court decides the legal issue.”151Id.

The California right of publicity statute is distinct from the right of privacy and stresses the economic value of an individual’s persona as property,152Adkins, supra note 60, at 508–12; see Cal. Civ. Code § 3344 (West 2022). which aligns with the general consensus that the right of publicity is a property right rather than a personal one. However, this is not true for all states. Because the right of publicity originally stemmed from a privacy theory, some states have retained this classification. In New York, for instance, the current statute is titled the “Right of Privacy,” and as such, is concerned with protecting an individual’s identity rather than unfair competition.153See N.Y. Civ. Rights Law §§ 50–51 (McKinney 2022). Despite the fact that Haelan was the first to recognize that a right of publicity existed separately from the right of privacy under New York law in 1953, the current New York statute is relatively limited compared to other states.154Adkins, supra note 60, at 505–06; Haelan Lab’ys, Inc. v. Topps Chewing Gum, Inc., 202 F.2d 866, 868 (2d Cir. 1953). If the right of publicity is not rooted in a theory of property, “then the right clearly may be made the subject of license or waiver, but cannot have independent, exclusive, alienable, or divisible characteristics.”155Sheldon W. Halpern, The Right of Publicity: Commercial Exploitation of the Associative Value of Personality, 39 Vand. L. Rev. 1199, 1238 (1986). However, if the right of publicity is defined as a property right, then the right may be “assignable, survivable, descendible, and even taxable.” This difference was another significant issue in the series of Marilyn Monroe cases described above; because New York law applied, issues of assignability resulted in the Monroe estate losing its rights in Monroe’s identity.156Adkins, supra note 60, at 526; Milton H. Greene Archives, Inc. v. CGM Worldwide, Inc., 568 F. Supp. 2d 1152, 1198–99 (C.D. Cal. 2008). Somewhat similarly, a New Hampshire trial court held that three right-of-privacy torts, including “intrusion upon seclusion, publication of private facts, and casting in a false light,” involved rights that could not be considered property rights.157Doe v. Friendfinder Network, Inc., 540 F. Supp. 2d 288, 302–03 (D.N.H. 2008). Thus, the claims did not fit within the intellectual property carve-out and section 230 barred the claims.158Id. at 303.

Theoretically, if the right of publicity is not actually classified as intellectual property, section 230(e)(2) would not apply and right of publicity claims brought in those states would unquestionably be barred. Despite considering the intellectual property carve-out of section 230, the Ninth Circuit declined to explicitly define what constitutes “intellectual property” or reference a definition of the term in Perfect 10 and instead construed the term narrowly to advance the CDA’s express policy of providing broad immunity.159See Perfect 10, Inc. v. CCBill LLC, 448 F.3d 1102, 1118–19 (9th Cir. 2007). Conversely, the Third Circuit applied multiple definitions of intellectual property, including one from Black’s Law Dictionary that defines the term as a “category of intangible rights protecting commercially valuable products of the human intellect. The category comprises primarily trademark, copyright, and patent rights, but also includes . . . publicity rights.”160Hepp v. Facebook Inc., 14 F.4th 204 app. a (3d Cir. 2021). The intellectual property system aims to strike a good balance between the interests of innovators and the wider public in order to “foster an environment in which creativity and innovation can flourish.”161What Is Intellectual Property?, World Intell. Prop. Org., https://www.wipo.int/about-ip/en/ [https://perma.cc/uw8c-2usn].

Interestingly, this is very similar to Congress’s stated purpose behind section 230 of the CDA. Even in 1996, the internet was already a valuable tool for society that offered significant opportunities for people to both create and express content, as well as learn from the massive amount of information available, so Representatives Cox and Wyden wanted to strike an analogous balance to the intellectual property system. By shielding interactive computer service users and providers from liability and allowing them to moderate user-generated content so long as they do not participate in the generation of allegedly infringing content in any way, section 230 was designed to balance innovation and public interest of free speech online.162Cox, supra note 106; Ron Wyden, Perspectives: I Wrote This Law to Protect Free Speech. Now Trump Wants to Revoke It, CNN (June 9, 2020, 10:31 AM), https://www.cnn.com/2020/06/09/
perspectives/ron-wyden-section-230/index.html [https://perma.cc/5A88-RP8A]; see 47 U.S.C. § 230(a)–(c).
The precarious balance of these significant competing interests could be greatly threatened if section 230(e)(2) were interpreted to include state right of publicity laws because internet platforms would be subject to liability in an ever-evolving and incredibly inconsistent doctrine of law.

Right of publicity actions involve both confusion-based and association-based relationships. Confusion-based relationships include situations “where a person’s name or likeness is used in commercial advertising, creating a likelihood that consumers will believe the person endorses or approves of the advertised product.”163Beckerman-Rodau, supra note 41, at 164–65; see e.g., Carson v. Here’s Johnny Portable Toilets, Inc., 698 F.2d 831, 837 (6th Cir. 1983). Association-based relationships, on the other hand, are “mere references that conjure associations with a person [but] do not automatically create a likelihood that consumers will be confused as to whether the person endorses or approves of the product.”164Beckerman-Rodau, supra note 41, at 165. See generally ETW Corp. v. Jireh Publ’g, Inc., 332 F.3d 915 (6th Cir. 2003) (noting that the Sixth Circuit has created an eight-factor test to determine the likelihood of confusion). Because confusion-based relationships are already protected by the broad scope of trademark and unfair competition law, these types of claims do not necessarily need to be brought under state right of publicity law in order to fit within the subsection (e)(2) intellectual property exemption.165Beckerman-Rodau, supra note 41, at 167. For example, celebrities and public figures can register their names as a trademark or service mark under federal trademark law.166See 15 U.S.C. § 1052(e)(4), (f) (allowing surname to be registered as trademark provided it has acquired distinctiveness); see also Russell W. Jacobs, Recapturing Rareness: The Significance of Surname Rareness in Trademark Registration Determinations, 50 Idea 395, 395 (2010). As described above in White, Vanna White brought a likelihood of confusion claim under the Lanham Act and the Ninth Circuit found that the provided evidence was sufficient to present a genuine issue of material fact.167White v. Samsung Elecs. Am. Inc., 971 F.2d 1395, 1399–1401 (9th Cir. 1992). Similarly, unfair competition acts can be brought under federal trademark law even without a registered trademark.168See Kournikova v. Gen. Media Commc’ns, Inc., No. CV 02-3747 GAF, 2002 U.S. Dist. LEXIS 25810, at *17 (C.D. Cal. Aug. 9, 2002) (“A celebrity may bring a false endorsement claim for the unauthorized use of her identity if such use is likely to confuse consumers as to the celebrity’s sponsorship or approval of the product.”), aff’d, 2002 U.S. App. LEXIS 24439 (9th Cir. 2002). See generally Allen v. Nat’l Video, Inc., 610 F. Supp. 612, 625 (S.D.N.Y. 1985) (holding that unfair competition under 15 U.S.C. § 1125(a) is a new federal tort that covers more than trademark infringement, and it should be construed broadly to remedy unfair competitive actions). Thus, analogous cases involving confusion-based relationships can be brought under the Lanham Act as it provides nationwide coverage, a clear remedy, and a wide scope of damages. Such claims would not be barred by section 230 because under either interpretation of subsection (e)(2), the Lanham Act would clearly fit within the statutory exemption for intellectual property. Therefore, plaintiffs such as White may have some recourse available to them even if the circuit split on the interpretation of subsection (e)(2) is resolved to bar state right of publicity laws. However, this case was decided in 1992, so the CDA had not yet been enacted. The next relevant question would be to determine whether interactive service providers might still be entitled to immunity for violations of specific provisions of the Lanham Act. In a more recent case, the Ninth Circuit held that despite the fact that the Lanham Act generally deals with intellectual property—for example, trademarks—the intellectual property carve-out in section 230(e)(2) “does not apply to false advertising claims brought under [section] 1125(a) of the Lanham Act, unless the claim itself involves intellectual property.”169Enigma Software Grp. USA, LLC v. Malwarebytes, Inc., 946 F.3d 1040, 1053 (9th Cir. 2019), cert. denied, 141 S. Ct. 13 (2020) (statement of Thomas, J., respecting the judgment). The Supreme Court denied certiorari for this case, but Justice Thomas stated that the Court should consider whether the text of section 230 of the CDA aligns with the current state of immunity enjoyed by internet platforms in a more appropriate case. His statement did not address the intellectual property exception to liability under section 230.

Overall, the Ninth Circuit declined to include rights of publicity protected by state law within the “intellectual property” exemption because doing so would “fatally undermine the broad grant of immunity provided by the CDA.”170Perfect 10, Inc. v. CCBill LLC, 448 F.3d 1102, 1119 n.5 (9th Cir. 2007). Despite the isolated language in section 230(e)(2), reading section 230 holistically leads to the conclusion that courts should defer to the legislative intent and purpose of creating a “vibrant and competitive free market that presently exists for the Internet and other interactive computer services, unfettered by . . . State regulation.”17147 U.S.C. § 230(a)–(b); see Perfect 10, 448 F.3d at 1118–19. Thus, to interpret section 230(e)(2) to include diverse state intellectual property laws, particularly those regarding the right of publicity, would mean that entities otherwise entitled to CDA immunity would be forced to endure litigation costs for extremely unpredictable state laws, defeating the purpose and policy goals of section 230.

C. Arguments that Subsection (E)(2) Should Be Interpreted to Include State Intellectual Property Laws

In contrast, the Third Circuit recently addressed whether state right of publicity laws should be included in the intellectual property exemption in Karen Hepp’s appeal and concluded that “a state law can be a law pertaining to intellectual property, too.”172Hepp v. Facebook Inc., 14 F.4th 204, 209–12 (3d Cir. 2021) (citations omitted). Despite the Ninth Circuit’s holding in Perfect 10, the Third Circuit reasoned that the plain language of section 230(e)(2) is clear. If Congress had actually intended for the intellectual property exemption to be limited to federal law, “it knew how to make that clear, but chose not to.”173Id. at 210 (citations omitted) (quoting Atl. Recording Corp. v. Project Playlist, Inc., 603 F. Supp. 2d 690, 703 (S.D.N.Y. 2009)).

The Third Circuit’s holding builds off Universal Communication Systems, Inc. v. Lycos, Inc., the first case to address whether section 230 precludes intellectual property laws. First Circuit case was decided shortly before Perfect 10 in which there were alleged violations of federal law as well as trade name dilution in violation of Florida law.174Universal Commc’n Sys., Inc. v. Lycos, Inc., 478 F.3d 413, 417 (1st Cir. 2007). In Universal Communication, the First Circuit held “[c]laims based on intellectual property laws are not subject to section 230 immunity,” so it addressed the dilution claim separately.175Id. at 422–23 (citing 47 U.S.C. § 230(e)(2)).

The Third Circuit also considered district court cases that interpreted section 230(e)(2), including Atlantic Recording Corp. v. Project Playlist, Inc.176Hepp, 14 F.4th at 204, 209–10 (3d Cir. 2021); see Atl. Recording Corp. v. Project Playlist, Inc., 603 F. Supp. 2d 690, 704 (S.D.N.Y. 2009) (“Section 230(c)(1) does not provide immunity for either federal or state intellectual property claims.”). In Atlantic Recording, record companies asserted copyright claims under both state and federal law, and the court reasoned that because Congress specified whether local, state, or federal law applied four times in subsection (e), Congress did not intend to limit the intellectual property exemption to only federal law.177Atl. Recording, 603 F. Supp. 2d at 703. Within section 230(e), the statutory language specifies whether local, state, or federal law applies in four places: once to federal criminal law in subsection (e)(1), twice in the general state law provision in subsection (e)(3), and again in the communications law context in subsection (e)(4). Therefore, Congress expressly chose not to limit subsection (e)(2) only to federal intellectual property law. Therefore, in Hepp, the Third Circuit found that because Congress knew how to cabin the interpretation about state law and did so explicitly, “the structure does not change the natural meaning.”178Hepp, 14 F.4th at 211. Atlantic Recording did not involve any DMCA safe harbors, but other such cases that involve allegedly infringing third-party user content could consider the interaction between the CDA intellectual property exception and the DMCA safe harbors to determine whether interactive service providers might still be immune from liability.179See Atl. Recording, 603 F. Supp. 2d at 694; see also supra Section I.E.

While it is true that section 230 generally created a pro-free-market policy, the statute’s policy goals do not necessarily swallow state intellectual property rights because state property rights can also facilitate market exchange. The Third Circuit noted that because the natural reading of section 230(e)(2) would include state law, “policy considerations cannot displace the text.”180Hepp, 14 F.4th at 211. Even so, the Third Circuit stated that policy could cut the other way even outside section 230’s text: “if likeness interests are disregarded on the internet, the incentives to build an excellent commercial reputation for endorsements may diminish.”181Id.; cf. Zacchini v. Scripps-Howard Broad. Co., 433 U.S. 562, 573 (1977) (explaining the economic theory underlying the right of publicity).

The Third Circuit also considered whether Hepp’s claims actually arose from a law pertaining to intellectual property, and concluded that they do.182Hepp, 14 F.4th at 212–14. Black’s Law Dictionary defines “intellectual property” to include publicity rights, and both legal and lay dictionaries treat “intellectual property” as a compound term.183Id. at 212–13, apps. a–b; see Intellectual Property, Black’s Law Dictionary (11th ed. 2019). The court also applied a test from another legal dictionary, Bouvier’s, which the Pennsylvania right of publicity statute satisfies.184Hepp, 14 F.4th at 213, app. a. Overall, there is substantial evidence to support the conclusion that at least Pennsylvania’s statutory right of publicity falls within the definition of intellectual property.185Id.; The Wolters Kluwer Bouvier Law Dictionary proposes a test that the Pennsylvania statute satisfies because it grants individuals monopolies in their likenesses. See id. Further, the statute provides for “property-like relief, including the ability to obtain damages and injunctions against trespassers.” Id. at 213; see 42 Pa. Cons. Stat. § 8316(a) (2022). Moreover, the Third Circuit considered the only Supreme Court case to address the right of publicity, Zacchini v. Scripps-Howard Broadcasting Co., which analogized the right of publicity to patent and copyright law because the right of publicity focuses “on the right of the individual to reap the reward of his endeavors and [has] little to do with protecting feelings or reputation.”186Id. at 213; Zacchini, 433 U.S. at 573. As analyzed above, the right of publicity and trademarks are relatively analogous for confusion-based relationships. The Florida Supreme Court articulated the harm caused by a right of publicity violation by “associat[ing] the individual’s name or . . . personality with something else.”187Tyne v. Time Warner Ent. Co., 901 So. 2d 802, 806 (Fla. 2005) (citations omitted). Thus, the legal definition including trademark also supports the conclusion that the right of publicity is intellectual property.

However, Judge Cowen dissented in Hepp and stated that he “believe[s] that the ‘intellectual property’ exception or exclusion to immunity under § 230(e)(2) . . . is limited to federal intellectual property laws (i.e., federal patent, copyright, and trademark laws) and—at most—state laws only where they are co-extensive with such federal laws.”188Hepp, 14 F.4th at 216 (Cowen, J., dissenting). Judge Cowen argued that despite the fact that the majority implied there was an existing circuit split between the First and Ninth Circuits due to Universal Communication and Perfect 10, Hepp actually created the circuit split because in Universal Communication neither party actually raised the issue of whether state law counts as intellectual property under section 230 and the First Circuit seemingly assumed it did.189Id. at 217; Perfect 10, Inc. v. CCBill LLC, 488 F.3d 1102, 1119 n.5 (9th Cir. 2007). Ultimately, Judge Cowen supported the Ninth Circuit’s approach for the reasons analyzed in Section II.B and stated that “the more expansive interpretation would gut the immunity system established by Congress and undermine the policies and findings that Congress chose to codify in the statute itself.”190Hepp, 14 F.4th at 220 (Cowen, J., dissenting). Furthermore, on October 21, 2021, Facebook requested that the Third Circuit re-hear the Hepp appeal en banc, arguing that the “ ‘majority’s decision misread the intellectual property exception to the immunity established by section 230 of the Communications Decency Act (CDA) creating a conflict with’ the Ninth Circuit, and ‘ignores a key textual feature and downplays the contextual and structural features of the statute.’ ”191Jana S. Farmer, Gillian A. Fisher, Daniel J. Taylor & Leia Leitner, Third Circuit Takes an Anti-Platform View in Interpreting the Communications Decency Act, Creating a Circuit Split, XII Nat’l L. Rev. 302 (Oct. 29, 2021), https://www.natlawreview.com/article/third-circuit-takes-anti-platform-view-interpreting-communications-decency-act [https://perma.cc/z05z-qdht].

D. Arguments for Section 230 Reform and Proposed Changes

The above arguments and analyses of section 230 of the CDA are applicable in its current state, but section 230 as a whole has recently come under fire from both sides of the political aisle. There have been calls to amend or even repeal the statute.192Anand et al., supra note 18. Many on the left have criticized section 230 because they believe it has “enabled tech platforms to host harmful content with impunity,” while many on the right argue that it has allowed tech platforms to disproportionately suppress conservative speech and perspectives.193Id. The law arguably allows bad actors to hide behind the law’s liability shield and prevents harmed users, such as Karen Hepp, from holding internet platforms accountable. In the 116th congressional session, twenty-six bills were introduced that would have amended the scope of section 230 immunity, and the bills had an extremely wide range of proposed changes, such as reducing the scope of immunity in certain types of cases, placing conditions on immunity, or repealing the statute entirely.194Brannon & Holmes, Cong. Rsch. Serv., supra note 93, at 30. Currently, there are fourteen bills that have been introduced for the 117th congressional session related to section 230, but none of the proposals are related to the judicial interpretation or scope of the intellectual property exception to immunity within subsection (e)(2).195Anand et al., supra note 18.

There is also a question of executive authority in whether the Federal Communications Commission (“FCC”) has regulatory authority to implement section 230. Congress passed the CDA as part of the Telecommunications Act of 1996, which in turn amended the Communications Act of 1934, a statute administered by the FCC.196Brannon & Holmes, Cong. Rsch. Serv., supra note 93, at 36. The National Telecommunications and Information Administration (“NTIA”) filed a petition in 2020 that provides the FCC with an opportunity to consider its rulemaking authority.197Id. at 37. To clarify the FCC’s role in administering section 230, Congress could grant an express delegation or disavowal of authority.198Id. at 42. A delegation would give the FCC a statutory basis for promulgating regulations while a disavowal would prohibit the FCC from attempting to regulate under section 230.199Id.

Legislative action on section 230 in any shape or form could have significant and unintended consequences. Since section 230 was passed in 1996, it has been considered to be the “cornerstone of online expression” and has been referred to as the “[twenty-six] words that created the internet” and the internet’s “Magna Carta.”200Anand et al., supra note 18. The internet has grown exponentially and has influenced daily public life considerably since the statute was enacted in 1996, so a fundamental change to section 230 could change the internet as we know it, and even a small change to section 230 could have a substantial ripple effect. For example, social media operators could potentially adjust their content moderation practices to comply with reforms, ranging from aggressively screening content to not moderating any content, including content that may be considered objectionable or obscene to most users. On the other hand, if section 230 were to remove immunity for certain types of content, it does not necessarily mean that providers or users will actually be liable for such content; it simply means that section 230 would not bar liability. Thus, providers could continue to host potentially obscene or objectionable content if they believe the benefits of hosting such content would outweigh potential litigation costs, particularly if lawsuits are unlikely or providers believe they have a strong likelihood of prevailing in a suit.201Brannon & Holmes, Cong. Rsch. Serv., supra note 93, at 31. This could be a move to bring the reality of section 230 closer to its original congressional intention of creating a free-market system.

Overall, despite the heated debate over section 230, there have not been any proposed changes that have been close to being implemented. The extremely wide range of proposed changes also means that their implications on section 230 generally, as well as the right of publicity specifically, are ironically very unpredictable. Therefore, the circuit split on the interpretation of subsection (e)(2) and whether state right of publicity claims should be barred will continue to be a noteworthy issue until Congress acts, whether through amendments, repealing the statute entirely, or more directly providing guidance on the relatively narrow subject of the right of publicity.202Since this Note was first drafted in 2021, the Supreme Court has considered but not yet decided two cases related to CDA section 230, Gonzalez v. Google LLC, 143 S. Ct. 80 (2022), and Twitter, Inc. v. Taamneh, 143 S. Ct. 81 (2022). These cases do not bring up the intellectual property subsection at issue in this Note but could potentially overhaul section 230 entirely.

CONCLUSION

The explicit statutory language of section 230 of the CDA supports the Third Circuit’s interpretation of subsection (e)(2), the intellectual property exception to immunity. Within subsection (e), Congress specified whether federal, state, or local law applied in four instances, so we must defer to the express language and assume that Congress chose not to limit subsection (e)(2) to only federal intellectual property laws. The general consensus among the legal community is that the right of publicity falls under the umbrella of intellectual property, so the literal interpretation of section 230 should not bar right of publicity claims brought under state statutes. Unless or until Congress clarifies what should be included within this exception to the broad liability shield protecting interactive service providers or takes some other action, the Third Circuit’s interpretation will likely be upheld.

Overall, however, it would make the most sense to interpret section 230(e)(2) in the way that aligns most closely with the legislative intent and history of the statute as a whole to avoid fundamentally crippling the statute by exposing interactive service providers to liability from extremely varied state statutes relating to the right of publicity. The Third Circuit’s interpretation could very likely create an exception that swallows the whole statute. Assuming the right of publicity constitutes intellectual property, under the Ninth Circuit’s interpretation to protect the integrity of section 230, right of publicity claims should be barred so long as providers do not participate in the creation of the allegedly infringing content. This would maintain uniformity and predictability throughout the court system. The Ninth Circuit’s interpretation is thus most beneficial to interactive service providers such as Facebook and most frustrating to individuals who feel they have been harmed by allegedly infringing content on internet platforms, such as Karen Hepp. Such individuals could potentially attempt to redress this harm through other types of claims, such as copyright or trademark. These two examples would undoubtedly constitute intellectual property laws, and claims under federal law would be doubly effective against section 230’s broad shield, but alleged infringements of the right of publicity do not always meet the required elements for such claims. Furthermore, providers could be immune from liability through other statutes, such as the safe harbors from the DMCA, so individuals may be left without a remedy. There is no clear balance or solution to these concerns in the current form of section 230 of the CDA.

The severe implications of the Third Circuit’s seemingly “correct” interpretation could strongly incentivize Congress to clarify either the scope of subsection (e)(2) or separately protect the right of publicity in order to avoid the purpose and intent behind section 230 of the CDA. Many scholars have advocated for a federal statute or a uniform act to protect the right of publicity. Federal codification of the right of publicity would create uniform and equal protections to individuals across the entire country, as opposed to the current state statutes that have created extremely varied interests in the right. A federal statute or uniform act would also drastically reduce the economic costs created by uncertainty in litigation. Finally, unauthorized uses of individuals’ “personas,” including name and likeness, are becoming increasingly more common due to improvements in technology and the expansion of social media. This type of action would balance the interests of wanting to protect both the public’s right of publicity and interactive service providers from liability for user-generated content. Ultimately, due to the complex and time-intensive nature of Congressional processes, any proposed change, if any, to section 230 may not be established for some time, so there is likely going to be substantial consequences and potentially a wave of lawsuits for alleged violations of the right of publicity in the wake of the Third Circuit’s holding in Hepp.

 

96 S. Cal. L. Rev. 449

Download

J.D., University of Southern California Gould School of Law, 2023. B.A., University of California, Los Angeles, 2020.