Articles Archives – Page 11 of 57 – Southern California Law Review

Affirmative Acting: The Role of Law in Casting More Actors With Disabilities (A Note in Five Acts)

May 2023May 2023Kira Patterson*

SETTING THE STAGE: INTRODUCTION

“Always find your light.” This is a common piece of advice given to theater artists, encouraging them to make sure they can be seen on stage.¹ But who gets the chance to grace the stage in the first place? Our society has recently begun to actively ask new questions about equity and visibility. In the context of theater, we have largely focused not on who is on the stage, but on how theater productions can be enjoyed equitably. To answer these questions, we have turned to effectuating and enforcing the standards set forth in the Americans with Disabilities Act (“ADA”).²

Over the thirty years since the ADA was passed, and even more concertedly following the Act’s 2008 amendments,³ theaters have begun to reserve sections of the house⁴ for patrons with wheelchairs and to renovate facilities to create ADA-compliant restrooms for audience members.⁵ However, while the ADA has prompted great strides in improving theatergoers’ ability to access productions, figuring out how to apply the ADA to the people on the stage is another question altogether.⁶

While audience accessibility is a tremendous step forward in ensuring that enjoyment of theater productions does not exclude people who cannot easily access and navigate unwelcoming spaces, the time has come to turn the spotlight back towards the stage. We need to turn our attention to creating and nurturing structures allowing for equal access for theater performers—specifically, for the purposes of this Note, those with disabilities. Pushes in the theater world to pinpoint and remedy gender and racial inequities have become much more prevalent in recent years,⁷ and rightfully so, but we must not leave those with disabilities out of the discussion. As it is, “disability is too often an afterthought, if it is thought of at all.”⁸

Indeed, Actors’ Equity Association (“AEA”), the union for actors and other theater makers,⁹ even reported that barely 1% of contracts they issued from 2016 to 2019 went to artists who self-reported living with a disability.¹⁰ AEA also estimated that about a quarter of Americans live with at least one disability.¹¹ If you compare these statistics (about 25% of Americans have disabilities, but only about 1% of theater jobs offered over the course of three years went to artists who reported living with disabilities), the problem should begin to crystallize: Why are we not seeing representation of people with disabilities on stage at the same rates as in society?

While data on disability representation on stage is scarce, we can look to data collected in theater’s more closely studied sister entertainment industries of television and film to get a sense of what levels of representation in the theater might look like. For instance, in the sphere of network television in 2018, only 22% of characters with disabilities were actually portrayed by an actor with the same disability; for streaming services, this number decreased to 20%.¹² The unfortunate result gleaned from this study and studies like it is that the overwhelming majority of characters with disabilities—at least in the context of film and television—continue to be portrayed by actors without disabilities.¹³ The first time an Emmy was awarded to a show starring people with disabilities was not until 2016,¹⁴ and the number of actors with disabilities who have ever won an Oscar can be counted on one hand.¹⁵ These statistics truly pull back the curtain on an entertainment industry that does not tend to value actors with disabilities.

There is reason to believe the statistics are just as grim in the theater. For instance, it was not until 2019 that an actor in a wheelchair (Ali Stroker) first won a Tony award (for her tremendous performance in a revival of the classic Broadway hit Oklahoma!).¹⁶ This lack of representation begs the question of why, as society finally begins to converse more openly about equity, “disability” is still so often excluded from the discussion.¹⁷

This Note will attempt to answer that question—and explore what role law could play in arriving at a solution—through a variety of lenses, including the ADA and employment discrimination law. It will set the proverbial stage by laying out the history of disability discrimination in theater and entertainment, after which it will discuss relevant federal and state sources of disability and employment law. The Note will then make the case—by looking at potential legal remedies¹⁸—that in the subjective world of theater, the way to increase representation of actors with disabilities on stage is not a simple legal fix; instead, it will likely take a combination of changes—attitudinal, legal, and otherwise—working in tandem in the theater industry to get more actors with disabilities on stage. And while making these moves in the direction of inclusion and equity on stages across the country would certainly advantage actors with disabilities, it would also benefit society at large: theater that reflects our tapestried reality “is simply better, richer, [and] more rewarding when it is by, for, and about all of us.”¹⁹

ACT I. HISTORY OF DISABILITY IN PERFORMANCE

All too often, actors with disabilities are excluded from the audition room; much—if not most—of the time, actors with disabilities do not get invited to audition at all, regardless of the disability status of the role in question.²⁰ And if the actor has made it into the audition room? That is only the first part of the journey. Next comes the actual casting of the role, where no matter how many talented actors have made it into the room to audition for a single part, only one person leaves with the job. Even once actors with disabilities make it through the door to get seen by the director, the odds are against them in terms of actually landing a role.

Recent Broadway shows that have main characters who have disabilities provide a good look into the regularity with which actors with disabilities get passed over for roles, while actors without disabilities gain more access to those roles. Broadway productions of relatively well-known shows that fit this description are not hard to find. The Curious Incident of the Dog in the Night-Time tells the story of a young man on the autism spectrum, and yet the 2015 Broadway production nonetheless cast an actor “without autism or any other disabilities” for the role.²¹ Wicked, over its yearslong and wildly popular Broadway run, never once filled the role of Nessarose, who uses a wheelchair, with an actress with physical disabilities.²² Likewise, all of the lead roles in recent productions of The Miracle Worker and Richard III, both of which focus on main characters with physical disabilities, were portrayed by actors without physical disabilities.²³

This practice of casting actors without disabilities in the roles of characters with disabilities has come to be known, in some circles, as “disability drag.”²⁴ In fact, a whole microcosm of scholarship has developed around this idea of disability drag, which also takes to task the various tropes that seem to be intertwined in the writing of most, if not all, characters with disabilities currently on stage and screen.²⁵ This area of academic investigation and rumination asks us to reframe the way we think about characters and people with disabilities: “What if their disability weren’t the thing to overcome but merely one element of one’s identity?”²⁶ Nonetheless, on the whole, society appears to turn away from asking itself such introspective questions, especially when the alternative involves making money by casting big-name actors.

None of this means that the world of creating theater is not making some strides on its own. For instance, Deaf West Theatre’s 2015 Broadway production of the musical Spring Awakening was produced and performed in both English and American Sign Language,²⁷ with a cast comprised of “25 deaf, hard of hearing and hearing actors and musicians.”²⁸ The show was met with great success and earned multiple Tony Award nominations, including one for the highly regarded Best Revival of a Musical,²⁹ even though it was a production of a type that Broadway had never seen before.

There are also smaller theater companies popping up that have been created with the explicit goal of promoting the work of artists with disabilities. For instance, the mission of the Phamaly Theatre Company in Denver, Colorado, is “to be a creative home for theatre artists with disabilities” as well as to “model a disability-affirmative theatrical process.”³⁰ Alie B. Gorrie, an actress with low vision, describes her reaction to attending a production “filled with disabled artists, singing, dancing, and actively defying disability tropes” at the Phamaly Theatre Company as the first instance where she felt like she truly belonged in the theater.³¹ The experience, however, left Gorrie with a lingering question: Why had it taken two decades of working in the theater industry for her to feel this sense of belonging?³² Which begs the broader question: How many people who dream of working in the theater industry have already been discouraged and turned away by the lack of access and opportunities?

Despite these steps forwards, it is apparent that sidelining actors who have disabilities deprives society of a wealth of talent. We have seen how powerful performances by actors with disabilities can be and how rewarding it can be to see them in the spotlight, as evidenced in a number of recent television shows. Consider RJ Mitte, an actor with cerebral palsy playing a character with the same disability on the hit AMC show Breaking Bad.³³ More recently, think of Lily D. Moore, an actress born with Down syndrome playing a fan-favorite character with the same diagnosis in Mindy Kaling’s Netflix series Never Have I Ever.³⁴ Therefore, the question we must now be asking is what legal solutions can be utilized to ensure that Mitte and Moore are not the “token” actors with disabilities, but instead just actors. And are those legal solutions alone enough?

ACT II. LEGAL BACKGROUND

Scene 1: The Americans with Disabilities Act

One lens through which to approach the problem returns our attention to the ADA. Signed into law in 1990,³⁵ and amended in 2008 to provide broader protections for people with disabilities,³⁶ the ADA provides for protection of individuals with “a physical or mental impairment that substantially limits one or more major life activities.”³⁷ The ADA has seen much success over the years: it has empowered people with disabilities to become their own best advocates³⁸ and has modernized our built environment to promote physical accessibility.³⁹ Specifically relevant here, though, is Title I of the ADA, which concerns employment discrimination.⁴⁰

Title I of the ADA prohibits employers (including theaters)⁴¹ from “discriminat[ing] against a qualified individual on the basis of disability in regard to job application procedures, the hiring, advancement, or discharge of employees, employee compensation, job training, and other terms, conditions, and privileges of employment.”⁴² Thus, discrimination against people with disabilities in the workplace is already prohibited by law in many circumstances, according to the ADA. There are, however, two distinct points that illustrate how far we have left to go and how far short the ADA has fallen in getting us there.

First, although the ADA requires an employer to make “reasonable accommodations” for employees with disabilities,⁴³ Congress did not give a clear-cut definition of what exactly counts as a “reasonable accommodation.”⁴⁴ Instead, Congress provided examples of accommodations that could be implemented to enable “qualified individual[s]” with disabilities to perform the “essential functions” of their jobs.⁴⁵ There are no statutory limitations—financial, quantitative, or otherwise—on what constitutes a “reasonable accommodation,” other than that an accommodation that would cause an employer’s business “undue hardship” is not “reasonable.”⁴⁶ Similarly, Congress did not provide further instruction on how to determine what constitutes “undue hardship.”

This lack of guidance from Congress means that implementing the ADA can easily “become a checklist of what is or isn’t provided.”⁴⁷ In other words, it can become the “absolute minimum you can do to avoid looking like a jerk”⁴⁸ or exposing yourself to liability. Sure, you may have a wheelchair ramp in place, but does that really work to make the actors in need of the accommodation feel welcome and unburdened in their artistic journey?

This Note argues that a wheelchair ramp here and there is not enough. Instead, for actors to truly feel welcomed into the space and able to practice their craft uninhibited, the theater must ask itself questions such as, “Are we putting an extra burden on our artists with disabilities by requiring them to perform while simultaneously navigating a world that is not built for them?” and “How are we ensuring that we are hiring actors with disabilities in the first place?”

Second, while enforcing the ADA may help to ease the strain disproportionately placed on the small group of actors with disabilities who have already made their way into the rehearsal hall, what about those who have yet to be cast? Able-bodied actors are routinely cast in roles portraying people with disabilities,⁴⁹ which diminishes the number of roles available for actors with those disabilities. Further, it often “simply never occur[s]” to casting directors “to cast, or even consider, actors with disabilities in roles that don’t specify whether a character is disabled or not.”⁵⁰

Even though we are taking steps towards creating a more inclusive culture, it does appear as though we are nonetheless collectively excluding people with disabilities from that equity-driven vision of our society—even with the assistance of the ADA. So, if the ADA as it currently operates does not seem fit to truly improve diversity onstage, are there other potential legal routes?

Scene 2: Title VII of the Civil Rights Act of 1964

When it comes to the world of preventing discrimination in employment, Title VII of the Civil Rights Act of 1964⁵¹ is undoubtedly the star of the show. Since the ADA may not, on its own, provide a way to ensure that more actors with disabilities get onstage, it is worth exploring another relevant legal avenue: employment discrimination law governed by Title VII. Congress formulated this broad new civil rights bill in 1963 and took final steps towards securing the bill’s passage in 1964.⁵² Title VII notably included language banning employment discrimination because of a person’s “race, color, religion, sex, or national origin.”⁵³ While Title VII does not apply to disability discrimination, it provides some guidance as to how the ADA might be amended to address the issues discussed here.

The basic structure of a case alleging individual disparate treatment (also known as intentional discrimination) in one of the above categories has been crafted over time through case law by the Supreme Court. The so-called “burden-shifting” structure that has been created is set forth in the pivotal case of McDonnell Douglas Corp. v. Green.⁵⁴ First, a plaintiff who alleges disparate treatment under section 703(a)(1) of Title VII “because of such individual’s race, color, religion, sex, or national origin”⁵⁵ must prove their prima facie case that (1) they do indeed fall into one of those categories, (2) they applied for a job and were qualified, and (3) they were rejected by the employer.⁵⁶ Next, the employer has the chance to bring to light any “legitimate, nondiscriminatory reason” for having rejected the employee.⁵⁷ If the employer can do so, the burden shifts back to the plaintiff, who has an opportunity to prove that the “legitimate, nondiscriminatory reason” given by the employer was “pretext” for what in truth amounts to discriminatory animus.⁵⁸

Integral to this Note, however, is the language highlighted in section 703(e) of Title VII that an employer may protect itself from liability by presenting a particular affirmative defense.⁵⁹ The essence of this defense is that the employer asserts that it rightfully, and therefore legally, discriminated against this job applicant. The employer can do this by showing that it discriminated because of “religion, sex, or national origin”⁶⁰ if it can also show that “religion, sex, or national origin is a bona fide occupational qualification reasonably necessary to the normal operation of that particular business.”⁶¹

This exception to the general rule, which is known as a “bona fide occupational qualification” (typically referred to as a “BFOQ”),⁶² can sometimes be used by employers to legally justify certain discrimination in hiring practices if that discrimination is based on religion, sex, or national origin. For example, in Dothard v. Rawlinson, an all-male prison asserted that it would be unsafe for women to become guards in their prisons.⁶³ Female job applicants hoping to become guards then sued the prison, claiming that they were not hired because of their sex.⁶⁴ The Supreme Court took the side of the prison, holding that while the applicants’ sex was the reason they were not hired, this discrimination was legal due to the BFOQ exception.⁶⁵ In other words, the prison was allowed to reject female applicants because of their sex due to the fact that having male guards was “reasonably necessary to the normal operation of that particular business.”⁶⁶

Conversely, in UAW v. Johnson Controls, Inc., the Supreme Court refused to grant an employer the use of a BFOQ.⁶⁷ Johnson Controls stated that it would not allow women to work in certain jobs at its manufacturing plant that involved lead exposure, citing an interest in preserving the women’s fertility.⁶⁸ In essence, Johnson Controls was asserting that being a man was a BFOQ that was required in order to get the job.⁶⁹ Here, the Supreme Court interpreted the BFOQ exception narrowly by ruling that the amorphous danger of harm to female employees’ fertility is not an appropriate use of the exception and that female employees who were qualified for the job could not be turned away simply on the basis of their sex.⁷⁰

As seen in Johnson Controls above, the BFOQ is not a free pass to discriminate against job applicants however an employer sees fit; Congress created the BFOQ exception to be used narrowly and “the courts have construed it as such.”⁷¹ It is not unreasonable, however, to imagine a scenario in which this affirmative defense could actually be used to benefit a particular group of job applicants. Consider a scenario in which an employer wants to have only Senegalese chefs work at a Senegalese restaurant, with the stated goal of “authenticity.” Here, the employer could use a national origin BFOQ to justify this hiring practice, with the end result being that a minority group (Senegalese chefs) gains greater access to job opportunities they otherwise may not have had. While perhaps counterintuitive, this Note will propose the use of a BFOQ not simply as a way to shield an employer from liability, but also as a way to encourage diversity in the hiring process.

Scene 3: Threshold Question—Employee or Independent Contractor?

It must be noted going forward that applying the ADA and Title VII to workers hinges on the workers’ classification as “employees,” as opposed to “independent contractors,” because the ADA and Title VII do not cover independent contractors.⁷² So what is the difference? The ADA and Title VII both provide the following definition of “employee”: “[A]n individual employed by an employer.”⁷³ Since that definition is not particularly elucidating, courts have often looked to the common law of agency for a less circular definition.⁷⁴ Among other factors, the Restatement (Second) of Agency defines an employee as the “servant” of an employer (the “master”).⁷⁵ This relationship is said to be formed when the master gains control over the servant’s performance of a service, and, in particular, when the master gains the right to control the “physical conduct” of the servant.⁷⁶ Conversely, then, an independent contractor is a worker whose physical conduct and general performance are not under the complete control of the master.⁷⁷

Many theaters officially classify the actors they hire as independent contractors, often primarily in order to take advantage of related tax benefits and to circumvent paying minimum wages, overtime, and workers’ compensation.⁷⁸ The argument theaters provide for this practice is that actors are temporary workers, typically only hired to perform in one show at a time, and that therefore being an actor is more akin to being a part of the “gig economy”⁷⁹ than being a part of a typical workplace. Theaters in this camp tend to paint a picture of their actors not as their so-called “servants” whose physical conduct they control, but instead as transient workers whose job is simply to put on a performance.

In reality, however, there is so much more to an actor’s responsibilities and interactions with a director. While actors may have moments of free decision-making throughout the process of preparing (“blocking”) a play, almost everything comes down to what the artistic director envisions. This is really an employee-employer relationship where the employer has full control over not only when and where rehearsals are held, but ultimately full control concerning when, where, and how an actor portrays their part.

Though employee classification is crucial for actors—as well as employees writ large—to achieve better legal protections, a deeper exploration of the distinction between employees and independent contractors and the implications of this divide for employment equity, particularly in the context of theater, is beyond the scope of this Note.⁸⁰ Thus, the remainder of this Note will assume for the sake of argument that actors are classified as employees, not as independent contractors. This classification allows for their protection by the ADA and Title VII.

ACT III. LEGAL REMEDY NO. 1: CREATING A NEW BFOQ

Scene 1: Creating a Race or Color BFOQ

Notably missing from the list of categories that can be used to assert a BFOQ defense⁸¹ are race, color, and disability. Over the past few decades, the bulk of relevant scholarship has focused on reasons Congress specifically did not include race or color as possible BFOQs.⁸² Relatedly, scholars have started to ask whether Congress erred in this omission, and some even go so far as to champion adding a race or color BFOQ.⁸³

More specifically, this question about a race or color BFOQ has recently been explored in the context of entertainment.⁸⁴ Do historically marginalized actors lack opportunities as a “result of illegal discrimination by the theater industry,”⁸⁵ or is it instead a product of artistic freedom and sound business decisions? Should the issue be relegated to the realm of First Amendment jurisprudence?⁸⁶

Legal scholars have often approached this question by looking at language used by the Equal Employment Opportunity Commission (“EEOC”).⁸⁷ The EEOC’s regulations⁸⁸ mention that a gender BFOQ could theoretically exist for hiring actors if deemed necessary for a play’s authenticity: “Where it is necessary for the purpose of authenticity or genuineness, the Commission will consider sex to be a bona fide occupational qualification, e.g., an actor or actress.”⁸⁹ The EEOC has thus explicitly “recognized that the entertainment industry is one place where discrimination might be necessary.”⁹⁰

The fact that use of a BFOQ has been considered by the EEOC as potentially useful (and lawful) in an entertainment context gives credence to the idea that it is permissible to legally discriminate, through the use of a BFOQ, in order to preserve a play’s primary functions of storytelling and authenticity.⁹¹ Therefore, “it seems reasonable to assume that where the characters are race-specific, race is a job requirement, and hence, should be a BFOQ exception.”⁹²

Scene 2: Creating a Disability BFOQ

So, could a disability BFOQ similarly be added to the ADA? The idea is not without precedent, at least in the realm of some states’ local laws. For instance, the Administrative Rules of Montana state that an employer may use a BFOQ “where the reasonable demands of a position require a distinction based on . . . physical or mental disability.”⁹³

But the question remains: Is the addition of a disability BFOQ really enough to make a difference? Or would it just perpetuate the status quo of allowing employers/artistic directors to keep employees/actors with disabilities off the stage? According to the University of Southern California Annenberg Inclusion Initiative, only 2.7% of characters with speaking roles in a survey of 900 popular movies from 2007 to 2016 were characters portrayed with a disability.⁹⁴ Assuming the trend holds true across the sister industries of stage and screen, these statistics show that a disability BFOQ probably could not effectuate all that much change. If only around 2–3% of characters are written to have disabilities, even if a majority of directors cast those roles with actors who have disabilities, we would have at most a 3% increase in the number of actors with disabilities getting cast. And there is no guarantee that any directors would even opt to utilize the disability BFOQ. Thus, the most progress a disability BFOQ could make would likely be marginal at best.

Furthermore, creating a disability BFOQ opens the door to possible misuse and abuse by employers. Indeed, use of a BFOQ, though it can be
co-opted for the benefit of a group of employees, is usually seen as an employer-friendly tactic. For example, an employer who does not want to hire actors with disabilities could use the BFOQ as a shield, asserting that such an actor with a disability could not serve “essential functions”⁹⁵ (such as deft movement across the stage) required of the job.

Scholarship at the forefront of this conversation seems to overwhelmingly come to the same conclusion: “[T]he fear that employers could misuse a generally applicable . . . BFOQ to shield invidious . . . discrimination is too great to warrant the enactment of such a provision.”⁹⁶ Given these potential setbacks, it becomes necessary to look at what other remedial legal options remain.

ACT IV. LEGAL REMEDY NO. 2: AFFIRMATIVE ACTION

Scene 1: Background

The concept of affirmative action, created during the civil rights movement in the United States, derives from a “paradox,” namely that “[o]nce we amended the Constitution and passed laws to protect people of color from being treated differently in ways that were harmful to them, the government had trouble enacting programs that treat people of color differently in ways that might be beneficial.”⁹⁷ We face a similar problem with regard to disabilities, in that in employment discrimination law’s noble effort to level the playing field, we must fight to create ways to treat people with disabilities that “might be beneficial”⁹⁸ as well.

From a statistical standpoint, affirmative action for race actually resulted in some of its intended effect; the years between 1974 and 1980 saw a 20% increase in the rate of minority employment in businesses relying on affirmative action (as compared to an increase of only 12% in companies without affirmative action plans in place).⁹⁹ Furthermore, there is still room for the affirmative action model to change over time, as “[t]here is no Brown v. Board of Education . . . for affirmative action, no well-established precedent.”¹⁰⁰ Thus, the door is left ajar for a new movement in which we use affirmative action tactics to make sure that more actors with disabilities are not only getting into the audition room, but also getting cast.

While decades of proof show that affirmative action has led to success, specifically in the context of school desegregation,¹⁰¹ the concept also comes with quite a bit of baggage.¹⁰² Scholars and laypeople alike have been arguing for years over whether “affirmative action for racial minorities disadvantages white people by virtue of their race.”¹⁰³ It is likely that this same argument would surface regarding whether affirmative action in the context of casting actors with disabilities disadvantages able-bodied actors. To this point, however, although there may be winners and losers in affirmative action, it has been determined that the practice is occasionally justified nevertheless.¹⁰⁴

In United Steelworkers v. Weber, the Supreme Court created precedent that some affirmative action regimes are, in fact, justified, and it laid out a test dictating when these regimes are constitutional.¹⁰⁵ While the Court’s opinion is perhaps not particularly clear in terms of where to draw that line,¹⁰⁶ it does provide us with a set of loose guidelines. In order for a plan to fall on the permissible side of that line, it must (1) be “designed to break down old patterns of . . . segregation and hierarchy,” (2) “not unnecessarily trammel the interests of” other employees or applicants, and (3) be a “temporary measure.”¹⁰⁷

These guidelines, specifically designed to apply to affirmative action in regard to racial segregation and discrimination, could easily be adapted to apply to disability as well. One could imagine guidelines for theater companies that (1) break down existing patterns of hierarchy in terms of casting actors without disabilities; (2) do not “unnecessarily trammel” the interests of actors without disabilities, who would retain plenty of chances to be cast; and (3) only last until such time that theaters understand and realize not only that diverse casting is a noble goal, but also that it makes sound economic sense. While this raises a different question as to how these guidelines would be implemented, as discussed below, there may actually be no need to adapt these guidelines because of the differences in statutory language between Title VII and the ADA.

Scene 2: Statutory Interpretation

Challenges to affirmative action in the context of ending racial segregation sometimes stem from a disgruntled white student who feels that a school’s admission policies are a zero-sum game (and thus, feels that their rights are being “unnecessarily trammel[ed]”).¹⁰⁸ For instance, in Fisher v. University of Texas, a white woman who was denied admission to the University of Texas sued on the grounds that the school’s admissions system was unconstitutional because it took race into account.¹⁰⁹ Ultimately, Justice Anthony Kennedy authored the close opinion in favor of the University of Texas, deciding that the university’s policy of considering race as one of a number of factors in admissions “met the standard of strict scrutiny”¹¹⁰ and was thus appropriate. While the final outcome of this case comes down on the side of the affirmative action plan being implemented by the university, it also demonstrates the very live and contentious idea that there are people who tend to feel they are being injured by affirmative action schemes at large.

The Court has maintained its belief that at least some affirmative action regimes could be unconstitutional because they “unnecessarily trammel”¹¹¹ other employees’ rights, and their authority on this matter comes from citing the text of Title VII itself: it is unlawful to “discriminate . . . because of . . . race” when hiring employees.¹¹² White students have long used this argument to say that they themselves were discriminated against because of their race (as a white person) when a Black student is admitted and there is an affirmative action regime in place at that university;¹¹³ the claim is one of “reverse racism.”¹¹⁴ Similar arguments have long been made by many white plaintiffs in the employment context: there have “recently [been] a number of headlines regarding ‘anti-white racism’ and there have been a variety of civil rights lawsuits filed by white employees . . . claiming race discrimination.”¹¹⁵

The language of the ADA, on the other hand, dictates only that a covered entity may not “discriminate against a qualified individual on the basis of disability.”¹¹⁶ The ADA itself provides limited guidance on whether an employer may or may not, for instance, discriminate in favor of a qualified individual on the basis of disability.¹¹⁷ In light of this difference in statutory language, it is possible that an affirmative action plan in the context of disability under the ADA may not even need to pass muster under the three-part Weber¹¹⁸ test described above. Further analysis of this distinction in language, although beyond the scope of this Note, is required to determine if theater companies would be within their rights to implement affirmative action regimes regarding hiring actors with disabilities.

Scene 3: Application

Given the analysis above, this Note proposes that theaters could help remedy the imbalance in casting practices by beginning to use an affirmative action model to bring more inclusivity into the casting room and onto the stage. If future analysis supports the above interpretation of the statutory text, this model does not have to live up to the Weber¹¹⁹ standards. Each theater company is unique, with its own set of structures and hierarchies already in place, so the most effective way for each individual theater company to utilize an affirmative action model would likely be best judged by the company itself. The 2020s appear to have ushered in a hunger for an increase in overall diversity,¹²⁰ and it is possible that some theaters would jump at the chance to create a scheme through which they could improve the diversity on their stages—if only because it would reflect well on the theater.

Perhaps one answer is a required training for theater companies throughout the country (likely in an online format) through which they could gain a better understanding of the necessities and risks associated with creating and implementing an affirmative action plan.¹²¹ Then, each theater company could come up with a plan that best fits its specific needs and goals. Implementation of these plans would likely require the creation of an organization to oversee these plans and establish accountability, as well as conduct periodic check-ins with each theater company to assess follow-through and commitment going forward. While this suggestion would involve significant resources (time, money, and otherwise), this Note has demonstrated how crucial it is to take affirmative steps in this arena to enact true change. Investing these resources would be a necessary first step.

However, clearly the nebulous idea of “using affirmative action in casting actors with disabilities” leaves a lot of details to be desired. Who would ensure that theaters truly implemented affirmative action measures? How would relevant statistics be tracked, given that each theater and, more granularly, each show has a completely different set of needs? What kind of penalties would be imposed if theaters chose not to follow their affirmative action plans? All of this is not to say that legal remedies would not move theater in the right direction, but given these difficult questions with no immediate answers, it seems clear that this proposed legal remedy is not enough on its own either. So, what options remain?

ACT V. A LOOK AT POTENTIAL QUASI-LEGAL AND NONLEGAL REMEDIES

Scene 1: Societal Shifts—Effects of the COVID-19 Pandemic

One force that has the potential to shift the way we as a society see entertainment and theater, and therefore theater creators, is the COVID-19 pandemic.¹²² Our society’s transition to the use of Zoom and other online platforms has greatly increased theater’s accessibility in a number of ways,¹²³ perhaps most notably in terms of the internet’s ability to transcend physical barriers and allow people from all around the world to watch a performance.¹²⁴

Additionally, many virtual productions are simply more affordable¹²⁵—both for audiences who no longer need to worry about issues such as transportation to and from the theater, costly parking, and the allure of overpriced theater snacks and drinks, and for theater companies that suddenly find themselves without the need for large, elaborate sets, accessible theaters, or a whole team of spotlight operators.

This shift has the possibility to push access for actors with disabilities in the right direction and could provide the movement with enough momentum to continue to embrace inclusivity and accessibility once we (presumably) reenter a less digital world. However, Deaf¹²⁶ theater artist Elbert Joseph has his doubts: “[O]nce we go back to being in person, are people going to be willing to continue [making theater accessible]? Because there is no more excuse.”¹²⁷ And he is right: we have now seen a digital landscape in which disability has proven to be much less of a barrier in the bid for access.¹²⁸

Writer and performer Katie Hae Leo, while acknowledging the importance of the ADA as it stands, believes that the COVID-19 pandemic has reminded society of the vulnerabilities associated with being a person with disabilities.¹²⁹ She adds that, although the pandemic may have established a precedent of creating more access for artists with disabilities, it will all be for naught unless we “codify some of those changes, and make sure that they become part of, at the very least[,] best practices and at the best, law.”¹³⁰

Now that we have seen, by way of the pandemic, that many accessibility measures are in reality quite easy to implement,¹³¹ the above legal proposals of adding a disability BFOQ to the ADA and implementing an affirmative action regime for casting actors with disabilities could come into play. When utilized in tandem with the lessons we have learned from being thrust into the virtual world during the pandemic, these legal solutions could help to create a theater landscape that is both welcoming and encouraging to theater artists with disabilities.

Additionally, while creating diversity onstage is a noble goal in and of itself, theater companies do have pure economic reasons to invest in increased representation. Looking back at theater’s sister industries, film and television, that exact understanding seems to be unfolding as the early 2020s progress. While statistics, as discussed above, show dismal rates of casting actors with disabilities over the years, both film and television have begun to make great strides in their bid for inclusivity on screen. Take, for instance, the critically acclaimed 2021 film CODA, which centers on a family with deaf adults and their hearing child.¹³² The deaf characters are all played by deaf actors,¹³³ and the story puts deafness at the heart of the viewer’s experience. The film even led to the first acting Oscar nomination (and win) ever for a deaf man, Troy Kotsur.¹³⁴ Kotsur told the New York Times via a sign language interpreter that the success of this film marks a wider understanding that we should no longer “think of deaf actors from a perspective of limitations.”¹³⁵ As film and television make these moves forward, and as theaters begin to grapple with the fact that more diverse casts could lead to more money and acclaim, hopefully theaters will begin to follow in the footsteps of their sister industries.

Scene 2: Building Upon Ongoing Diversity, Equity, and Inclusion Work

Since it appears that no one solution, legal or otherwise, is sufficient to meaningfully increase opportunities for actors with disabilities on stage, it is worth looking to other work that is already being done in the arena for inspiration. The initiatives currently taking shape, in theater and beyond, are known as Diversity, Equity, and Inclusion (“DEI”) initiatives.¹³⁶ DEI work, according to the International Labour Organization, can be responsible for an increase in innovation of up to 59% and an increase in understanding and assessment of consumer demand of up to 37%.¹³⁷

This wave of DEI work in workplaces around the country and beyond focuses on the tenets of “diversity” (the ways in which people differ from one another), “equity” (fair treatment and opportunity regardless of identity), and “inclusion” (providing a variety of people with power and decision-making authority).¹³⁸ However progressive a DEI mindset in a workplace might be, though, underrepresentation “remains a very real problem.”¹³⁹ A 2020 review of workplace diversity, for example, found that around 85% of top executives in the United States are white,¹⁴⁰ with similar statistics showing that the majority of top executives do not report having a disability.¹⁴¹

Even so, companies, including theaters, are now actively considering DEI initiatives; these initiatives tend to center on anti-racism and racial equity.¹⁴² Many theater websites boast initiatives to combat racism within their internal structures.¹⁴³ In addition to actively increasing representation of people of color in the workplace, these initiatives are shining a spotlight on the destructive effects of racism on the workplace. Imagine if this push for equity in terms of race could be harnessed and used through the lens of disability as well. This would bring awareness to the trials and tribulations of actors with disabilities, as this Note has detailed, and could help to create a society in which anti-ableism becomes central to the workplace.

Scene 3: Exploring Nontraditional Casting

Another potential route to getting more actors with disabilities on stage would be to follow the dictates of “nontraditional casting.”¹⁴⁴ Under the regime of nontraditional casting, in order to expand opportunities for
oft-overlooked actors, artists are cast in roles in which certain categories (such as gender, ethnicity, disability, and race) are not “germane to the character’s or the play’s development.”¹⁴⁵ The attempts at kickstarting nontraditional casting have been widespread; multiple major theater organizations banded together in the late 1980s to create a not-for-profit organization called the Non-Traditional Casting Project (“NTCP”).¹⁴⁶

The NTCP, as a part of its advocacy work, identified a few distinct types of nontraditional casting meant to act as “jumping-off points for the imagination,”¹⁴⁷ such as “societal casting,”¹⁴⁸ “cross-cultural casting,”¹⁴⁹ and “conceptual casting.”¹⁵⁰ These various categories are meant to serve as tools for creating opportunities for actors who may otherwise be passed over.

One further category to be addressed is “blind casting,” in which “actors are cast on the basis of their talent without regard to their physical attributes [and abilities or disabilities].”¹⁵¹ While the idea of blind casting may appear innocuous on the surface, and perhaps even look like a good solution, academic scholarship points us to the conclusion that even casting that is nondiscriminatory on its face leads to the same disparities on stage after all is said and done.¹⁵²

2020/aug/11/its-dangerous-not-to-see-race-is-colour-blind-casting-all-its-cracked-up-to-be [https://
perma.cc/ZB3H-GEWR] (quoting Diep Tran, an arts journalist specializing in diversity) (“Colour-blind casting is dangerous . . . [because] [i]t negates the very real structural hindrances that block actors of colour from the same opportunities as white actors—like low pay in the theatre industry, a lack of roles that are ethnically specific that actors of colour can play, and unconscious bias on the part of white theatres and casting directors.”). Furthermore, in the past, directors have gone so far as to use the idea of blind casting to do things such as cast white actors as characters of color, using the explanation that the white actors just happened to be best for the role.¹⁵³

Because of the potential harms of blind casting, scholars urge directors to consider “conscious casting” instead, where attributes and abilities/disabilities are taken into account to the extent that they interact with the plot lines and characters and affect the meaning of a play or movie.¹⁵⁴ Utilizing conscious casting from the nontraditional casting canon may prove another useful tool in the casting toolbox. However, the distinction between casting “blindly” and “consciously” is not always straightforward and still allows for a well-meaning director to make a blunder by casting actors in a way that sets forth an unintentional message.¹⁵⁵

Even so, conscious casting can and should be used in the context of casting actors with disabilities. Conscious casting could even be combined with the affirmative action plan discussed above; this could open the door to actors with disabilities not only playing characters with disabilities, but
able-bodied characters as well. Not only would this provide more job opportunities to actors with disabilities, but it would also allow directors to make purposeful statements through their casting about how our society views, and should or should not view, people with disabilities.

Making conscious casting an industry standard would signal to artistic and casting directors alike that diversity on stage could be a meaningful enhancement to their repertoire and the messages conveyed, and, as such, should be taken into account. It is true that some baggage might come along with this approach: it could require extra auditions to be held, extra outreach into various underrepresented communities, and extra thought put into how casting each actor affects how the play comes across to the audience.¹⁵⁶ Given the dramatic loss of talent caused by excluding actors with disabilities, however, this Note argues that the potential for positive outcomes far outweighs the baggage.

CURTAIN CALL: CONCLUSION

At the end of the day, representation on stage can (and should) inspire new generations of both activists and actors, but it appears as though there is no single legal solution that will be able to ensure or enforce that representation. Instead, if we hope that “[o]ne day, every American theatre will be a safe, equitable, and inclusive workplace filled with arts practitioners who represent and reflect the wonderful diversity of the human tapestry,”¹⁵⁷ we will need to source solutions from within the legal field as well as beyond.

This Note does not, by any means, cover the breadth of issues and possibilities left to be discovered and discussed in terms of getting better representation on theater stages. For instance, studies that have thus far been done about disability in film and television should be replicated for the stage in order to give us a more accurate picture of the issue as it applies to stage actors.¹⁵⁸ Also, further research beyond the scope of this Note may yield other creative and effective legal and nonlegal tactics that can be used to not only increase diversity onstage, but also to maintain it.

It is hopefully clear by now that there is a problem in the theater world that needs to be addressed. Not enough actors with disabilities are getting employed—or even getting the chance to prove that they should be employed. This issue has negative effects all around. Of course, it impacts actors with disabilities by lessening their opportunities to practice their craft. But it also affects society at large in a number of ways; representation of disabilities on stage can lead to a feeling of “belonging” for many people who have so often felt sidelined, and the art that gets created becomes more inclusive and authentic overall.

It should also be clear by now that there is not yet a simple solution to the above problem, in the law or in society. This cannot dissuade us, however, from fighting to ensure that actors with disabilities have the opportunity to perform on stage. It appears as though it will take a conglomeration of methods: the creation of a disability BFOQ; affirmative action based on disability; monetary and business incentives; ongoing DEI work; and conscious casting could all be pieces of the as yet unsolved puzzle. And while we are still missing puzzle pieces, we should begin by working with the methods we already have.

This Note has presented potential legal avenues for addressing the lack of opportunities for actors with disabilities in the theater industry and has concluded that using the law as a vehicle for improving the odds for these actors is probably not enough. Either way, casting more actors with disabilities is an issue that clearly requires immediate attention. After all, when it comes to the heart of the reason that all of this research and discussion is necessary in the first place, actress Ali Stroker put it best in her Tony Award acceptance speech: “This award is for every kid watching tonight who has a disability, who has a limitation or a challenge, who has been waiting to see themselves represented in this arena,” she said.¹⁵⁹ “You are.”¹⁶⁰

96 S. Cal. L. Rev. 483

Download

Kira Patterson*

* Senior Editor, Southern California Law Review, Volume 96; J.D. Candidate 2023, University of Southern California Gould School of Law; B.A. Drama and Psychology 2015, Tufts University. Thank you to my supportive, loving, wonderful friends and family for having my back throughout law school. Special thanks to my advisor, Dr. Orly Rachmilovitz, for her guidance during the note-writing process, and a final thank you to my mentors (also known as my parents), Barbara and Patrick Patterson, for inspiring me every day. This Note is dedicated to the memory of colleague and friend Jenny Lin.

Race and Politics: The Problem of Entanglement in Gerrymandering Cases

April 2023April 2025Stephen Menendian*

Gerrymandering—the manipulation of political districting processes and boundaries for partisan political advantage—has proven a troubling and difficult area of constitutional concern. This is partly due to the exceptionally divergent standards of judicial review applicable depending upon the basis for the gerrymander claim. The Supreme Court has consistently held that racial gerrymanders are subject to strict scrutiny review and presumptively violate the Equal Protection Clause of the Fourteenth Amendment. The Court has recently declared that partisan gerrymanders, on the other hand, are a political question and non-justiciable.

This Article argues that current guidance from the Supreme Court on standards for evaluating gerrymandering claims is inadequate to guard against constitutional violations because of the problem of entanglement: the race and partisan preferences of voters are so deeply intertwined in many contexts that it is practically impossible to discern whether race or partisanship was the basis for political districting decisions. The entanglement of race and politics in political districting processes means that there is a dangerous risk that unconstitutional racial gerrymanders will escape judicial review under the cover of partisanship.

This Article explicates the problem of entanglement in gerrymandering cases and evaluates several possible solutions. Presenting original research drawn from the 2020 decennial census and voter data from the 2020 presidential election, this Article establishes an empirical basis for the problem of entanglement. Although prior legal scholarship has emphasized the problem of “conjoined polarization”—the overlap in partisan and racial preferences—as an enabling factor in partisan redistricting processes, this Article claims that racial residential segregation plays a more central and dynamic role than has generally been acknowledged in undergirding the entanglement of race and politics in political redistricting processes.

INTRODUCTION

The manipulation of political districting processes for political advantage—popularly known as gerrymandering—has long bedeviled the United States,¹ but concerns about the abuse of this practice have intensified in recent decades due to a confluence of factors: intensifying partisan political polarization, widening racial political polarization, the use of detailed voter files to predict voting behavior, the emergence of sophisticated computer technology to generate ever-more precise political maps, and a sharp divergence in the Supreme Court’s jurisprudence governing different forms of this practice.

In reviewing suits brought to challenge gerrymandering practices, the Supreme Court has held that state legislative efforts to draw political districts based on race violate the Equal Protection Clause, a natural extension of the Court’s general prohibition on the use of racial classifications in policymaking.² On the other hand, the Supreme Court has held that legislative efforts to draw political districts based upon partisanship or for partisan political advantage are “political questions” and non-justiciable.³

In this regard, these two forms of gerrymandering are treated in the utmost extreme: racial gerrymandering is subject to the highest level of judicial scrutiny while partisan political gerrymandering is treated as non-justiciable, meaning not that it is subject to the lowest level of judicial review, rational basis review, but that the practice is deemed unsuitable for judicial review at all. Racial gerrymanders are subject to strict scrutiny judicial review whereas partisan political gerrymanders are not subject to judicial review whatsoever.⁴

The Supreme Court’s broader equal protection clause jurisprudence supplies a basis for treating these two types of claims differently. Prevailing equal protection jurisprudence treats race as a “suspect” class in government policymaking subject to strict scrutiny review, while most other classifications are reviewed under a rational basis test.⁵ But strict adherence to this approach would compel a very different result than the determination that partisan gerrymanders are non-justiciable. Lower courts would still be able to entertain such cases, just under a much lower level of review, rational basis.

If there were no relationship between race and partisanship in voting patterns, then political gerrymanders and racial gerrymanders could be regarded as separate and distinct categories and there would be no logical inconsistency in a jurisprudence that regulated one but not the other. Partisan gerrymanders would have no observable racial effect, or vice versa. In practice, however, race has long been highly correlated with partisan political affiliation.⁶ Although racial political polarization waxes and wanes over time, it is strong enough that a jurisprudence of gerrymandering cannot neatly divide the two types.

The Court’s racial gerrymandering jurisprudence makes clear that sorting voters into separate political districts on the basis of race is unconstitutional, just as it is presumptively unconstitutional to sort pupils into different schools on the basis of race.⁷ In racially diverse states with racially polarized voting patterns and merely modest levels of racial residential segregation, however, it is likely that partisan gerrymandering will effectively sort people into different districts on a racial basis. In much of the country, race and partisanship are entangled, such that redistricting efforts on one basis are largely indistinguishable from the other. As a consequence, unregulated partisan gerrymanders have a dangerous potential to subvert the constitutional rule against racial gerrymandering.

Although political scientists have long recognized the correlation of race and partisan affiliation (what political scientists term “conjoined polarization”),⁸ prior analysis of gerrymandering jurisprudence has underexamined the specific role of racial residential segregation in facilitating the entanglement of race and politics in redistricting processes. In recent legal scholarship analyzing this problem, segregation is either completely absent from the discussion, mentioned in passing, or is treated as an assumed operative background condition.⁹

This Article argues that it is the interaction of racial residential segregation and racial political polarization that creates the entanglement problem in redistricting processes, not merely “conjoined polarization” by itself.¹⁰ Where the level of racial residential segregation is higher, the entanglement of race and politics in districting processes is likely to be greater, not only because of the geographic concentrations of people that facilitate political district line-drawing, but also because regions with higher levels of racial residential segregation have both greater racial political polarization and partisan political polarization.

This Article presents original analysis of the 2020 presidential election results and 2020 census data to demonstrate that racial segregation and partisan segregation are strongly correlated. Moreover, regions with higher levels of racial residential segregation appear to have higher levels of partisan polarization. As a result, partisan gerrymanders in those regions are likely to result in the segregation of voters into different political districts on the basis of race and vice versa.

Part I provides a brief history of gerrymandering, including the types and forms of political districts that were historically practiced. Political districts were far more varied in the early years of the republic than is generally appreciated or understood today. More importantly, Part II notes that although gerrymandering practice can be traced to the early decades of the republic, efforts to curb it also extend back into the nineteenth century. Standards and norms for democratic practice have improved and evolved since the framing of the Constitution, laying the groundwork for particularized claims brought to challenge this practice.

Part II compares racial gerrymandering and partisan political gerrymandering cases, rulings, and reasoning. It analyzes points of divergence and convergence between the two lines of cases. The partisan and racial gerrymandering cases germinate from the same seed and the same soil but have produced extremely divergent results in the body of the Supreme Court’s precedent governing these cases. This creates a problem in cases brought that challenge redistricting where race and partisan affiliation are largely co-extensive. In such cases, racial gerrymandering could escape judicial scrutiny under the cover of partisanship.

Part III explicates the entanglement problem, that purely partisan redistricting maps are in many cases objectively indistinguishable from redistricting maps that explicitly use race. The key components of this problem are racial political polarization and racial residential segregation. When these factors coincide, partisan gerrymandering is likely to sort people into different districts on a racial basis. Part III also shows that racial residential segregation plays a larger role than is generally appreciated in both racial and partisan gerrymandering processes. It presents original and other recent empirical research suggesting that regions with higher levels of racial residential segregation have both more racial political polarization and political segregation.

Part IV reviews three possible ways to address the entanglement problem in terms of current constitutional law and text, weighing the merits of each. First, any hybrid gerrymandering case in which race appears to play a significant role but is co-extensive with partisanship could be categorically exempted from judicial review if the state raises such a defense. This approach is not a functional solution because it would formalize a loophole for subverting the Constitution as long as racial gerrymanders are clothed in the guise of partisanship.

Second, any case where race and partisanship are co-extensive could instead be drawn within the racial gerrymandering line and held to strict scrutiny review, even though race cannot be said to “predominate.” This approach would better align with the Court’s broader anti-classification jurisprudence but would require adjustments to the standards applicable to racial gerrymandering cases.

Finally, the Court could reverse its judgment that partisan gerrymanders are non-justiciable. The Court only recently gathered a majority of Justices in support of that view. It could reverse course and direct lower courts to review such claims under a lower standard of review within the equal protection jurisprudence or some other constitutional provision or basis altogether. In this regard, Part IV makes the case for revisiting the Court’s Guarantee Clause jurisprudence based upon principles and concerns articulated by the framers of the Constitution.

I. A BRIEF HISTORY OF GERRYMANDERING

Although the United States was still a young nation at the time of the ratification of the Constitution in 1787, the framers already enjoyed decades of cumulative experience with democratic political processes, including political districting, based upon the collective experiments already underway in the various states since the Revolution.¹¹ The Federalist Papers, for example, note political districts of varying size and composition both within and between states as a matter of fact.

In Federalist No. 57, James Madison observes that different sized political districts contribute to both the federal and state legislatures: “The city of Philadelphia is supposed to contain between fifty and sixty thousand souls. It will therefore form nearly two districts for the choice of federal representatives. It forms, however, but one county, in which every elector votes for each of its representatives in the State legislature.”¹² Thus, Pennsylvania’s county-districting system for electing state legislators necessarily resulted in large population disparities between political districts in that state at the time of the adoption of the Constitution. But in Federalist No. 61, Alexander Hamilton notes that although the New York State Assembly is drawn from counties, the New York State Senate is drawn from districts composed of two to six counties apiece.¹³

While acknowledging the existence of population disparities between political districts (and implicitly, the existence of inequities in political representation), both Madison and Hamilton unequivocally maintain throughout The Federalist Papers that the principle of majoritarianism—that the majority should prevail—is the fundamental basis of free government and republican government.¹⁴ In Federalist No. 58, for instance, Madison asserts that “the fundamental principle of free government” is that the “majority would rule.”¹⁵ In that context, he was writing against the suggestion made by critics of the proposed Constitution that supermajorities should be required for either a quorum or a decision (such as passing a law) in the House of Representatives. As he explains: “In all cases where justice or the general good might require new laws to be passed, or active measures to be pursued, the fundamental principle of free government would be reversed. It would be no longer the majority that would rule: the power would be transferred to the minority.”¹⁶

Indeed, this is one of the chief objections the framers had with the Articles of Confederation, which gave equal suffrage to each state in the federal legislature (unlike the Constitution, which does so only in one legislative chamber, the Senate).¹⁷ Prior to the constitutional convention in Philadelphia in the summer of 1787 where the Constitution was hammered out, Madison privately wrote to Thomas Jefferson expressing his hopes for systemic changes to the federal government. Chief among these concerns was converting from a system in which each state receives equal voting power in Congress to a system of representation based upon population.¹⁸

Hamilton firmly agreed. In Federalist No. 22, Hamilton maintains that “the fundamental maxim of republican government . . . requires that the sense of the majority should prevail.”¹⁹ Therefore, in his view, Every idea of proportion and every rule of fair representation conspire to condemn a principle, which gives to Rhode Island an equal weight in the scale of power with Massachusetts, or Connecticut, or New York; and to Deleware [sic] an equal voice in the national deliberations with Pennsylvania or Virginia, or North Carolina.²⁰

In a powerful and eloquent denunciation of the principle of equal suffrage between states, Hamilton goes on to develop the argument on the “impropriety of an equal vote between States of the most unequal dimensions and populousness” in various ways.²¹

This argument, however, and all of the reasoning developed in support of it, would appear to have equal force against political districts within states of “most unequal dimensions and populousness.” Indeed, in Federalist No. 46, Madison asserts that “[e]very one knows that a great proportion of the errors committed by the State legislatures proceeds from the disposition of the members to sacrifice the comprehensive and permanent interest of the state, to the particular and separate views of the counties or districts in which they reside.”²²

How can the principles, reasoning, and keen insights developed by Hamilton in pushing for more proportional representation between states in the federal government be reconciled with the apparent lack of concern with unevenly populated political districts within states or the absence of an explicit mechanism regulating it? The answer is not clear. It may have been an oversight. There were many weighty matters that preoccupied the Constitutional Convention, and the issue of unequal political districts within states may not have been a topline concern. Or, if it were a serious concern, perhaps any concerned framers were either outnumbered by those who were not or sensed efforts to regulate it were either impracticable or not a winnable issue. Despite their reputation, especially Madison’s, as “author” of the Constitution, historians have noted that most of the proposals Madison or Hamilton introduced or supported at the convention were defeated.²³

Or perhaps they assumed that unequal populations across political districts within states, to the extent that they were found, would exist within tolerable limits, or would not result in the vast and strikingly unequal representation of political interests to the same extent found when small states enjoyed the same voting power as the larger states. After all, the proponents of the Constitution were largely concentrating their reform efforts on addressing defects in the experience of government under the Articles of Confederation and the paralysis that resulted from allowing small states to block legislation necessary to advancing the interests of the nation.

Yet another possibility is that they believed that a separate and sufficient mechanism existed for addressing the districting problem. As an example of the latter, perhaps they believed that Article I, Section 4, allowing Congress to alter state electoral rules in federal elections, would suffice to remedy any particularly egregious or extreme case that might arise within a state.²⁴ This possibility is purely speculative given that none of the three lengthy Federalist Papers (59–61) dedicated to defending the inclusion of this provision mention the composition, size, or population of political districts. In any case, this provision quickly fell into disuse, because it would be decades before Congress passed a law under this authority.²⁵

Regardless of the framers’ concern—or lack thereof—for the issue of inequities between political districts within states, state legislatures took full advantage of the maneuvering room granted them by the Constitution’s silence on this matter. In one of the most notorious instances of abuse of this power, the Massachusetts legislature passed a redistricting law in 1812 designed to minimize the political power of the Federalist Party in the next election by concentrating Federalist voters into a small number of districts while spreading Republican voters into a wider range of districts.²⁶ The plan worked because the Republican party won twenty-nine seats compared to eleven for the Federalist Party, despite winning only 49% of the vote.²⁷ The map, however, so conspicuously divided up the Boston region in an unnatural manner that critics likened the shape to a salamander. Governor Elbridge Gerry, a leading proponent of the plan, lent his name to political history when a political cartoonist dubbed the plan a “Gerry-mander: A new species of Monster.”²⁸

The type and form of political district found in the early republic was more diverse and less uniform than those that exist today. Not only were political districts of unequal population and dimension regularly employed, and the manipulation of those districts common to the extent of political tolerance, but the form and type of district were not nearly as uniform as is the case today.²⁹ All districts organized for electing members to the House of Representatives today are what political scientists call “single-member plurality” districts.³⁰ This means that each district elects a single member, and that member is elected by a plurality of the vote (winning more votes than any other candidate—also called “first past the post”). This was not, however, the case in the early years of the republic. A variety of district types co-existed, from multi-member to at-large districts. Many of these district types were designed to maximize or entrench partisan political power.

The Constitution neither prescribes nor prohibits particular types of districts or methods of electing representatives, aside from the requirement that voting qualifications be the same as those employed for the most “numerous branch of the state legislature.”³¹ It only requires a certain number of representatives for each state based upon relative population. Consequently, most of the original thirteen states used multi-member districts in the first congressional elections.³²

Between 20% and 44% of House members were elected from multi-member districts until the Twenty-Eighth Congress.³³ This means that each district elected more than one representative. They did not use proportional representation systems, as is common to most modern parliamentary democracies, as such systems had not yet been developed. Instead, they gave some districts greater political representation relative to others. Later, some states used “at-large” voting, meaning that House members were elected in some states by a vote of the entire state, as United States senators are elected today.³⁴

As is true of many aspects of our political system and institutions, there was a gradual trend toward greater uniformity. In 1842, Congress passed the first of a series of laws, generally known as “Apportionment Acts,” which outlawed at-large, statewide House districts under its Article I, Section 4 authority.³⁵ Although ostensibly aimed at giving political minorities within states more opportunity to elect members to Congress, it also had the effect of outlawing multi-member districts, not just at-large systems. There were serious doubts about the constitutionality of such laws,³⁶ and at least a few states continued to use at-large systems in violation of the law.

It was not until an apportionment act in 1872 that Congress added that districts should not only be geographically contiguous and single-member, but also that they should contain “as nearly as practicable an equal number of inhabitants.”³⁷ This requirement was reiterated in similar subsequent enactments, although it was not yet a constitutional principle.³⁸

In 1967, Congress passed another law prohibiting multi-member and at-large districts (in states with more than one representative) based on concerns that southern states might resort to at-large, statewide systems in response to the Voting Rights Act of 1965.³⁹ Both multi-member and at-large districts could be used to dilute Black voting strength in southern states.⁴⁰

The key developments, however, were in the courts of the 1960s. In 1962, the Supreme Court ruled that political districting processes were “justiciable” and could be reviewed by courts in the case of Baker v. Carr.⁴¹ Two years later, the Supreme Court ruled that political districts should approximate equal population and announced the principle of “one person, one vote.”⁴² Unequal political districts undermined this principle. If districts could be devised of unequal size, then some people enjoy greater electoral influence, and their votes might count more than others. This legal principle has been serially re-affirmed and strengthened such that the Court has struck down districting laws drawing districts with deviations of less than 1% in population between them.⁴³ Permitting suits challenging inequities and disparities in the design of political districts opened the door for the challenges to partisan political and racial gerrymandering processes.

II. THE GERRYMANDERING CASES

This Part of the Article will briefly review the major challenges to racial gerrymandering and partisan political gerrymandering reviewed by the Supreme Court and conclude with some comparative observations and analysis.

A. RACIAL GERRYMANDERING

The first notable racial gerrymandering case actually precedes Baker v. Carr. In Gomillion v. Lightfoot, the Supreme Court considered a challenge to a redistricting plan in Alabama that would have rendered the city of Tuskegee a twenty-eight-sided political district for no perceptible reason other than to disenfranchise the town’s Black population, which lived virtually exclusively in the districts outside the newly drawn city boundaries.⁴⁴ The Court held that complaints alleging racial gerrymandering of municipal boundaries were cognizable under the Equal Protection Clause of the Fourteenth Amendment.⁴⁵

A decade later, in White v. Regester, the Supreme Court affirmed a lower court ruling that a redistricting plan adopted in Texas had elements designed to exclude Mexican-Americans from electing representatives in the state legislature through the employment of multi-member districts.⁴⁶ The Court, however, rejected a similar claim involving multi-member districts in Indiana that had a disparate effect on Black voters in racially segregated urban neighborhoods.⁴⁷ These cases, however, did not involve the drawing of districts so much as the type of district.

Racial gerrymandering claims received a significant boost in a series of cases considered by the Supreme Court after the 1990 census, beginning with the landmark case of Shaw v. Reno, in which the Court first recognized this claim as such in the drawing of district lines.⁴⁸ After the 1990 census, North Carolina was awarded an additional congressional seat. The state legislature’s initial apportionment plan was rejected by the Department of Justice under the preclearance provision of the Voting Rights Act (“VRA”).⁴⁹ A revised VRA-compliant plan created a second majority-Black district. Five white North Carolina residents sued, arguing that the redistricting plan violated the Equal Protection Clause of the Fourteenth Amendment. Specifically, the residents claimed that the state engaged in an “unconstitutional racial gerrymander.”⁵⁰

In an opinion authored by Justice Sandra Day O’Connor, the Court articulated several principles that helped lay the foundation for a clear rule against racial gerrymanders. First, the Court situated the case firmly within its racial classification jurisprudence, affirming that “laws that explicitly distinguish between individuals on racial grounds fall within the core of [the Equal Protection Clause’s] prohibition,” and that “[e]xpress racial classifications are immediately suspect.”⁵¹ Furthermore, the Court asserted that the harms of racial classification are as present in the electoral context as they are in other contexts that the Court had reviewed:

Racial classifications of any sort pose the risk of lasting harm to our society. They reinforce the belief, held by too many for too much of our history, that individuals should be judged by the color of their skin. Racial classifications with respect to voting carry particular dangers. Racial gerrymandering, even for remedial purposes, may balkanize us into competing racial factions; it threatens to carry us further from the goal of a political system in which race no longer matters—a goal that the Fourteenth and Fifteenth Amendments embody, and to which the Nation continues to aspire.⁵²

The Court acknowledged, however, that all redistricting is necessarily race-conscious, drawn with an awareness of racial demographics, just as the legislature is aware of many other demographics features when drawing legislative districts.⁵³ Therefore, the Court held that not all race-conscious redistricting is unconstitutional.⁵⁴ The Court did not, however, identify the line between permissible race-conscious political redistricting and impermissible racial gerrymandering. It concluded that the plan in question was so “bizarre on its face that it was ‘unexplainable on grounds other than race.’ ”⁵⁵ Therefore, the Court held that the appellants stated a claim strong enough to survive a motion to dismiss and remanded the case for further determinations.⁵⁶

A similar set of facts led to another suit that the Supreme Court considered involving Georgia in the case of Miller v. Johnson.⁵⁷ In announcing its decision in an opinion authored by Justice Anthony Kennedy, however, the Court both affirmed critical parts of Shaw and helped indicate where to draw the line between race-conscious political districting and impermissible racial gerrymandering.

The Court first specifically rejected the state of Georgia’s claim that “evidence of a legislature’s deliberate classification of voters on the basis of race cannot alone suffice to state a claim under Shaw.”⁵⁸ Kennedy also observed that “the essence of the equal protection claim recognized in Shaw is that the state has used race as a basis for separating voters into districts.”⁵⁹ Critically, however, the Court promulgated a “predominant factor” test to guide it’s application of the facts to the law in this context.⁶⁰

To establish a racial gerrymandering claim, a plaintiff must prove that “race was the predominant factor motivating the legislature’s decision to place a significant number of voters within or without a particular district.”⁶¹ To make this showing, a plaintiff must establish that the legislature subordinated traditional race-neutral districting principles, including but not limited to compactness, contiguity, respect for political subdivisions or communities defined by actual shared interests, to racial considerations.⁶² Where these or other race-neutral considerations are the basis for redistricting legislation, and are not subordinated to race, a state can “defeat a claim that a district has been gerrymandered on racial lines.”⁶³

In several subsequent cases involving similar fact patterns and southern states, the Court further clarified that to bring racial gerrymandering claims, individuals must reside in the district that they claim is gerrymandered⁶⁴ and that such claims should be brought on a district-by-district basis, not against an entire plan.⁶⁵ This is consistent with the view that these cases are concerned with the dangers of racial classification. An individual residing outside of a district that it claims has been gerrymandered has not, in some sense, been “classified” on the basis of race by government as the Court has construed this concept.⁶⁶ It should be noted that most of the contemporaneous cases heard by the Court at this time involved drawing districts to increase or preserve minority representation in Congress under the VRA.⁶⁷

There remain a number of ambiguities in how to apply the predominance test, some of which the Supreme Court has grappled with, but has not necessarily fully resolved.⁶⁸ In particular, the Court has tried to clarify how the consideration of race is to be viewed in relation to other considerations before triggering strict scrutiny. In a case heard in the 2015 term, the Court rejected a lower court’s ruling that race did not “predominate” as a consideration in the redistricting plan because of “non-racial factors,” including the goal of creating districts of equal population.⁶⁹ The Court clarified that the equal population factor is not a factor to be considered in the ordinary course of redistricting, but a constitutional mandate.⁷⁰ While that may seem like an easy case, the Court considered a harder question in the 2016 term.

The state of Virginia defended a Republican-led redistricting plan against a racial gerrymandering claim on the grounds that the “predominance test” should only be applied if the use of race is in conflict with “traditional districting principles,” as it had presumed in cases such as Shaw.⁷¹ The Court rejected this position in a seven-to-one decision presented in an opinion by Justice Kennedy. He concluded that racial gerrymanders can exist or arise even under plans that otherwise conform to traditional factors such as compactness and contiguity. As he explained, “The Equal Protection Clause does not prohibit misshapen districts. It prohibits unjustified racial classifications.”⁷²

Critically, the Court emphasized that an unconstitutional racial gerrymander can arise or exist even if an identical plan could have been adopted without consideration or use of race. Justice Kennedy explained that “[t]he racial predominance inquiry concerns the actual considerations that provided the essential basis for the lines drawn, not post hoc justifications the legislature in theory could have used but in reality did not.”⁷³ Thus the predominance test is a factual inquiry into considerations used as part of the actual districting process. Holding otherwise would provide a state with constitutional cover for unconstitutional behavior. As the Court observed, “By deploying [non-racial] factors in various combinations and permutations, a State could construct a plethora of potential maps that look consistent with traditional, race-neutral principles. But if race for its own sake is the overriding reason for choosing one map over others, race still may predominate.”⁷⁴

B. PARTISAN GERRYMANDERING

Since the early 1960s, the Supreme Court has become more solicitous of racial gerrymandering claims even has it has become more hostile to partisan political gerrymandering claims. This Section will now review the latter line of cases.

The Supreme Court squarely confronted a partisan gerrymandering claim in the 1986 term in the case of Davis v. Bandemer.⁷⁵ In that case, the Court reviewed a redistricting plan proposed by the Republican-controlled Indiana state legislature. This plan yielded an immediate partisan advantage in which 43 out of the 100 seats in the state House of Representatives were filled by Democratic candidates even though 51.9% of the statewide votes went to Democratic candidates.⁷⁶ Upon review, a deeply divided Court produced a fragmented set of opinions in which most justices agreed on the result—a holding that the plan was not unconstitutional—but disagreed on the rationale and basis for that judgment.⁷⁷

Six of the Justices, and therefore the Court, endorsed the legal principle that partisan or political gerrymanders are justiciable, while three Justices, Justice O’Connor, Justice Burger, and Chief Justice Rehnquist, preferred to rule that such claims are not. Even among the Justices in the majority and plurality, however, there was a lack of consensus on the grounds for doing so and on the standards that should be adopted to evaluate such claims. The debate between the Justices prefigures most of the issues that have been subsequently debated in this context.

The plurality of Justices White, Marshall, Brennan and Blackmun begin with the principle that in order to establish a partisan gerrymandering claim, plaintiffs must “prove both intentional discrimination against an identifiable political group and an actual discriminatory effect on that group.”⁷⁸ They caution, however, that the mere fact that “a particular apportionment scheme makes it more difficult for a particular group in a particular district to elect the representatives of its choice does not render that scheme constitutionally infirm.”⁷⁹ In addition, they reject the notion that mere disproportionality in representation is a sufficiently adverse effect to establish a constitutional violation. The plurality advises that reviewing courts examine both the district individually as well as the state’s overall districting plan holistically. The plurality also asserts that a constitutionally infirm redistricting plan can occur either when a minority manipulates boundaries to consistently thwart the will of the majority, or when a majority uses its power to shut a minority out of the political process or a meaningful chance to influence it.

Ultimately, the plurality held that “unconstitutional discrimination occurs only when the electoral system is arranged in a manner that will consistently degrade a voter’s or a group of voters’ influence on the political process as a whole.”⁸⁰ In applying this standard, the plurality emphasized that claimants would need to examine more than a single election to conclude that an impermissible partisan gerrymander exists, and that they should be able to distinguish between a meaningful structural disadvantage established by districting compared to mere lack of success in persuading voters. The plurality felt that the district court’s conclusions were less persuasive than the state’s defense on this point and voted to reverse the lower court.

Justice Powell and Justice Stevens found the plurality’s approach too restrictive and would have affirmed the lower court. In particular, Justice Powell wrote in favor of a multi-factor analysis:

The most important [factor to consider is] the shapes of voting districts and adherence to established political subdivision boundaries. Other relevant considerations include the nature of the legislative procedures by which the apportionment law was adopted and legislative history reflecting contemporaneous legislative goals. To make out a case of unconstitutional partisan gerrymandering, the plaintiff should be required to offer proof concerning these factors, which bear directly on the fairness of a redistricting plan, as well as evidence concerning population disparities and statistics tending to show vote dilution. No one factor should be dispositive.⁸¹

Justice O’Connor and Justice Burger argued strenuously that the majority’s holding would prove unworkable in practice, as illustrated by the inability of the six Justices in favor of holding partisan gerrymanders as justiciable to get behind a single standard and therefore was a “political question” as set out in the framework for evaluating such questions in Baker v. Carr.⁸² In addition to the lack of a clearly defined and easily applicable standard for adjudicating such claims, they had prudential concerns. Justice O’Connor argued that without a clear standard, opening federal courts to claims of partisan gerrymandering would lead to “pervasive and unwarranted judicial superintendence of the legislative task of apportionment.”⁸³ She wondered if there was any logical stopping point short of “roughly proportional representation for every cohesive political group.”⁸⁴

Despite its inviting holding, Bandemer established a heavier burden for plaintiffs to overcome so as to make out a discriminatory political gerrymandering claim than its authors probably imagined. Over the next eighteen years, no federal courts, at any level, ruled that a redistricting plan was an unconstitutional political gerrymander. This was the situation when the Court heard the case of Vieth v. Jubelirer in 2004.⁸⁵

In Vieth, the Court considered a suit brought by members of the Democratic Party, claiming that the state of Pennsylvania’s partisan redistricting plan violated the Constitution.⁸⁶ Five Justices agreed that the plan did not violate the Constitution, but four of those Justices took the position of Justice O’Connor, Justice Burger, and Chief Justice Rehnquist in Bandemer, arguing that political gerrymanders should be non-justiciable and voting to overturn Bandemer.⁸⁷ The fifth Justice, Justice Kennedy, however, agreed that current standards were unworkable, but he preferred to leave the door open to discovering one.⁸⁸ The four dissenting Justices agreed with Bandemer’s holding but disagreed on the specifics of how to operationalize that principle.⁸⁹

The Court subsequently heard a case that had been held over until Vieth was decided, League of United Latin American Citizens v. Perry.⁹⁰ In Perry, the petitioners challenged the 2003 redistricting plan enacted by the Republican-controlled Texas legislature. Drawn merely a year after the 2002 midterm elections, the 2003 plan led to an election result where twenty-one Republicans and eleven Democrats were elected in congressional elections the following year,⁹¹ a far more lopsided result than the aggregate vote count would suggest. The petitioners argued that the mid-decennial nature of the redistricting plan revealed the legislature’s sole motivation to gain partisan advantage,⁹² which should be sufficient to trigger heightened scrutiny. In the majority opinion, Justice Kennedy entertained the petitioners’ proposed “sole-intent” test, but ultimately found it “not convincing” because some contested district lines were drawn based on more local interests and a number of line-drawing requests by Democratic legislators were honored.⁹³

Interestingly, Justice Stevens and Justice Breyer, in their dissent, proposed a different test wherein a plaintiff must prove partisan aims by showing that: (1) the legislature “subordinated neutral districting principles to political considerations” and (2) their predominant motive was to “maximize one party’s power,” along with a showing of discriminatory intent by establishing that (1) the plaintiff’s candidate of choice won election under the old plan; (2) the plaintiff’s residence is now in a district that is a safe seat for the opposite party; and (3) the plaintiff’s new district is less compact than the old district.⁹⁴ However, this complex alternative test was clearly not entertained by the majority.

The next major partisan gerrymandering case was heralded when the Court agreed to review a redistricting plan arising from Wisconsin in Gill v. Whitford, based on the fact that a new technical standard had been developed and proposed: the efficiency gap.⁹⁵ This was a formula devised by social scientists to provide metrics that could gauge the specific disadvantage created by certain redistricting schemes. The authors of this formula claimed it provided clear and workable standards for operationalizing them.⁹⁶ The result was a letdown when the Court dismissed the case for lack of standing. The Court also breezily dismissed another case the same year in a per curiam opinion regarding Maryland’s Democratic-favored political gerrymandering.⁹⁷ This case was reconsidered, and then dismissed, in the 2019 term.

Justice Kennedy’s departure from the Court led to a more decisive result in this line of cases. In Rucho v. Common Cause, a majority of the Court held for the first time that partisan gerrymanders are a political question and non-justiciable largely for the reasons developed in the dissent in Bandemer and the concurrences in Vieth.⁹⁸ This would seem to be, thus far, the end of the line for claims of partisan gerrymandering.

C. POINTS OF CONVERGENCE AND DIVERGENCE

The juxtaposition of the racial gerrymandering and partisan political gerrymandering cases is itself revealing. The partisan and racial gerrymandering cases germinate from the same seed and the same soil. They both arise out of a political context in which democratic representation is being distorted or manipulated for specific advantage, and in a legal context in which the Constitution provides scant direct guidance (despite the provision of a political mechanism by which Congress, via Article 1, Section 4, can alter state electoral rules in federal elections). The germ for both claims is the Court’s greater solicitude toward challenges to various electoral schemes in the early 1960s. And yet the Court has evolved vastly different frameworks and conclusions for regulating these forms of political districting activity for reasons that are not entirely convincing or coherent.

In theory, the different levels of scrutiny and accord given to race versus other classifications under the Equal Protection Clause could explain the differences in treatment, but the proponents of treating political gerrymanders as non-justiciable decline to ground their reasoning on this basis.⁹⁹ After all, if this were the critical explanatory factor, then partisan gerrymandering claims would be justiciable, just at a much lower level of judicial review.

Instead, the main contention of the non-justiciable position emphasizes the lack of workable standards and the constitutional structure and history of gerrymandering as a political practice. Recall, for example, that Justice White and the plurality in Bandemer emphasized whether a districting scheme created a severe structural disadvantage in access to the political process, whereas Justice Powell (joined by Justice Stevens) would have applied a multi-factor, holistic approach. Justice O’Connor and the Justices who joined her opinion argued that the Baker factors for evaluating whether an issue is a political question squarely fit, and that there was no logical stopping point short of proportional representation once such claims were entertained.

The most obvious rejoinder to the claim that political gerrymandering suits cannot be grounded onto a workable standard is the fact that workable standards have already been developed and adopted in the racial gerrymandering context. Logically, if the standard is workable in one context, it should be workable in another, absent some factor that would render it otherwise. Indeed, this is a prominent theme of the Justices who support judicial review of extreme partisan gerrymanders.

In Bandemer, all six Justices in the majority signed onto a sharp critique of Justice O’Connor’s opinion (joined by Burger and Rehnquist) arguing that she failed to “point out how the standards that we set forth here for adjudicating this political gerrymandering claim are less manageable than the standards that have been developed for racial gerrymandering claims.”¹⁰⁰ This was also a central point of contention in Vieth. The plaintiffs in that case modeled their claim on the standard adopted in Miller, that the partisan objective was a “predominant factor” in the redistricting process.¹⁰¹ As the plurality in Vieth forthrightly noted, “Appellants contend that their intent test must be discernible and manageable because it has been borrowed from our racial gerrymandering cases.”¹⁰² Yet, it disagreed, for reasons that Justice Stevens upbraided in his dissent:

Especially perplexing is the plurality’s ipse dixit distinction of our racial gerrymandering cases. Notably, the plurality does not argue that the judicially manageable standards that have been used to adjudicate racial gerrymandering claims would not be equally manageable in political gerrymandering cases. Instead, its distinction of those cases rests on its view that race as a districting criterion is “much more rarely encountered” than partisanship, and that determining whether race—“a rare and constitutionally suspect motive”—dominated a districting decision “is quite different from determining whether [such a decision] is so substantially affected by the excess of an ordinary and lawful motive as to [be] invali[d].” But those considerations are wholly irrelevant to the issue of justiciability.¹⁰³

Moreover, the stronger the argument advanced by the plurality in Vieth and the majority in Rucho against porting the standards used in the racial gerrymandering cases to the political gerrymandering context, the more they reveal the weaknesses of the doctrine they have developed in the racial gerrymandering context. In Vieth, for instance, Justice Scalia writes on behalf of the plurality that the “predominant motivation” test is “vague . . . when used to evaluate single districts, [but] it all but evaporates when applied statewide.”¹⁰⁴ Any vagueness, however, is built into the notion that courts can discern the predominance of any particular factor in a complex legislative process. Qualitative considerations in legislative processes are not like mechanical inputs into the production of widgets. They inherently resist the quantification necessary to determine “predominance.”

The argument Justice Scalia develops in the guise of a rhetorical question is especially revealing:

And how is the statewide “outweighing” to be determined? If three-fifths of the map’s districts forgo the pursuit of partisan ends in favor of strictly observing political-subdivision lines, and only two-fifths ignore those lines to disadvantage the plaintiffs, is the observance of political subdivisions the “predominant” goal between those two? We are sure appellants do not think so.¹⁰⁵

The exact same argument could be developed at the scale of a single political district in the racial gerrymandering context, using neighborhoods, census tracts, or other census designated geographies as the smaller scale analogue subdivision. The “predominant factor” test seeks to quantify the inherently qualitative. Any deficiency in the predominant factor test in the partisan gerrymandering context is likely equally applicable to the racial gerrymandering context.

Finally, Justice Scalia tries to draw another distinction to justify this difference in treatment:

Determining whether the shape of a particular district is so substantially affected by the presence of a rare and constitutionally suspect motive as to invalidate it is quite different from determining whether it is so substantially affected by the excess of an ordinary and lawful motive as to invalidate it.¹⁰⁶

But this argument utterly ignores the fact that Shaw and its progeny never characterized the pursuit of partisan political advantage as an ordinary (“traditional”) districting principle and also readily acknowledged that virtually all political districting is race-conscious, drawn with the awareness of racial demographics. To that extent, it is impossible to declare that consideration of race is any less “ordinary” than consideration of partisan advantage in the redistricting process. Moreover, in both cases, the argument is that it is only the predominance of this consideration that renders it illegal, not its mere presence. To that extent, Justice Scalia’s comment assumes what it concludes when it declares one legal and the other unlawful.

Chief Justice Roberts makes a similar leap in logic in Rucho: “A permissible intent—securing partisan advantage—does not become constitutionally impermissible, like racial discrimination, when that permissible intent predominates.”¹⁰⁷ If true, it is only because he and his colleagues make it so. If they decided that securing partisan advantage was unconstitutional when it predominates the redistricting process, then it would be so. Both Justice Scalia and Chief Justice Roberts assume that pursuit of partisan advantage is permissible to justify their conclusion that the case lines are different. Further, if the main reason that partisan gerrymandering is non-justiciable is the lack of workable standards, then the constitutionality of such processes is based upon their judgment in that regard rather than an evaluation of valiant, but ineffective, efforts.

As unconvincing as these arguments are for why the predominance test cannot be reasonably applied to the partisan context, there is an even stronger argument for why partisan and racial gerrymanders cannot be treated in such categorically distinct ways, and it is a point that the dissenters have not developed: the entanglement of partisanship and race in redistricting processes.

Against a backdrop of even moderate racial residential segregation, racial political polarization in voting patterns combined with the race-conscious districting processes mean that partisan gerrymandering cannot be clearly distinguished from racial gerrymandering. Even if mapmakers completely ignore racial demographics in generating districting options, a redistricting process motivated by the pursuit of partisan advantage is likely to produce maps strikingly similar to those that might occur if the state had “used race as a basis for separating voters into districts.”¹⁰⁸ The next Part of this Article confronts and unpacks this problem.

III. THE ENTANGLEMENT PROBLEM

The entanglement of race and partisanship in political districting processes is rooted in a few critical phenomena. To understand the entanglement problem, we must first examine the concept of polarization, especially as it is applied to politics.

Polarization is itself an elusive notion, and it is not always clear what it means. Sometimes polarization is invoked to describe the salience of political conflict.¹⁰⁹ This simple characterization is misleading. Although there may be an underlying relationship,¹¹⁰ partisanship and political polarization are not coterminous.¹¹¹

The existence of closely contested elections versus landslide elections cannot tell you whether the electorate is polarized or not. Landslides might occur in deeply polarized contexts but in which one group dominates another, as in the Jim Crow South, or low-polarization contexts in which a broadly popular, mainstream candidate triumphs. Similarly, closely contested elections can occur when both candidates are popular or both are deeply unpopular. Aggregate returns are not indicative.

More often, polarization is used to refer to intensity of disagreement or political conflict. This conceptualization, however, also fails to capture the essence of polarization. Extreme (corrosive or toxic) partisanship can exist in a state of either high or low political polarization. It is the distance or scope of disagreement, not necessarily the intensity of feeling or contestation, that is key to understanding polarization. This is perhaps easier to understand in an economic context.

Suppose that most people in a certain society are middle-income, with far fewer extremely rich and extremely poor individuals. Such an income distribution would be depicted graphically in the form of what statisticians call a “normal distribution” (or a “Gaussian” distribution) with a bulge in the middle and smaller downward sloping tails toward the edges of the income distribution.¹¹² In contrast, suppose that most people in another society are either extremely poor or very wealthy, with fewer people as middle-income. This is what statisticians call a bimodal distribution, with bulges towards the ends of the distribution. The mean or median results could be identical in both situations. See Figure 1.

FIGURE 1. A Normal vs. a Bimodal Distribution

Polarization refers to a process by which a distribution resembling a normal distribution evolves into something more closely resembling a bimodal distribution. Thus, if the income distribution hollows out this way, economists may describe this as “economic polarization.”¹¹³ The same is true of politics.

As political scientists and surveyors have demonstrated, American political attitudes have indeed become more polarized in recent decades.¹¹⁴ The Pew Research Center, which conducts regular surveys on political attitudes, found that in both 1994 and 2004, more Americans expressed mixed or centrist political opinions than either conservative or liberal views.¹¹⁵ By 2014, however, this had reversed, a trend that has continued to the present.

One particularly important dynamic contributing to political polarization is that Republicans have become more conservative and Democrats have become more liberal. In 1994, only 64% of Republicans were more conservative than the median Democrat, and only 70% of Democrats were more liberal than the median Republican. By 2014, those figures had risen to 92 and 94%, respectively.¹¹⁶

Another form of polarization is racial political polarization. This tends to refer to distinctive partisan preferences between racial groups. Table 1 below indicates the significant racial gaps in support for partisan candidates in presidential elections since 2000.

TABLE 1. Republican Support by race in Presidential Elections, 2000–2020.¹¹⁷.

Although there is non-trivial variation between electoral cycles, this table shows consistently high political polarization between racial groups in the popular vote for President (and by implication for political party) over two decades. In presidential elections since 2000, Black support for the Republican candidate narrowly ranges from 4% to 12%. White support ranges from 55% to 59%. Latino support ranges from 26% to 44% (in 2004).¹¹⁸ And Asian support ranges from 27% to 43%. The white-Black voting gap ranges from 46 to 53 points. Although political polarization by race seems to reach a peak in 2012, the figures as of 2020 are just as large, if not larger, as they were in 2000. In short, racial political polarization has worsened over time, but it has been consistently high throughout the twenty-first century.

Similar tables could be generated for congressional races, midterm elections, governor’s races, and the like. Political scientists have observed that racial political polarization in the United States has been pronounced at least since 1965.¹¹⁹ This connection between race and partisan affiliation is described as the “conjoined polarization.”¹²⁰ Although this phenomenon might be troubling in its own right, it is not automatically a problem in the context of political redistricting processes.

If racial groups were evenly distributed across space, then political polarization by race would not factor into political redistricting processes. Gender political polarization offers a contrasting illustration. Gender political polarization has increased significantly in recent years too, and a table similar to Table 1 could be presented illustrating it.¹²¹ But the absence of gender political segregation means that gender is not a meaningful basis upon which to conduct political redistricting processes.

Residential segregation is the critical element that enables gerrymandering. This is true generally, but in the context of racial gerrymandering, racial residential segregation is the critical factor.

When racial groups are highly concentrated into certain neighborhoods or communities, and there is a moderate to high degree of racial political polarization, then racial gerrymandering is relatively easy. On the other hand, even if the political community is racially polarized, if racial groups are highly integrated across space, then racial gerrymandering would prove extremely difficult to impossible. If either condition is absent, it is not possible to either use race to draw political districts for partisan advantage nor to draw districts for partisan advantage that segregate people into different political districts on the basis of race. Only when both conditions hold does the entanglement problem arise. Table 2, below, indicates the relationship.

TABLE 2. Conditions for Entanglement

The entanglement of race and politics in political redistricting processes, then, is not simply a function of the conjoined polarization of racial political polarization and partisan political polarization. It depends on the existence of racial residential segregation as well.

Unfortunately, racial residential segregation is persistently moderate to high across much of the United States. Using traditional measures of segregation such as the dissimilarity index, national Black-white dissimilarity in 2020 was 55.2, meaning that 55% of Black Americans would have to move into a predominantly white neighborhood to have no racial residential segregation between Black and white Americans.¹²² This is in a range social scientists regard as “moderately high.” Asian-white dissimilarity scores are 40.0, and Hispanic-white dissimilarity scores are 45.3, in the moderate range.¹²³ Even though most of these numbers have improved in recent decades, they demonstrate the persistence of racial residential segregation. In many cities, these figures are much higher.

Other popular measures of segregation are the Isolation/Exposure Indices, which describe neighborhood composition of the typical (median) or average person by race. The “typical” Black and white Americans reside in vastly different neighborhood milieus. As of 2020, the average white resident of a metropolitan area resides in a neighborhood that is 69% white, 9% Black, 12% Hispanic, and 6% Asian.¹²⁴ In contrast, a typical Black resident lives in a neighborhood that is 41% Black, 34% white, 17% Hispanic, and 6% Asian. Not only are these demographically different worlds, these figures mean that Black “exposure” to white people is 34, roughly the same level it was in 1940.

Using a different measure of segregation, the Divergence Index, which can account for multiple racial groups simultaneously and provide a single holistic score for every city or metropolitan region, racial residential segregation appears stubbornly persistent.¹²⁵ The Divergence Index compares demographic proportions of smaller geographies to larger geographies and then sums those population-weighted differences to yield a holistic score.¹²⁶ Over 50% of cities and metro areas have a higher (more segregated) Divergence score as of 2020 than in 1990.¹²⁷

Given the central role of residential segregation in facilitating or impeding the manipulation of district boundaries for political advantage, it is strange that prior legal scholarship has under-emphasized or ignored this factor in analyzing gerrymandering cases. Perhaps that is because it is assumed that segregation is a neutral or binary background factor that either exists or does not, and does not actually affect the level of partisan political polarization in a region. If so, there are several findings that challenge this assumption.

A study conducted by the political scientist Jessica Trounstine identified a direct relationship between racial residential segregation and partisan political polarization.¹²⁸ She finds that the relationship between segregation and racial political polarization is statistically powerful: a city in the 10th percentile of segregation has a 35% point divide in racial support for a political candidate, compared to a 63% point divide at the 90th percentile.¹²⁹ In other words, the more racial residential segregation, the more racial political polarization.

Could this difference be explained by the fact that white people vote more conservatively in certain states or regions than others? She tests for this and finds that the relationship between segregation and polarization is unaffected by the conservatism of the local white population. In fact, she found that “cities with more conservative white populations have smaller racial divides.”¹³⁰

In short, racial residential segregation is not a background condition that exists in a binary state, either existing or not, to undergird the entanglement problem. Rather, it is a condition that interacts dynamically with racial political polarization. A higher level of racial residential segregation seems to coincide with higher racial political polarization. Similarly, racial residential segregation interacts dynamically with partisan political segregation and (by inference) partisan political polarization, as will be shown below.

To appreciate the full relationship between race and politics, it is important first to emphasize that the critical role of residential segregation in relationship to gerrymandering is general, not specific to racial segregation and racial gerrymandering. Partisan political polarization, by itself, is neither a necessary nor a sufficient condition to enable partisan gerrymandering. Political segregation, however, is necessary. If people of different political preferences or partisan affiliation are evenly distributed across space, regardless of how polarized they may be, it is extremely difficult to draw political districts for partisan advantage. On the other hand, high levels of partisan residential segregation (say, Republicans living in one community and Democrats in another) make it much easier to draw districts for partisan political advantage. For simplicity of illustration, compare hypothetical voting precincts of equal size with voting totals depicted in Tables 3 and 4 below.

TABLE 3. Hypothetical #1 Voting Precinct Electoral Results

Although the vote total in Table 3 is relatively close (only a two-point margin between the winner and loser), the precincts are extremely divergent from the aggregate vote total, suggesting a high degree of political segregation between precincts. In contrast, consider this a different election with the same aggregate result in Table 4 below.

TABLE 4. Hypothetical #2 Voting Precinct Electoral Results

In this case, the winner’s vote margin is the same as in the first case, but the precincts only marginally diverge from the aggregate result. The largest gap for support for Candidate A and the aggregate result is just 4 points, in Precinct 1, compared to a 37–40-point gap between each precinct and the ultimate results in the first case. The first case suggests a high level of political segregation across precincts. If these results were stable over time, and not specific to a particular candidate but consistent across candidates and issues, then we could reasonably describe that region as having a high degree of political segregation. It should be much easier for mapmakers to draw political districts for partisan advantage in cases like the first rather than the second (although a larger number of precincts or sub-precinct level voting data would be needed).

Unfortunately, the United States has widespread political segregation as well, which maps vividly illustrate.¹³¹ Democrats and Republicans are not just more divergent in their views, they are increasingly residing in different communities.¹³² Using this type of data or other geographically disaggregated vote tabulations, it is possible to calculate the degree of political segregation that exists in different regions. Using the formula for the Divergence Index (described above),¹³³ I have calculated the relative degree of political segregation for different major metropolitan regions of the United States.

Using 2020 presidential election precinct tabulation results, out of the 314 largest metropolitan areas in the United States (those with a population of 200,000 or more), Table 5 below lists the top 20 most politically segregated regions of the United States along with their divergence score.¹³⁴

TABLE 5. The 20 Most Politically Segregated Metropolitan Areas in the United States (2020 Presidential Election)

The dynamic of political segregation can also be depicted visually, providing a more intuitive and easily comprehensible approach. Compare maps of Jackson, Mississippi (the most politically segregated metro), with Carson City, Nevada (ranked 273 out of 314), which has one of the lower political divergence scores), in Figure 2, below. The contrast is vivid.

FIGURE 2. Political Segregation in Jackson, Mississippi, and Carson City, Nevada

As the maps displayed in Figure 2 illustrate, the metropolitan area of Jackson ranks first in political divergence, indicating the presence of ideological extremes where precincts overwhelmingly voted in favor of one party while neighboring precincts voted in favor of the other party. In contrast, all of the precincts in Carson City approximate the regional average, suggesting a much lower level of political segregation. The high level of political segregation in Jackson makes it especially vulnerable to partisan gerrymandering. This is partly what makes districting such a dilemma in the United States: the high degree of political geographic or residential segregation.

One of the striking features of the list presented in Table 5, above, is that the vast majority of the most politically segregated regions are in the South (17 of the top 20).¹³⁵ Given that region’s racial history, it raises the question of whether there is a relationship between political residential segregation and racial residential segregation in the United States. Figure 3, below, confirms the relationship between the two, as shown in a scatterplot.

FIGURE 3. Racial Segregation and Political Segregation in the United States, 2020 (314 Metros)

Figure 3 depicts the 314 largest metropolitan areas in the United States. The horizontal axis indicates the divergence index score (calculated as described above) for political segregation and the vertical axis indicates the relative percentile rank for racial residential segregation using the same formula but with data from the 2020 decennial census.¹³⁶ The scatterplot indicates a clear positive relationship, with a 0.50 correlation between the two variables.¹³⁷

Not only does this analysis suggest a relationship between the two phenomena, but it also means that where racial segregation is higher, partisan gerrymandering should be easier. Conversely, where partisan segregation is higher, racial gerrymandering should be easier. In short, they go hand in hand, but in a dynamic relation.¹³⁸ The problem lies in their coincidence.¹³⁹ This is the crux of the entanglement problem, not the simple fact of conjoined polarization between race and politics.

Whether the context is partisan gerrymandering or racial gerrymandering, the active ingredient is segregation, not polarization. Partisan gerrymandering only requires some degree of political segregation. Racial gerrymandering, however, requires both racial political polarization and racial residential segregation.¹⁴⁰ Prior legal scholarship treats segregation as the backdrop condition and conjoined polarization as the central or proximate problem when the truth is the opposite.

In practical terms, the persistence of racial political polarization means that the strong correlation between political segregation and racial residential segregation easily facilitates both racial and partisan gerrymandering in ways that are essentially indistinguishable. State officials can draw maps at a hyper-granular level that may relocate a small number of people from one district to another, with full awareness of their race and the fact of conjoined racial identity with partisan preference, and nonetheless claim that the decision was based on partisan motivations. In other words, racial residential segregation enables partisan gerrymandering that will result in the political segregation of people between districts on the basis of race. Even if the map-makers were to scrub all data regarding race from their software, a map drawn on partisan or other non-racial characteristics could appear objectively indistinguishable from maps drawn in cases like Shaw.

This problem is not speculative or theoretical. The Supreme Court has already heard cases touching on this problem. In oral argument in Wittman v. Personhuballah,¹⁴¹ Chief Justice Roberts asked “if race and partisanship are co-extensive, which one predominates?”¹⁴² In that case, several Republican members of Congress appealed a lower court’s decision to strike down a redistricting plan it found to be based on race. This question led to a brief dialogue among the Justices and the lawyer for the original plaintiffs regarding this issue, but the case was ultimately dismissed for lack of standing among the members of Congress to bring their appeal.

In another case decided that same term, Cooper v. Harris, the Court acknowledged that many of the Shaw considerations (compactness, for example) used to assess whether race predominated become less probative when the state raises the defense of partisanship.¹⁴³ As it explained, “political and racial reasons are capable of yielding similar oddities in a district’s boundaries. That is because, of course, ‘racial identification is highly correlated with political affiliation.’ ”¹⁴⁴ In that case, the Court rejected the claim that partisan goals are a complete defense to racial gerrymandering claims. As it explained, the predominance “inquiry is satisfied when legislators have ‘place[d] a significant number of voters within or without’ a district predominantly because of their race, regardless of their ultimate objective in taking that step.”¹⁴⁵

So, for example, if legislators use race as their predominant districting criterion with the end goal of advancing their partisan interests—perhaps thinking that a proposed district is more “sellable” as a race-based VRA compliance measure than as a political gerrymander and will accomplish much the same thing—their action still triggers strict scrutiny.

Id.

But this guidance merely sidesteps the more difficult question of how to determine whether a legislature’s districting decisions were “because of race” in such cases, and assumes that racial motives can be disentangled from partisan ones (either as a means or an end).¹⁴⁶ Indeed, the entire concept of “predominance” assumes that the factors considered by entities charged with redistricting, and which are being reviewed by courts, are separate and independent elements.¹⁴⁷ Because of the difficulties introduced by entanglement, lower courts may be hesitant to rule that race “predominates” when race and partisanship are highly entangled or when the state supplies reasons to believe that any apparent use of race was merely partisanship.¹⁴⁸

Again, this is not a speculative concern. The NAACP Legal Defense Fund and the Lawyers Committee for Civil Rights brought a suit against Georgia in 2017 alleging that a mid-decade redrawing of political districts was both a racial and political gerrymander.¹⁴⁹ The district court acknowledged that ascertaining the existence of a racial gerrymander was “particularly hard to do when the State offers a defense rooted in partisan gerrymandering, as it did here. We did not move these voters because they are black, the State tells us. We moved them because they were Democrats.”¹⁵⁰ The court ultimately sided with the state for that reason.

Experience demonstrates that this epistemological problem created by the entanglement of racial and partisan gerrymanders already exists and may be intensifying. The provision of block level census data following the 1990 census meant that state legislatures could draw more fine grain political districts based on race than was ever possible before using computer programs. Indeed, the Court confronted this fact in Bush v. Vera, in which the Court noted that the computer program “REDAPPL enabled districters to make more intricate refinements on the basis of race than on the basis of other demographic information.”¹⁵¹

This technology has only improved in the intervening decades. It is now possible to generate thousands of potential maps at a keystroke with computer processing and programs that draw from large voter or census files.¹⁵² Large data files can be cross-referenced not only to generate demographic profiles, but also psychographic information, such as predicting propensity to vote, donate money, or even respond to certain campaign communications.¹⁵³

Upon the death of a North Carolina Republican strategist involved in redistricting efforts, his daughter made public his personal computer files against the wishes of the party and company he worked for.¹⁵⁴ The trove contained thousands of documents detailing the various ways that he sought to generate political advantages for his clients, describing gerrymandering as legal vote stealing.¹⁵⁵

In Vera, however, Justice O’Connor stated that “[i]f district lines merely correlate with race because they are drawn on the basis of political affiliation, which correlates with race, there is no racial classification to justify.”¹⁵⁶ Although likely dicta, this approach nonetheless does not resolve the issue because, if there is a perfect identity or correspondence of race and partisanship, how is a court supposed to judge whether the lines were drawn “because of political affiliation” or “because of race”? If a state legislature is trying to conceal its racial intentions, it would simply develop a record of partisan purposes and other traditional districting considerations. The entanglement of race and partisanship would then allow state legislatures to subvert the Constitution.¹⁵⁷ The vastly divergent standards for both forms of gerrymandering makes it even more difficult to regulate this problem. There are, however, solutions.

IV. SOLUTIONS

This Article argues that the wildly divergent standards established by the Supreme Court governing partisan political gerrymandering and racial gerrymandering claims are untenable. Justices across the ideological spectrum agree that the use of race in drawing political districts may run afoul of the Constitution, but the Court’s extremely divergent rules regulating gerrymandering cases make it extremely difficult, if not impossible, to know whether race has been used or not. In too many contexts, partisan political gerrymanders will entail conduct that violates the principles and standards established by the foundational racial gerrymandering cases, or run so closely up to them that any attempt to fully disentangle partisan ends from racial ones is likely to be hopelessly futile or end up subverting the Court’s racial gerrymandering jurisprudence by allowing racial gerrymanders to escape under the cover of partisanship.

Beyond the categorical differences grounded in the Court’s precedent regulating the use of race at a higher level of judicial review than most other classifications, the Court has tried to manage the entanglement problem in two steps: first, by recognizing that all political districting processes are inherently race-conscious, as they are conscious of other demographic and community characteristics.¹⁵⁸ Thus, the Court has distinguished between awareness of race and actions or policy decisions that use race in the sorting of people into different districts. The Court has held that it is the latter that violates the Constitution, not the former.

Second, by requiring that racial considerations actually “predominate” other factors, the Court has drawn a line between impermissible consideration of race and other ordinary or “traditional” considerations such as compactness, contiguity, community boundaries, and so forth.¹⁵⁹ It is notable that in the listing of such ordinary considerations that partisan political advantage is never mentioned or listed. And this is presumably not simply because a bare-faced partisan consideration is unseemly, but because it was not considered by the Court (at least in those decisions) as a regular or ordinary consideration in the districting processes. Nonetheless, the Court’s jurisprudence prompts an objective, factual inquiry into whether race was in fact used or not.

Unfortunately, several demographic factors are converging in a way that makes it much more difficult—if not impossible—to delineate between race and partisanship as a consideration. First, political polarization appears to have increased in recent years.¹⁶⁰ Second, political segregation is highly visible and becoming more pronounced as well.¹⁶¹ Third, racial political polarization has increased markedly in recent decades.¹⁶² The interaction of these three factors, on top of a fourth—the persistence of racial residential segregation¹⁶³—means that partisanship and race are highly correlated in a way that makes partisan districting largely and increasingly coterminous with racial districting. In simplified terms, “conjoined polarization” and racial residential segregation interact to create the conditions that entangle race and partisanship in political redistricting processes.

That this is a practical problem is evidenced by the fact that many cases heard by the Supreme Court in recent years feature both claims, whereas that would have been anomalous even a few decades ago.¹⁶⁴ With the Court shutting the door on partisan gerrymandering claims, it seems increasingly likely that suits designed to curb the excesses of partisan gerrymandering will be brought under the color of racial gerrymandering.¹⁶⁵ Thus, the Court will eventually need to squarely confront and address this problem.

One possible solution is to carve out an exemption for racial gerrymanders that appear to be largely or entirely based on partisan motives, as Justice O’Connor intimated in Vera, but to extend the exemption to cases in which the objective use of race clearly “predominates.”¹⁶⁶ Under this approach, mapmakers would be permitted to use race in districting processes as long as their purpose was purely partisan. Under this approach, partisanship would be a complete defense to racial gerrymandering claims.

This approach would solve the smaller problem of the difficulties lower courts confront disentangling race and partisan motivates, but it would leave intact the larger problem of allowing racial gerrymanders to persist under the cover of partisanship. To that extent, this is only a partial solution or non-solution, because it would potentially obliterate the racial gerrymandering claim and undermine the constitutional prohibition against the use of race in policymaking in many ordinary cases.¹⁶⁷ Under a rule such as this, any egregiously racially segregated political district could be justified on the basis of mere partisanship. The obvious exception would be cases in which state legislatures are seeking to create “majority-minority” districts under the VRA, because those could not plausibly be defended on partisan-only grounds. This approach would clearly violate the Court’s prevailing anti-classification jurisprudence and the principles and spirit of Shaw and its progeny’s rules against permitting state legislatures to sort people into one district or another on the basis of race.

A second possible solution would be to create an exemption in the opposite direction: a supplement could be drawn to the rule that partisan gerrymanders are non-justiciable if partisan purposes overlay or are essentially indistinguishable from racial considerations. In such a case, partisan gerrymanders could be swept into the racial gerrymandering line, even though it would be difficult or implausible to assert that race “predominates.” This is essentially the tack the Court appeared to take in Cooper v. Harris in its 2016 term, prior to the Court’s more recent decision that partisan gerrymanders are non-justiciable in Rucho.

The NAACP Legal Defense Fund’s president and director-counsel, Janai Nelson, has suggested an approach along these lines.¹⁶⁸ The approach would shield against rearguard incursions into racial gerrymandering from the partisan direction. This option is most consistent with the Court’s anti-classification jurisprudence: any sorting or segregation of people into different political districts based on race violates the Constitution, even if it cannot be said that race “predominates.” This approach drives the presumptive rule against the use of racial classifications to its logical endpoint.

Although requiring a tweak to the Court’s racially gerrymandering jurisprudence, the Court can easily justify the use of a threshold test less than “predominance” when two factors are so tightly entangled that “predominance” becomes nonsensical.¹⁶⁹ In such cases it is either impossible for race to “predominate” because partisan considerations are co-extensive with race or, even if they are greater, it is because their correspondence renders the possibility of calculating predominance by disaggregating and weighing the relative influence of each factor or consideration impossible.¹⁷⁰

A third possible solution to the entanglement problem is to reverse course on partisan gerrymanders and declare that they are justiciable. Aside from the unlikely chance that the Court will revisit, let alone reverse, it’s recent decision in Rucho, even if it were to do so, there remains the challenge of defining the standard upon which they can or should be reviewed.

The most obvious and straightforward option is the predominance test—to inquire whether partisan considerations “predominate” over other ordinary districting considerations. In cases where race and partisanship are entangled, this could help solve the larger problem of allowing racial gerrymanders to escape under the cover of partisanship, but it does not actually solve the epistemological problem of how courts may distinguish between entangled factors or inputs.¹⁷¹ Thus, it would have the inverse effect of the first possibility, which is to help address the larger problem, but leave the smaller one intact.

Moreover, this possibility remains only a partial solution to the larger problem. Even if the Supreme Court were to allow courts to review partisan gerrymandering claims, it is unlikely that such partisan gerrymanders would be reviewed at the exactingly high level of scrutiny as racial classifications are. Thus, there is still some risk present that racial gerrymanders will escape regulation in the guise of partisanship due to the gap in the standards of review.¹⁷²

This third option, however, although requiring a reversal of recent precedent, is at least more logically consistent with the idea that these are categorically distinct claims arising from different case law and constitutional concerns. It renders the entanglement problem less urgently in need of resolution since the more extreme partisan gerrymandering cases would be regulated through a parallel structure under (presumably) rational basis review.

This approach has several other meritorious considerations in its favor, especially its potential grounding in various aspects of constitutional jurisprudence. Some versions of this approach, for example, naturally conform to the paradigm famously known as “Carolene Products footnote four.”¹⁷³ This famous footnote in Constitutional Law maps neatly to the entanglement problem at issue in this Article. That is because at the heart of this footnote are issues of political process and racial equality and their interrelation, the same issues here. As the Court said in that famous footnote:

It is unnecessary to consider now whether legislation which restricts those political processes which can ordinarily be expected to bring about repeal of undesirable legislation, is to be subjected to more exacting judicial scrutiny under the general prohibitions of the Fourteenth Amendment than are most other types of legislation. . . . Nor need we enquire whether similar considerations enter into the review of statutes directed at particular religious, or national, or racial minorities; whether prejudice against discrete and insular minorities may be a special condition, which tends seriously to curtail the operation of those political processes ordinarily to be relied upon to protect minorities, and which may call for a correspondingly more searching judicial inquiry.¹⁷⁴

Carolene Products footnote four has been called a “paradigm” within equal protection jurisprudence on account of the fact that it provides a coherent and comprehensive roadmap for judicial review.¹⁷⁵ It is not just that the footnote addresses one possible way of dealing with laws that affect access to the political process, like political redistricting does, but it also specifically deals with the intersection of race and political access: suggesting that laws which impede racial minorities access to the political process should be reviewed more closely.

The Equal Protection Clause arguably provides a sufficient basis by itself for establishing a rule against partisan gerrymanders: they violate fundamental democratic principles such as those that motivated the Court to rule, in the early 1960s, that political districts should be of equal size. They violate majority rule—what Madison called the “fundamental principle of free government” in Federalist No. 58 and Hamilton called the “the fundamental maxim of republican government” in Federalist No. 22.¹⁷⁶ Whether permitting a minority to entrench itself through the manipulation of district boundaries or by manipulating the number of voters in each district, the result can be the same and does violence to the principle of “one person, one vote” either way.

A claim rooted in equal protection could narrowly assert that permitting extreme partisan gerrymanders would violate a person’s right to be treated equally by law or more broadly assert that it would also hinder access to both the political process and the ability to use that process to remedy unfair or unjust legislation. But even if the Equal Protection Clause itself, or some broader more synthetic reading of it, Carolene Products, or even related associational claims grounded in the First Amendment are part of the foundation for rendering partisan gerrymandering claims justiciable,¹⁷⁷ there is another constitutional provision which has lain dormant but could be potentially enlisted to this cause: the Guarantee Clause.

The Guarantee Clause requires the United States guarantee to the states a republican form of government.¹⁷⁸ Although rarely invoked, the prevailing consensus is that this Clause requires majority rule and that representatives serving in state governments be selected by elections.¹⁷⁹ In other words, it is a guarantee to the citizens of those states (and of the nation) that each state government must be republican in form. This clause might be the basis for challenges to features of various state governments that are anti- or un-democratic. In addition to a formal recognition of the problem of entanglement in the gerrymandering cases, a revival of the Guarantee Clause could provide an easily understandable basis for reversing or excepting Rucho.

The problem here is that Supreme Court precedent does not allowed federal courts to entertain claims brought under the Guarantee Clause, thus far. In 1849, and again in 1946, the Supreme Court ruled that claims under this clause are non-justiciable.¹⁸⁰ Prominent and notable jurists, however, would have held otherwise. In his courageous and lonely dissent in Plessy v. Ferguson, the first Justice Harlan would not only have held that the segregative railway statute adopted by the state of Louisiana and reviewed in that case violated the Thirteenth and Fourteenth Amendments to the Constitution, but also the Guarantee Clause. As he explained:

Such a system is inconsistent with the guaranty given by the constitution to each state of a republican form of government, and may be stricken down by congressional action, or by the courts in the discharge of their solemn duty to maintain the supreme law of the land, anything in the constitution or laws of any state to the contrary notwithstanding.¹⁸¹

The aforementioned Supreme Court decisions, moreover, occurred prior to the Court’s ruling that districting cases are justiciable in Baker v. Carr. And in any event, these decisions cannot be based on the meaning of that clause as understood by its framers.¹⁸²

Federalist No. 9 explains that the “principles” of republican governance are “now well understood,” and in addition to the fundamental majoritarian principle, they include: (1) “[t]he regular distribution of power into distinct departments”; (2) “the introduction of legislative balances and checks”; (3) “the institution of courts composed of judges holding their offices during good behavior”; and (4) “the representation of the people in the legislature by deputies of their own election.”¹⁸³

Moreover, Federalist No. 39 dealt specifically with the meaning of republican government. As Madison explained there, “we may define a republic to be . . . a government which derives all of its powers directly or indirectly from the great body of the people; and is administered by persons holding their offices during pleasure for a limited period, or during good behaviour.”¹⁸⁴ In other words, it is a government, in the words of Lincoln, “of, by, and for the people.”¹⁸⁵ Madison goes onto explain that

[i]t is essential to such a government, that it be derived from the great body of the society, not from an inconsiderable proportion, or a favored class of it; otherwise a handful of tyrannical nobles, exercising their oppressions by a delegation of their powers, might aspire to the rank of republicans, and claim for their government the honorable title of republic.¹⁸⁶

Critics might note that, despite Madison and Hamilton’s declarations on the centrality of the principle of “majority rules,” they and their colleagues seemed unconcerned with the use of districting for partisan advantage in terms of either unequal size of districts or manipulation of lines, or else they would have argued against it (as they did against equal suffrage for states) or proposed or included provisions in the Constitution or in their respective roles in the federal government against it. In Part II, a number of possibilities were presented as to why Madison and Hamilton may not have introduced measures relating to this potential problem. A few more may now be considered.

First, any interpretative methodology assessing the Guarantee Clause would have to account for the fact that many categorical exclusions for voting existed at the time of the framing of the Constitution that are now prohibited, including those on the basis of sex, race, and class (through the Fifteenth, Nineteenth and Twenty-Fourth Amendments, respectively).¹⁸⁷ Thus, there are textual reasons to “update” any originalist understanding of the Guarantee Clause based upon the text of the Constitution itself, as amended. After all, it has already been observed that the modern Supreme Court has repeatedly struck down districting plans with population disparities in percentage terms far less than those observed by Madison and Hamilton in The Federalist Papers.

Second, the framers failed to anticipate the extent to which partisanship would manifest in the federal councils and the harmful effects thereof. There is no reason to believe that they should devise measures to address problems they lacked the foresight to see. And, by their own accounts, the precautions and safeguards that the framers believed would curtail the harmful effects of partisanship were based on assumptions and premises that proved fallacious or were quickly refuted by experience as political parties organized themselves in the federal councils.

To be clear, the framers were well-aware of the problem of partisanship. Having observed it within the states and other republics, Alexander Hamilton referred to this problem as “the diseases of faction” and the “demon of faction,” and James Madison called it “mischiefs of faction” and the “rage of party.”¹⁸⁸ What they underestimated was the degree to which political parties would become the primary organizing forces to frame political discourse and focus policy debate in the federal government.

As Madison and Hamilton explained throughout The Federalist Papers, the framers believed that the size and diversity of peoples and interests represented in the national government would ameliorate the effects of faction as observed in the state governments.¹⁸⁹ As Madison concluded in Federalist No. 10, the “variety of sects dispersed over the entire face of [the confederacy of states] must secure the national councils against any danger from that source.”¹⁹⁰ Hamilton arrived at similar conclusions in Federalist Nos. 60 and 61, where he wrote that “a diversity of local circumstances, prejudices, and interests” would make it unlikely that a “predominant faction” would prefer a particular class of electors over another.¹⁹¹ Not only that, Hamilton felt that the diverse manner in which the various federal branches would be populated would safeguard against this problem, such that he concluded there is “little probability of a common interest to cement these different branches in a predilection for any particular class of electors.”¹⁹²

If at any moment the quantity of highly competitive national political parties had been greater and more numerous or if political parties had not acquired so much significance in organizing political interests at a national level, then this conclusion might have proven correct. But by the end of the first decade of the government’s operation under the Constitution, the problems of partisanship were already manifest to such an extent that it was one of the principal subjects of concern in George Washington’s 1796 Farewell Address. Far from leaving office assured with his good works and sanguine on the young nation’s prospects, he sounded an alarm.

In this regard, any originalist argument would be incomplete without considering George Washington’s remarks in his Farewell Address, which were drafted with input from Hamilton and Madison, his top advisors. The degree to which they underestimated the corrosive effects of toxic partisanship—what they called the “baneful effects of the spirit of party”—is clear from the substance of the address.¹⁹³

Having observed the emergent dynamics of partisanship firsthand as the nation’s first chief executive,¹⁹⁴ Washington expressed a deep-seated fear that political parties presented a danger to the stability of the young republic, and potentially an existential threat. The Senate Historical Office characterizes his remarks concerning the “dangers of parties in state” as reflecting the view that political parties “carried the seeds of the nation’s destruction through petty factionalism.”¹⁹⁵ The remarks specify, in serial form, the harmful effects that flow from extreme partisanship. Among them:

• “It serves always to distract the public councils and enfeeble the public administration.”

• “It agitates the community with ill founded jealousies and false alarms,”

• “kindles the animosity of one part against another,”

• “foments occasionally riot and insurrection.”

• “It opens the door to foreign influence and corruption, which find a facilitated access to the government itself through the channels of party passions. Thus the policy and the will of one country are subjected to the policy and will of another.”¹⁹⁶

If anything, the experience of the last few years amply illustrates these dangers, especially in the administration of President Donald Trump, which experienced arguably each of these effects, most obviously in the Ukraine scandal that led to the first impeachment,¹⁹⁷ the interactions with Russian officials that led to the Mueller investigation,¹⁹⁸ and the riot and insurrection at the Capitol on January 6, 2021.¹⁹⁹

Extreme partisanship has fostered a visceral antipathy against the other party (what political scientists call “negative partisanship”) often for no other reason than the “animosity of one part against another,” such that even bipartisanship on broadly popular legislation (such as when Republicans in Congress voted for the 2021 infrastructure bill) is viewed within the faction as a violation of partisan solidarity.²⁰⁰

And, in the most extreme case, above all the previously listed concerns, Washington asserted that partisanship could lead to the “destruction of public liberty” in this way:

The alternate domination of one faction over another, sharpened by the spirit of revenge natural to party dissension, which in different ages and countries has perpetrated the most horrid enormities, is itself a frightful despotism. But this leads at length to a more formal and permanent despotism. The disorders and miseries which result gradually incline the minds of men to seek security and repose in the absolute power of an individual; and sooner or later the chief of some prevailing faction, more able or more fortunate than his competitors, turns this disposition to the purposes of his own elevation on the ruins of public liberty.²⁰¹

Some political prognosticators and military leaders have warned that some version of this frightful vision might be realized in the aftermath of the 2024 election.²⁰²

Critically, however, Washington also characterized the dangers of extreme partisanship in geographic terms, which he called “the danger of parties in the state, with particular reference to the founding of them on geographical discriminations.”²⁰³ Although he may well have been referring to the sectional divide between the North and the South, the specific formulation is a perfect fit for the excesses of gerrymandering. It accurately characterizes the manifestation of partisanship through gerrymandering, which is a process of making geographic discriminations.

Whether a consequence of the framers’ underestimation of the ultimate role of political parties, the extent to which partisanship would infect the federal government, or some other reason, the Constitution itself is silent in regards to the problems posed by extreme forms of political districting. But that does not mean that the framers left the political community helpless, even in the absence of an explicit constitutional provision adopted to solve this problem. Provisions like the Guarantee Clause are open-textured specifically to empower that political community to address problems such as this, empowering both Congress and the courts to regulate such practices as partisan districting as circumstances may necessitate.

CONCLUSION

Unlike the canonical and venerated document that it is regarded as today,²⁰⁴ the United States Constitution was recognized by its framers as a political experiment of uncertain prospects.²⁰⁵ To give the political community governed by it the flexibility to make it work in practice and tailor it to exigencies without violating its text or spirit, the framers provided an amendment process to correct for unforeseen flaws or circumstances and used terse, open-textured language amenable to varying shades of interpretations in enumerating certain powers, rights, or prohibitions.²⁰⁶

Nonetheless, the Constitution was chiefly designed to overcome the deficiencies of the Articles of Confederation and other problems that had plagued the young republic.²⁰⁷ Consequently, it is silent on many issues that subsequently bedeviled the nation governed by it in intervening centuries. Two notable examples that frustrated subsequent generations in the first half of the nineteenth century were the definition of national citizenship and its relationship to state citizenship, and in the latter half of the twentieth century and early twenty-first century, the issue of abortion.

Because the original Constitution did not explicitly indicate how federal citizenship was defined or acquired, the Supreme Court ultimately decided the question of whether persons of African descent were or could become United States citizens, which it did in the most notorious case of Dred Scott v. Sandford.²⁰⁸ The Court’s decision was reversed by opening line of the Fourteenth Amendment.²⁰⁹ Similarly, the issue of abortion has proven to be highly divisive and one of the most deeply contested legal matters of the last fifty years. Advocates and jurists often look beyond the explicit text to the structure of the Constitution and the history and traditions of the republic at certain points in time to try to resolve these matters.²¹⁰

Unfortunately, extreme manipulation of political districting processes is another issue upon which the Constitution remains explicitly silent. This does not mean federal officials are powerless to address it. Constitutional text provides indirect solutions, such as the affirmative powers afforded Congress under the Elections Clause to “make or alter” the laws for elections to the federal legislature and implicit protections made by inferences drawn from other provisions, such as the Equal Protection Clause. Nonetheless, as a result of the lack of constitutional specificity regarding this issue, certain problems generated by this underlying phenomenon are treated differently, depending on the circumstances, the class of persons most affected, or the form of the districting problem.

This Article focuses on the problem of the entanglement of race and partisanship in the judicial review of gerrymandering claims. It conducts a brief history of gerrymandering, examines the divergent lines of cases, reveals the factors that contribute to the growing problem of gerrymandering, including the relationship between political segregation and racial residential segregation, and closes with a survey of possible solutions grounded in the constitutional text and structure.

Although Congress could potentially pass laws curbing gerrymandering in the states, and would have the authority to do so under Article I, Section 4, the core of the problem of gerrymandering is that it violates what Madison called the “fundamental principle of free government”—that of majority rule, and therefore should be within the cognizance of the Constitution, not ordinary legislation, to resolve. This is true even though the Constitution is silent on it, an omission that is adequately compensated for by the applicability of indirect provisions that the framers included such as the Guarantee Clause and subsequent Amendments, most notably the Fourteenth, requiring equal protection of the law.

Blame for underestimating the rise of political parties and the harmful effects of extreme partisanship cannot be entirely placed on insufficient foresight of the framers. A number of developments have contributed to the intensity of political partisanship in the federal government, including the enlargement of the sphere of national politics, the evolution of the information and media environment, and technological developments.²¹¹

There is no way that the framers could have fully anticipated the extremities toward which political districting processes designed for partisan purposes might distort many of the principles of representative government that they sought to institutionalize. In particular, they could not have anticipated the development of modern technological tools such as computer databases and programs such as Geographic Information System (“GIS”) technology that would easily permit state legislators to essentially select their voters rather than the other way around. But the framers’ insufficient foresight does not leave us helpless.

Whether the remedy lies in an act of Congress or the courts applying a synthetic reading of the Constitution as a whole, Carolene Products footnote four, the Guarantee Clause, or a novel reading of the Equal Protection Clause, partisan gerrymandering is a problem for our health of democracy that requires resolution. It is a problem in its own right because it undermines the values and foundation of the republic, and because it causes and results in the racial segregation of voters in clear violation of the Constitution without necessarily running afoul of the standards established by the Supreme Court to secure those protections.

96 S. Cal. L. Rev. 301

Download

Stephen Menendian*

* Stephen Menendian is the Assistant Director at the Othering and Belonging Institute at University of California, Berkeley. The author would like to thank john powell and Dan Tokaji for their insights on this critical issue, Chris Elmendorf and Joshua Clark for their invaluable expert feedback on drafts of this Article, Samir Gambhir and Peter Mattingly for their contributions to the underlying research regarding segregation and assistance in developing the maps and scatterplots, Wenqi (Michael) Xu, Sara Osman, and Yemaj Sheik for their general research and citation assistance.

Fractionalization to Securitization: How the SEC May Regulate the Emerging Assets of NFTs

April 2023April 2023Lauren Au*

Blockchain technology opened the world to a variety of new technological advances that reshaped the way humans interact and transact with one another. One of the most recent and trending applications of blockchain technology is non-fungible tokens or “NFTs.” NFTs are unique digital tokens encoded on a blockchain that represent ownership of specific digital assets such as artwork, collectibles, videos, domain names, and so forth.¹ NFTs can be thought of as certificates of authenticity. Although NFTs resemble cryptocurrencies, NFTs are non-fungible. This means that no two tokens are identical, and they are not interchangeable with one another. They are valuable because each comes with a unique digital signature or ledger that allows it to be easily authenticated, verified, and transferred. This has completely revolutionized the way people trade different assets, and many NFTs are sold online for millions of dollars.² Additionally, NFTs can come in different forms, ranging from whole NFTs of digital artwork or real property to fractionalized NFTs (“f-NFTs”) that break up ownership of an NFT into multiple “shards” so a larger number of people can own a piece of a single digital asset.³

NFTs are a new and influential technology that can have far-reaching implications for current securities law, intellectual property law, and other legal areas. In securities law, NFTs have established a new way for people to invest and gain returns from digital assets. This has disrupted the legal and financial sectors and created new risks for investors such as fraud and hacking.⁴ With the recent rise of NFTs as potential investment assets comes the possibility of government regulation to protect investors. The growing use of NFTs alerted many regulators, such as the Securities and Exchange Commission (“SEC”), to the possibility of regulating these digital assets as some type of security.⁵ However, regulatory and securities laws struggle to keep pace with emerging innovations and financial technologies like NFTs. Much of the SEC’s limited guidance focuses on cryptocurrencies and blockchain technology generally, with little guidance specifically on NFTs as a security. Leaders in the industry have requested no-action letters, although the SEC remains silent.⁶ Leaders believe “it would be a lot easier to operate in an environment where sensible ground rules are laid out that allow for innovation.”⁷ NFT creators, buyers, and exchange platforms must rely on general SEC regulations of other digital assets to guide their decision-making and avoid regulation. Given these issues, it is important for the SEC to provide guidance on NFTs to protect and inform potential investors, while also ensuring issuers can properly develop NFTs and NFT platforms without the fear of strict regulation.

This lack of guidance stems from the fact that many regulators are divided on whether NFTs can be classified as an “investment contract” security or regulated by the SEC. The Securities Act of 1933 defines a “security” as “any note, stock, . . . bond, debenture, . . . [or] investment contract.”⁸ In 1946, the U.S. Supreme Court developed a four-pronged test in SEC v. Howey to clarify whether an asset is an “investment contract” security.⁹ The Howey test holds that a contract, transaction, or scheme is an “investment contract” when an individual (1) makes an investment of money (2) in a common enterprise (3) with a reasonable expectation of profit (4) derived from the efforts of others.¹⁰ While some argue that NFTs are not an “investment contract” security under the Howey test because they do not satisfy either the second or fourth prong, others believe that fractionalized NFTs could pass all four prongs.¹¹ There has been little in-depth legal research and analysis that focuses specifically on f-NFTs as securities and the potential regulatory framework that could control this digital asset.¹² The legal field must catch up with rapid technology developments and take a revised look at current regulations to see how they can be applied to NFTs. To analyze if the SEC can regulate NFTs, two main questions need to be addressed: (1) whether certain NFTs can be classified as a security under federal law; and (2) if NFTs are securities, how SEC requirements can be applied to best protect the public’s interests.

To answer these questions, this Note will apply the Howey test to f-NFTs and identify the risks and opportunities of regulating them as securities to better understand how to protect investors while also allowing for the innovation of digital assets. This Note will first conclude that NFTs can be “investment contract” securities and satisfy the four Howey prongs if they are fractionalized. First, when purchasing f-NFTs, buyers make an investment using digital currency that is considered “money.” Second, having an NFT tied to the success of a company or celebrity, or having multiple fractional interests in an NFT that are shared by a pool of investors, are investments in the “common enterprise” of that individual company, celebrity, or whole NFT. Third, f-NFTs have a “reasonable expectation of profit” given that they are easily traded on secondary markets and promoted as a unique way to “unlock liquidity.” Lastly, an f-NFT’s financial return can be derived from the efforts of platforms or issuers to maintain or improve the f-NFT market and support the popularity or price of the digital asset. This Note will then explain that even if f-NFTs are deemed securities, the SEC will need to adopt a clearer regulatory framework for f-NFT issuers, buyers, and platforms by modernizing established regimes of other digital assets. The SEC may have trouble regulating issuers or buyers of f-NFTs because the decentralized networks of f-NFTs already provide a form of digital “registration” that gives sufficient information to investors and prevents fraud through the easily verifiable digital ledgers of an f-NFT’s transactions. However, a platform that creates and trades f-NFTs may be a security “exchange” under federal law, and thus the SEC may be able to place some modified regulations on these f-NFT platforms, such as notice and disclosure requirements or compliance with capacity, integrity, and security standards, which ensure f-NFT and investor protection.

Part I provides a general overview of NFTs by explaining the blockchain technology that powers them. This Part illustrates what NFTs are, how they work, the concept of fractionalizing NFTs, and the principal applications and potential importance of NFTs in the financial markets. Part II lays out the underlying securities law—in particular SEC v. Howey—and the SEC’s current regulatory framework for other blockchain-based financial assets such as cryptocurrencies and digital tokens. Part III applies the Howey test to f-NFTs to show that they can be classified as securities and bolsters this argument by comparing f-NFTs to a digital asset (DAO Tokens) that the SEC has previously determined to be an “investment contract.” Part IV analyzes the pros and cons of regulating certain NFT issuers, buyers, or exchange platforms and provides recommendations for an NFT regulatory framework using comparisons to other developed digital asset platforms. Part V provides a preliminary exploration of existing regulatory models like those that govern traditional stocks and Real Estate Investment Trusts (“REITs”) and how they can be applied to f-NFTs.

I. NFT BACKGROUND: A TECHNICAL OVERVIEW OF NFTS

A. TECHNICAL ASPECTS OF NFTS

An NFT is a cryptographic unit of data or digital signature stored in a “blockchain” that represents the ownership of a unique digital asset or real-life object.¹³ Since they use blockchain technology, NFTs are typically bought and sold online with cryptocurrency.¹⁴ NFTs are similar to cryptocurrencies such as Bitcoin and Ethereum because they all use blockchain technology to create a digital object (currency or token) using units of data on a digital ledger. The only difference is that digital currencies are meant to be fungible, in that one Bitcoin is the same as and interchangeable with another Bitcoin, while an NFT is meant to be non-fungible, in that each one is one-of-a-kind and not exchangeable with another NFT.¹⁵ The underlying data of an NFT is unique because there can only be one owner, and that person is the only one who can access or transfer that NFT. This non-fungibility and use of blockchain allow NFTs to have a built-in proof of ownership that is easily authenticated, create exclusivity, and allow for verified transfers.

B. BLOCKCHAIN TECHNOLOGY

NFTs rely on blockchain technology, which creates a secure, decentralized network for transactions of various digital assets. The blockchain is essentially a “chain” of “blocks,” each containing specific information regarding a digital asset and its transactions that is then stored on a digital, secure, peer-to-peer ledger.¹⁶ An NFT is a digital database that stores data in the form of a “smart contract” and a unique identification hash bundled together in “blocks” that are all “chained” together in a distributed network. A smart contract is defined as “a computerized transaction protocol that executes terms of a contract” and is meant to minimize fraud and transaction costs.¹⁷ In other words, smart contracts are programs stored on a blockchain that automatically execute certain terms of a contract once certain predetermined conditions are met.¹⁸ Each blockchain “block” contains three components: (1) data, (2) the hash of the block, and (3) the hash from the previous block.¹⁹ The hash is a digitally generated string of digits and letters used to identify each block in a blockchain structure and acts as a type of unique fingerprint.²⁰ The data for an NFT “block” includes a “smart contract” that points to where an NFT is located on the internet and how to retrieve it, dictates the terms of a transaction, provides a verification of ownership, and holds a ledger of the token’s ownership history and transaction record.²¹

FIGURE 1: NFT Blockchain Sequence Diagram

An issuer creates an NFT by deploying a code to develop a specific type of “smart contract” that contains a blockchain address, typically on the Ethereum Blockchain, where the smart contract resides.²² Later, when someone buys or sells an NFT, the blockchain automatically creates a new “block” and a new hash for this block to add this new transaction to the “chain.”²³ The blockchain is essentially recording a “change of state” to the NFT in which the “smart contract” updates its internal ledger and changes the structure of the NFT’s underlying blockchain to reflect the transfer of the NFT to and from different addresses.²⁴ In short, whenever an NFT is sold, this new ownership is noted as a “new block” in the blockchain ledger, and the digital hash of that NFT is changed.

When purchasing an NFT, you are only buying exclusive access to the unit of data that contains the NFT’s location and are relying on the issuer’s obligation to ensure authenticity.²⁵ You do not gain any property rights of the actual digital asset, such as intellectual property rights (right to copy, right to destroy, and so forth.). Similar to buying a painting, when buying an NFT, you are only buying display rights or the right to say that you own it, but nothing else. You are mostly buying a digital certificate of ownership and authenticity or unique access to a digital object, not the actual digital object itself.²⁶ In other words, you own the one-of-a-kind map of where the NFT is located and are the only one who has access to it. The underlying NFT is typically hosted or located on a regular Hypertext Transfer Protocol (“HTTP”) Uniform Resource Locator (“URL”) web address on the internet or on an InterPlanetary File System (“IPFS”) hash, which is a “system designed for hosting, storing, and accessing data in a decentralized manner.”²⁷ Using a regular HTTP web address is typically very risky given that a server owner could easily change the underlying content of that particular address and completely erase the actual NFT content that was originally purchased.²⁸ However, when housing an NFT on IPFS, the NFT gets assigned a unique content identifier (“CID”) hash that links to the data in the IPFS network.²⁹ Using an IPFS CID hash, as opposed to an HTTP URL, allows someone to find the NFT based on its content rather than by its location on a server. Thus, if the content of the NFT is changed, the original CID link would break and create a new one.³⁰

Even though NFTs only give a type of “bragging rights,” they provide various advantages that have changed the tech and financial markets. The benefits of an NFT are that it is easy to authenticate its originality, establish its exclusivity, and transfer the asset.³¹ The permanent digital ledger inherent in an NFT acts as a record of ownership and allows for easy traceability across the blockchain network so that the original creator or past owners can be easily traced through their past transactions.³² This has made NFTs a highly valuable avenue to establish verified ownership over assets such as digital artwork, digital trading cards, video highlight reels, social media posts, collectibles, and even real property.³³ An NFT also has the unique capability to internally incorporate royalty agreements into its “smart contract,” where it automatically carries out an agreed-upon payment system whenever the NFT is licensed, resold, or used for some particular purpose.³⁴ This has provided content creators with new ways to continuously and easily monetize their work through NFTs. Lastly, NFTs have created a new way for people to invest their money in digital assets. With billions of dollars recently being poured into the NFT market, many investors have flocked to these digital assets as a potential high-risk investment strategy.³⁵ However, NFTs have become the target of some security breaches and hacking due to their novelty and outdated or inefficient security protocols.³⁶ Additionally, the value of NFTs and their potential returns can be volatile and speculative because they are only worth as much as other people are willing to pay for them.³⁷ An NFT’s appreciating value seems to be derived either from its creator or its scarcity.³⁸ Thus, depending on these two factors, investors could either win the jackpot to resell their NFT for a large gain or end up with a worthless digital asset and a large loss.

C. FRACTIONALIZATION OF NFTS

One major innovation that has disrupted the way people view and use NFTs as investments is the concept of fractionalizing NFTs. Fractional NFTs, or “f-NFTs,” break an NFT into pieces, or “shards,” which can be subsequently traded and sold in the market at a lower price than the NFT as a whole.³⁹ F-NFTs represent a fraction of the larger digital asset in which an investor can now share a partial interest in an NFT with other investors.⁴⁰ Given that NFTs are routinely sold individually for thousands or millions of dollars, f-NFTs democratize these investments such that average investors can now purchase a smaller portion of a high-priced NFT.⁴¹ F-NFTs opened up access to NFT markets and allowed more people to invest in these new digital assets.

There are currently multiple platforms that facilitate the creation and trading of f-NFTs, such as Niftex, Fractional.art, and DAOfi. These f-NFT platforms allow owners to break NFTs into multiple shards and sell them at an initial fixed price.⁴² The shards can subsequently be traded in an open market on the platform. On Niftex, an f-NFT is created through a four-step process: (1) “Owners of NFTs create fractions (‘shards’) by choosing issuance and pricing”; (2) these fractions are then put on sale on the platform at a fixed price for two weeks or until they sell out; (3) once the fixed sale period ends, the fractions can be traded on a secondary market; and then (4) a whole NFT can be fully retrieved by purchasing all of the shards through the platform’s special “Buyout Clause.”⁴³ This Buyout Clause is embedded within an f-NFT’s smart contract and gives f-NFT investors who own a particular percentage of an NFT’s shards the opportunity to purchase the remaining shards to now own the whole NFT.⁴⁴ F-NFT platforms have also incorporated the ability to automatically give issuers a portion of the f-NFT created or to give some type of “curator fee.”⁴⁵

NFT issuers and platforms have become very creative in the ways in which they utilize and develop this digital asset. One theory is that platforms could put numerous NFTs into one basket and sell f-NFTs of that basket as an investment product or security (“f-NFT bundles”).⁴⁶ While some people do not think a traditional NFT could be a security, an f-NFT may be deemed a security under U.S. securities law. SEC Commissioner Hester Peirce warned issuers of f-NFTs that “the whole concept of an NFT is supposed to be non-fungible [meaning that] in general, it’s less likely to be a security,” but if issuers sell fractional interests in NFTs or NFT bundles, “you better be careful that you’re not creating something that’s an investment product—that is a security.”⁴⁷ Peirce argued that “the definition of a security can be pretty broad,” and thus f-NFTs could fall within the SEC’s definition of a security and be subject to some form of regulation.⁴⁸ With the high costs of a single NFT, the growing availability of blockchain platforms in the mainstream, and the large development of decentralized finance and decentralized applications, “the continued fractionalization of NFTs is almost inevitable.”⁴⁹

II. LEGAL BACKGROUND: DEFINING “SECURITIES”

The main statutes governing securities regulation are the Securities Act of 1933 (“Securities Act”) and the Securities Exchange Act of 1934 (“Exchange Act”). While the Securities Act mostly deals with the issuance of securities, the Exchange Act governs exchanges, brokers, and trading on secondary markets.⁵⁰ Together, these statutes establish a registration and disclosure regime that requires any offer or sale of securities to register with the SEC and any issuers of securities to provide accurate and complete disclosures of material information regarding their securities offering or company. These requirements provide key information to investors so that they can make the most informed decisions. The consequences of being subject to these registration and disclosure requirements include filing documents with the SEC any time you sell securities, such as a Form S-1 registration statement, and filing continuous, periodic reports regarding the company’s business operations and financials, such as Form 10-K, Form 10-Q, or Form 8-K.⁵¹ These statutes, along with regulatory rules, provide definitions and tests to help determine whether an asset is a “security” or an organization is an “exchange” that is subject to federal regulation.

A. SECURITIES ACT OF 1933

The Securities Act makes it illegal for an issuer to offer or sell any unregistered security within interstate commerce unless the security is exempt from registration.⁵² This statute defines an “issuer” as “every person who issues or proposed to issue any security,” where “person” includes “an individual, a corporation, . . . [or] any unincorporated organization.”⁵³ It also provides a broad definition of different types of assets that could be considered securities under U.S. federal securities law.⁵⁴ This definition specifically includes “investment contracts,” which can be seen as a catch-all term for any type of asset that behaves and feels like a security. Thus, it is sometimes difficult to determine if something falls within the definition of a security.

B. SEC V. HOWEY

In SEC v. Howey, the U.S. Supreme Court created the Howey test to help clarify what an “investment contract” security is under the Securities Act. The defendant, W.J. Howey Company, sold real estate contracts for orange groves in Florida for a fixed price per acre.⁵⁵ Howey then encouraged purchasers to set up service contracts in which they would lease the land back to the company to farm the orange groves, and in exchange the buyers would receive a share of the profits.⁵⁶ The Supreme Court held that these orange grove service contracts were “securities,” because purchasers were buying shares in Howey’s profits from the orange groves through these service contracts, not the actual orange groves themselves.⁵⁷ The Court developed a four-pronged test in which “an investment contract for purposes of the Securities Act means a contract, transaction or scheme whereby a person invests his money in a common enterprise and is led to expect profits solely from the efforts of the promoter or a third party.”⁵⁸ There have since been a variety of cases that helped develop and clarify each of the four Howey prongs. The first prong of “an investment in money” does not need to be in the form of cash and can be satisfied using a different form of contribution or investment, such as cryptocurrency.⁵⁹

The second prong of “in a common enterprise” requires that the fortunes of the investor be linked to the success of the overall venture or enterprise. “Fortunes” refers to the “profits” (and benefits) or “losses” (and costs) that occur from a certain asset and that affect a person’s position.⁶⁰ There needs to be a kind of commonality or relationship, either among investors or between the “promoter” and investors, in which the investor depends on the actions and decisions of the promoter of the asset.⁶¹ A promoter is defined as any individual or organization that helps found and organize the business or enterprise of an issuer of any security or that receives ten percent or more of any class of the issuers securities or proceeds from the sale of such securities as consideration for their services or property.⁶² Federal courts have typically required that there be either “horizontal commonality” or “vertical commonality” for an asset to satisfy the “common enterprise” prong.⁶³ Horizontal commonality is defined as the relationship between investors and a pool of other investors. There is commonality when an individual investor’s fortunes are tied to the fortunes of other investors in a common venture by the pooling of assets, usually combined with the

pro-rata distribution of profits.⁶⁴ Vertical commonality is defined as the relationship between the promoter and the body of investors.⁶⁵ Commonality exists when there is a connection between the fortunes (strict vertical commonality) or efforts (broad vertical commonality) of the promoter and the fortunes or efforts of the investors. This type of commonality does not require a pooling of funds.⁶⁶

The third prong of “a reasonable expectation of profits” requires investors to realize some form of appreciation on the development of the asset or participate in the earnings resulting from the use of investors’ funds.⁶⁷ The SEC defines “profits” as “capital appreciation resulting from the development of the initial investment or business enterprise or a participation in earnings resulting from the use of purchasers’ funds.”⁶⁸ Courts also include “dividends, other periodic payments, or the increased value of the investment” in the definition of profits.⁶⁹ However, the SEC notes that “price appreciation resulting solely from external market forces (such as general inflationary trends or the economy) impacting the supply and demand for an underlying asset generally is not considered ‘profit’ under the Howey test.”⁷⁰ This prong is very fact-sensitive, and the SEC looks at several factors, like the trading of the asset on secondary markets, identity of the buyers, and marketing efforts, to determine whether an asset satisfies this prong.⁷¹

Finally, the fourth prong of “from the efforts of others” is satisfied when the promoter or issuer of an investment creates or supports the market for these assets or the value of the asset is dependent on the promoter’s efforts in generating demand.⁷² In Howey, the Supreme Court understood that the Securities Act’s definition of a “security” is broad, so it argued that “[f]orm was disregarded for substance and emphasis was placed upon economic reality.”⁷³ Thus, when determining whether something can be considered a security, one needs to focus on the specific circumstances, facts, and economic impact of the particular asset.⁷⁴ For an asset such as digital currencies or tokens, this test may consider factors such as the token’s design, issuance, and how it interacts with its platform or blockchain. Depending on how an NFT is created, structured, marketed, and sold or distributed, such NFTs could be deemed securities. This would mean that any sale of this NFT would be subject to the existing securities law framework.

C. CURRENT CASE LAW

Although there are few settled cases regarding whether certain digital assets are securities, there are a couple of key cases making their way through the court system. One of the leading cases being decided is SEC v. Ripple Labs, Inc., in which the SEC filed an enforcement action against Ripple Labs for selling crypto tokens that the SEC believed were unregistered securities.⁷⁵ The SEC argues that Ripple Labs failed to register its offer and sale of about $600 million of its digital asset called XRP to retail investors, which was used to finance the business. The SEC stated that XRPs were investment contract securities because purchasers of XRP invested into a common enterprise, given that XRP’s demand is tied to Ripple’s success or failure in propelling its trading, and Ripple publicly promised investors that it would “undertake significant entrepreneurial and managerial efforts to create a liquid market for XRP” that would in turn increase its uses, demand, and price, and led reasonable investors to expect profits from XRPs.⁷⁶ Another notable case that provides arguments for and against regulating NFTs as securities is the class action lawsuit filed against Dapper Labs.⁷⁷ Dapper Labs created the National Basketball Association’s (“NBA”) Top Shot, which sells NFTs of NBA highlights or “Moments” that can be bought or sold using the blockchain and marketplace Dapper Labs developed.⁷⁸ This class action argues that Dapper Labs is selling securities due to how it operates its resale marketplace and promotes the value of its NFTs. The plaintiffs allege that Moments were sold with “the expectation of profit” where “[t]he reality is that the growing fanatical NBA Top Shot database is all about the investment, speculation and appreciation of the Top Shot NFTs and the NBA Top Shot Marketplace.”⁷⁹ However, the plaintiffs conceded that NBA Top Shot’s Service Terms of Use state that users “are using NFTs primarily as objects of play and not for investment or speculative purposes.”⁸⁰ NBA Top Shot is promoting the NFTs as collectables as opposed to investments, which weighs in favor of the NFTs not being securities. Nevertheless, some argue that NBA Moments may still be “investment contracts” because Top Shot creates and maintains the sole marketplace for these NFTs and thus could be an unregistered exchange.⁸¹

D. SECURITIES EXCHANGE ACT OF 1934

Once an asset is deemed a “security,” the SEC and the Exchange Act impose numerous regulatory requirements on the “exchanges” or platforms that facilitate the trading of those assets. Section 5 of the Exchange Act makes it unlawful for any broker, dealer, or exchange to effect any transaction in a security unless the exchange is registered as a national securities exchange under section 6 of the Exchange Act or an appropriate exemption applies.⁸² Registration as a national securities exchange requires any person or entity that offers or sells securities to the public to provide “full and fair disclosure” through the delivery of a statutory prospectus that contains information necessary to give prospective purchasers the proper opportunity to make an informed investment decision.⁸³ Under the Exchange Act, an “exchange” is defined as any organization or group of persons (whether incorporated or unincorporated) that maintains or provides “a market place or facilities for bringing together purchasers and sellers of securities” or conducts functions commonly performed by stock exchanges.⁸⁴ The Code of Federal Regulations attempts to clarify when an entity must register as a national security exchange and provides a functional test to assess whether an entity meets the definition of an “exchange” under the Exchange Act. Rule 3b-16(a) states that an organization, association, or group of persons is considered to constitute or maintain an “exchange” if it (1) “brings together the orders for securities of multiple buyers and sellers” and (2) “uses established, non-discretionary methods (whether by providing a trading facility or by setting rules) under which such orders interact with each other.”⁸⁵ Rule 3b-16(b) then lays out what is excluded from the definition of an exchange.⁸⁶ The SEC has argued that when analyzing whether a “system operates as a marketplace and meets the criteria of an exchange under Rule 3b-16(a),” one must look to “the activity that actually occurs between the buyers and sellers—and not the kind of technology or the terminology used by the entity operating or promoting the system.”⁸⁷ Thus, any trading system that meets the definition of an exchange under

Rule 3b-16(a), and is not excluded under Rule 3b-16(b), must register as a national securities exchange or operate pursuant to an appropriate exemption.⁸⁸

Exempted entities do not need to register as a national securities exchange under section 6.⁸⁹ Rule 3a-1-1(a)(2) states that an organization, association, or group of persons is exempt from the definition of “exchange” if it is operating as an alternative trading system (“ATS”) and is in compliance with Regulation ATS.⁹⁰ ATSs are SEC-regulated electronic trading systems that utilize the process of “dark pools” to match orders for buyers and sellers of securities.⁹¹ The SEC released a report regarding its adoption of new rules and amendments that allow ATSs to “choose whether to register as national securities exchanges, or to register as broker-dealers and comply with additional requirements under Regulation ATS, depending on their activities and trading volume.”⁹² ATSs typically face fewer and simpler regulations than national securities exchanges but still have some requirements, such as registering as a broker-dealer, giving notice of initial operations or material changes, providing fair access, keeping records, complying with capacity, integrity, and security standards, and other reporting requirements to safeguarding customer funds and securities.⁹³

E. RULES, REGULATIONS, AND GUIDANCE FROM AGENCIES

In addition to statutes, issuers and platforms of digital assets also rely on statements, reports, and frameworks from the SEC and other regulatory bodies to guide their decisions. As digital assets grew in popularity, the SEC took notice and came out with formal and informal statements regarding its views on cryptocurrencies and tokens. In 2018, SEC Chairman Jay Clayton testified before a Senate committee arguing that cryptocurrencies could be structured as securities products subject to federal securities laws and warned that certain Initial Coin Offerings (“ICO”) structures could implicate securities registration requirements.⁹⁴ More recently, at the Security Token Summit 2021, Peirce warned issuers of NFTs to be cautious when they create f-NFTs because when used in certain creative ways, they could create a security that is subject to regulation.⁹⁵

The SEC created a branch in 2018 called the Strategic Hub for Innovation and Financial Technology (“FinHub”) to coordinate and respond to emerging financial technology (“fintech”); serve as a public resource by consolidating, clarifying, and communicating the SEC’s views and actions related to fintech innovation; and inform policy research in these areas.⁹⁶ In 2019, FinHub published an SEC document called Framework for ‘Investment Contract’ Analysis of Digital Asset, which provided details on how the SEC applies the Howey Test to analyze whether digital assets could be considered an “investment contract” security.⁹⁷ This is one of the few documents available to guide digital asset creators and platforms.

Although guidance from the SEC regarding digital assets is sparse, there is some case law and reports from the SEC. For example, the SEC issued an enforcement order against the creator of EtherDelta, which provides a marketplace for bringing together buyers and sellers of digital asset securities through the combined use of an order book, a website that displayed orders, and a smart contract run on the Ethereum blockchain.⁹⁸ This case held that EtherDelta violated section 5 of the Exchange Act because it issued digital asset securities using blockchain technology as an unregistered exchange.⁹⁹ This is one of the main cases analyzing whether a platform that houses digital assets can be an unregistered security exchange. Other regulatory bodies have provided reports of their research into the intersection of digital assets and securities law. For example, the Congressional Research Service (“CRS”) published a report containing a broad outline of how federal securities laws and regulations apply to cryptocurrencies, ICOs, and NFTs.¹⁰⁰

The SEC also published the Decentralized Autonomous Organization (“DAO”) Report, which discusses U.S. federal securities laws and their applicability to the new paradigm of “virtual organizations or capital raising entities that use distributed ledger or blockchain technology to facilitate capital raising and/or investment and the related offer and sale of securities.”¹⁰¹ The purpose of this report of investigation is to “advise those who would use a [‘DAO Entity’], or other distributed ledger or blockchain-enabled means for capital raising, to take appropriate steps to ensure compliance with the U.S. federal securities laws.”¹⁰² Slock.it created The DAO, which is a “for-profit entity whose objective was to fund projects in exchange for a return on investment.”¹⁰³ DAO Tokens represented a type of “crowdfunding contract” that would help raise “funds to grow [a] company in the crypto space.”¹⁰⁴ The DAO offered and sold DAO Tokens in exchange for Ether (“ETH”), a virtual currency used on the Ethereum Blockchain, and the proceeds from these sales were used to fund projects.¹⁰⁵ DAO Token holders had the right to vote on these projects and were entitled to any anticipated earnings from the projects it funded.¹⁰⁶ The DAO platform also had a group of individuals called “Curators” who were given “considerable power” to perform “crucial security functions” and maintain “ultimate control over what projects would be submitted to, voted on, and funded by The DAO.”¹⁰⁷ In applying the Howey test to the DAO Token, the SEC’s DAO report found that the tokens meet the criteria of a security and The DAO was required to register as an exchange under Rule 3b-16.¹⁰⁸

Even though there is some guidance for blockchain technologies generally, the SEC has not yet provided any guidance regarding NFTs specifically. Given this small amount of advice, many people have requested that the SEC provide regulatory clarity with respect to NFTs so that they know how to proceed.¹⁰⁹ These requests for guidance come in the form of “no-action” letter requests that encourage “the SEC to engage in a meaningful discussion of how to regulate FinTech companies and individuals that are creating NFTs that may be deemed digital asset securities and the platforms that facilitate the issuance and trading of NFTs.”¹¹⁰ The existing securities framework provides a “crude mechanism” for regulating NFTs, and the SEC needs to reevaluate or reapply these old frameworks to new financial technologies to establish sustainable guidance and prevent NFTs from becoming the “Wild West” of digital investments.¹¹¹

III. HOWEY TEST: ARE F-NFTS SECURITIES?

Although there are few articles or regulations specifically addressing NFTs, the current view is that NFTs may not be an “investment contract” security that can be regulated by the SEC because an NFT may gain its value through its uniqueness, as opposed to “a common enterprise” (second Howey prong), and any profits realized through an NFT may be derived from regular supply and demand, as opposed to the “efforts of others” (fourth Howey prong).¹¹² However, to determine an NFT’s ability to be categorized as a security, regulators need to focus on the “economic reality” and specific circumstances, such as how society defines the NFT’s value, how it is utilized, or how it is marketed. On one hand, if the purchaser is a collector and the NFT’s value comes from its uniqueness and artistry, the main purpose of buying the asset is to “consume” it by enjoying its aesthetics; the NFT may also be marketed as allowing buyers to join the ranks of premier owners and connoisseurs of unique digital objects. In such a scenario, an NFT is less likely to be a security. For example, some people may buy a Pudgy Penguins NFT from OpenSea (an NFT exchange website) because they think it is adorable and just want to look at it or display it as a profile picture on social media.¹¹³ On the other hand, if the purchaser is an investor and the NFT’s value comes from its ability to gain a return on investment, the main purpose of buying the asset is to sell it later for a profit; or if it is marketed as an asset that will appreciate in value to give a substantial return, then an NFT is more likely to be security. Some purchasers’ main goal in buying a Pudgy Penguin may be to increase their capital.¹¹⁴ In the end, NFTs may gain value from both their uniqueness and their ability to provide a return on investment.

Another prevailing view is that fractionalizing NFTs could create a type of security that is subject to regulation.¹¹⁵ F-NFTs could be an investment contract under the Howey test depending on the facts and circumstances of the particular f-NFT, such as if you put multiple NFTs into one basket and then sell f-NFTs out of that basket.¹¹⁶ Although the SEC has yet to initiate any enforcement action against creators or platforms that facilitate the offer and sale of f-NFTs, the SEC and courts have held in many cases that fractional interests in an asset can be a security even if the individual asset itself is not.¹¹⁷ This Part applies the four prongs of the Howey test to analyze whether an f-NFT can be an “investment contract” security and compares

f-NFTs to the DAO Token, which has already been deemed a security.

A. “MAKES AN INVESTMENT IN MONEY”

F-NFTs most likely satisfy the first prong of the Howey test given that people buy f-NFTs using cryptocurrency. The SEC argues that most digital assets, such as f-NFTs, pass the first Howey prong because they are purchased through an exchange for value.¹¹⁸ It does not matter that this exchange for value is in the form of digital currency such as cryptocurrency. Courts have held that an “investment of money” does not need to be in the form of cash, and thus purchasing something with cryptocurrency, as is the case with NFTs or f-NFTs, would satisfy this definition.¹¹⁹ When comparing f-NFTs to the DAO Token, both of these digital assets make an “investment in money” because both purchasers of the DAO Token and f-NFTs use ETH, the digital currency used on the Ethereum blockchain, to buy their respective digital assets.

B. “IN A COMMON ENTERPRISE”

A traditional NFT may not pass this second Howey prong because its value stems from its uniqueness—not a common enterprise—and there may not be a relationship between the seller or promoter of an NFT and a buyer or investors in that NFT. However, the SEC’s FinHub stated that a “common enterprise” typically exists for investments in digital assets because the fortunes of individual purchasers of digital assets are tied to other investors or tied to the success of the promoter’s efforts to expand a digital asset platform.¹²⁰ Also, courts have determined that the “common enterprise” prong is a distinct element of an investment contract analysis and “does not require vertical or horizontal commonality per se.”¹²¹ Thus, there are some arguments that f-NFTs may pass the second prong and have a common enterprise.

Horizontal commonality can be shown for f-NFTs through the fact that if a person owns a partial ownership interest in an underlying NFT, the value of this shard is tied to the fortunes of all the owners of the other shards of that fractionalized NFT.¹²² If the value of the underlying NFT increases, the value of each of its shards also increases. Thus, a common enterprise can be found through the relationship between an investor of an f-NFT and the pool of other investors who share ownership of the same fractionalized NFT. One of the very reasons to fractionalize an NFT is to enable smaller investors to “pool resources” together to purchase a smaller interest in an NFT and share in the returns of the whole NFT.¹²³ This is similar to the investors in the DAO Token who pooled together ETH to help The DAO fund large projects with the hope of a return on their investments.¹²⁴ Both the DAO Token and f-NFTs can satisfy horizontal commonality by pooling investors’ assets and tying their interests together. Also, an NFT can be part of a series of similar NFTs, like a collection of artworks by the same person, where the value of one will rise and fall along with the value of the others in the series.¹²⁵ The fortune of one NFT investor in the series may be tied to the increase and decrease in fortune of the other NFT investors in the same collection.

F-NFTs may also satisfy the vertical commonality requirement, given the relationship between the original issuer of the f-NFTs (promoter) and all the purchasers of the f-NFTs (body of investors). A common enterprise exists under broad vertical commonality when the investors are dependent on the promoter’s efforts or expertise for their increased returns.¹²⁶ For f-NFTs, a common enterprise may exist because the success of f-NFT investors gaining returns is dependent on f-NFT companies making the effort to fractionalize or bundle different NFTs and maintain the platform to protect f-NFTs and keep trading running. Additionally, strict vertical commonality can be established if f-NFT platforms gain some type of fee percentage from their efforts in fractionalizing and selling f-NFTs. Thus, if f-NFT platforms actively manage or charge fees for handling these assets, then the fortunes of f-NFT platforms are connected to the fortunes of the f-NFT investors. When f-NFT investors succeed, so does the f-NFT company.

Even certain, whole NFTs may pass the vertical commonality test. For example, many college and professional athletes have been creating NFTs of themselves through digital artwork, highlight reels, and other digital assets.¹²⁷ These NFTs may satisfy the “common enterprise” requirement because the value of the NFT would depend on the rise and fall of the athlete’s career and how much effort that athlete put into increasing their popularity. If the particular athlete who is issuing an NFT does better professionally in their sport or increases in popularity, then the value of their NFT may also increase. In other words, the fortunes of the owners of the athlete’s NFT would increase in correlation with the fortunes or the career of the athlete also increasing. The same argument can also be made for NFTs from specific artists or celebrities, such as Beeple or Martha Stewart.¹²⁸ Investors of Beeple’s NFTs have their fortunes tied to the efforts of Beeple and his other artworks. The value of an investor’s Beeple NFT will benefit from Beeple and his other artwork becoming more popular or valuable. Thus, there are good arguments that f-NFTs fulfill the second Howey prong.

C. “WITH A REASONABLE EXPECTATION OF PROFIT”

F-NFTs can satisfy the third Howey prong if purchasers buy f-NFTs with the expectation that they will realize some type of gain or profit. Given that this prong is heavily fact–sensitive, the SEC provided a list of characteristics that make it more likely for a digital asset to fulfill the “reasonable expectation of profits” prong.¹²⁹ F-NFTs seem to satisfy three of the characteristics listed: (1) the digital asset is “transferable or traded on or through a secondary market or platform,” (2) the issuer continuously “expend[s] funds from proceeds or operations to enhance the functionality or value of the network or digital asset,” and (3) the digital asset is marketed or promoted in a way that would cause a purchaser to have an expectation of profits. To determine whether an f-NFT can be classified as a security under this prong, one needs to focus on the transaction itself and the way the digital asset is offered and sold.¹³⁰

The first characteristic that increases the likelihood of f-NFTs fulfilling the third Howey prong is the fact that investors can transfer or trade these assets on secondary markets or online blockchain platforms.¹³¹ The ability to sell or buy NFTs or f-NFTs on secondary markets such as OpenSea provides proof that the investor may expect to realize some type of return or appreciation on the digital asset through secondary trading. This is much like how DAO Token holders were able to monetize their investments in DAO Tokens by reselling and trading them on various secondary trading platforms and markets.¹³²

The second characteristic that leans in favor of f-NFTs satisfying the third prong is the fact that f-NFT platforms may “provide essential managerial efforts that affect the success of the enterprise, and investors reasonably expect to derive profit from those efforts.”¹³³ The more likely that f-NFT issuers made efforts to increase the demand or value of the digital asset, the more likely the f-NFT will have a “reasonable expectation of profits.” Different cases have clarified that efforts to “increase the demand or value” include when issuers or platforms (1) create and manage an “ecosystem” for the digital asset which allows them to increase in value, (2) develop the network to inspire creative uses of its assets, or (3) add a new functionality using the proceeds from the token’s sales.¹³⁴ First, f-NFT platforms like Fractional.art and Nitfex made “essential managerial efforts” to increase demand or value of f-NFTs by taking continuous, active steps to fractionalize NFTs and make them more accessible to more investors. This created a new ecosystem where average investors could pool their funds together to share in the gains of valuable NFTs. Second, fractionalization networks inspired new creative uses such as bundling various NFTs together and selling f-NFTs of this bundle. The value of these f-NFTs would be dependent on the values of all the individual NFTs that the issuer chooses to place in the basket. Lastly, f-NFT platforms created a new functionality for NFTs by adding the ability to fractionalize one NFT into multiple shards. This allows purchasers to buy smaller interests in many different NFTs to diversify their collection, thus minimizing the volatility of this digital asset and increasing the potential returns.

This view of f-NFTs can be compared to the DAO Token that satisfied the third Howey prong because the proceeds from selling the DAO Tokens were used to fund different proposed projects in which holders had the potential to gain a share of the profits from these projects.¹³⁵ Also, much like how f-NFT platforms have created an ecosystem for the fractionalized assets, The DAO created a type of ecosystem for its “crowdfunding contracts.” While one may argue that f-NFT platforms are not using the proceeds from selling their tokens to directly improve their network, another may argue that f-NFT platforms collect fees from transactions that occur on their platform and then use these fees to maintain a secure network for f-NFT purchasers. Thus, this characteristic may depend on how the specific f-NFT platform is managed.

The third characteristic that makes f-NFTs more likely to provide a “reasonable expectation of profit” is the way in which f-NFTs are marketed to potential buyers. The SEC provided a list of ways a digital asset could be marketed that weigh in favor of the third Howey prong. F-NFTs may satisfy four of these methods: (1) the “intended use of the proceeds from the sale of the digital asset is to develop the network or digital asset”; (2) a key selling feature of f-NFTs is the ability to readily transfer it; (3) “[t]he potential profitability of the operations of the network, or the potential appreciation in the value of the digital asset, is emphasized in marketing or other promotional materials”; or (4) there is an available market for trading the digital asset or the issuer promises to create or support a trading market.¹³⁶ F-NFTs can satisfy these marketing characteristics, and many of them are also found in the DAO Token.

First, although current f-NFT platforms do not directly market that proceeds from f-NFT sales will be used to develop the network, one can assume that these platforms use the fees they collect from sales to maintain the network and allow for continuous fractionalization of NFTs. Second, the fact that f-NFTs are marketed as being easily transferable on platforms such as Niftex, Fractional.art, or DAOfi lean in favor of there being an “expectation of profit.”¹³⁷ This is similar to the DAO Token, which was promoted as being readily available to buy and sell on “a number of web-based platforms that supported secondary trading.”¹³⁸ Third, certain f-NFT platforms emphasize that these assets are a unique and better way to unlock liquidity, gain greater exposure and price discovery for your NFTs as fractions on the open market, trade NFTs with lower cost and greater diversification, get access to a variety of unique and iconic digital assets with low price thresholds, or provide liquidity for shard markets and earn transaction or curator fees.¹³⁹ These platforms focus on f-NFTs’ ability to increase exposure of a particular NFT in a market and diversify one’s investments in NFTs to spread out the risk of a single NFT losing value. Increased exposure and diversification can increase an f-NFT’s profitability, and a platform’s emphasis on this promotes an f-NFT’s appreciation in value. However, f-NFTs may simply be marketed as an easier, more accessible way for the average investor to partake in the NFT market.¹⁴⁰ If this is the case, it is less likely that f-NFTs satisfy the third Howey prong. The DAO platform emphasized its potential profitability by marketing it as an investment where purchasers could share in the profits of the proposed projects the DAO Token funded and thus gain a return on their initial investment.¹⁴¹ Although this is not exactly similar to how f-NFTs’ profitability were marketed, both seem to promise their purchaser some type of liquidity. Fourth, f-NFT platforms provide a readily available market for the trading of various f-NFTs. Creators or purchasers of f-NFTs can easily sell or buy these assets on different websites. These platforms support an f-NFT trading market by providing information regarding how the platform and fractionalization process operates and how the underlying technology works, a “frequently asked questions” section, a link to create or buy and sell f-NFTs, ways to “join the community,” and so forth.¹⁴² This is similar to the DAO Token issuers who supported a trading market for their token by developing a website, a link to detailed information regarding The DAO entity’s structure and source code, and a link to buy DAO Tokens; providing information on how The DAO operated; soliciting media attention; and posting on online forums.¹⁴³

A counterargument is that traditional NFTs are less likely to be a security because purchasers of traditional NFTs buy them for their artistry or bragging rights, proving that NFTs gain their value from their uniqueness, scarcity, or collectable status—not from any expected profits. An NFT’s value may just be based on the normal market forces of supply and demand, which is not considered “profit.”¹⁴⁴ The SEC also notes that digital assets are less likely to satisfy the Howey test if “[a]ny economic benefit that may be derived from appreciation in the value of the digital asset is incidental to obtaining the right to use it for its intended functionality.”¹⁴⁵ The intended functionality of an NFT may just be bragging rights or display rights, such as displaying a rare NFT artwork as your profile picture on your social media account. Thus, when an NFT increases in value, this may just be incidental to using the asset for its intended functionality of bragging rights. Also, if an f-NFT is marketed in a way that focuses on its role as a piece of digital artwork or a collectible, and not as an opportunity to gain any returns, this may work against f-NFTs being a security.¹⁴⁶ For example, some platforms market f-NFTs as a way to create more accessibility to the NFT market and not necessarily as a way to increase one’s returns.¹⁴⁷ Regulators will need to analyze the specific characteristics of certain f-NFTs and f-NFT platforms to determine whether they satisfy the third Howey prong.

D. “THROUGH THE EFFORTS OF OTHERS”

Some argue that although an NFT may provide the purchaser with a reasonable expectation of profits, this increase in financial returns is not derived from the “efforts of others” and instead comes from the NFT’s own scarcity and uniqueness. Thus, it may be more difficult to argue that an NFT satisfies the fourth and final Howey prong, which requires the asset’s increase in value to come from the “efforts of others.” While a traditional NFT may not fulfill this prong given that its value comes from its uniqueness, an f-NFT may be an exception because its value is derived from the efforts of the f-NFT platforms or issuers who support the f-NFT market. The fourth Howey prong is satisfied if an f-NFT issuer supports a market for f-NFTs or the value of these assets depend on the issuer’s efforts in generating demand.¹⁴⁸ Thus, if an NFT issuer or exchange puts in the work to develop the platform and increase buyers, and the purchasers reasonably expect a return based on this work, then an NFT may pass this last prong.

The SEC Framework for “Investment Contract” Analysis of Digital Assets lays out two key questions to consider when determining whether a digital asset can satisfy the “efforts of others” prong: (1) does the purchaser reasonably expect to rely on the efforts of an “Active Participant,” and (2) are those efforts “the undeniably significant ones, those essential managerial efforts which affect the failure or success of the enterprise”?¹⁴⁹ To help answer these questions, the SEC provided a list of six characteristics that lean in favor of a digital asset fulfilling the fourth Howey prong. While none of the characteristics are dispositive, they provide a good framework to help determine when a digital asset gains its value through the “efforts of others.” F-NFTs may satisfy some of the characteristics and thus satisfy the last Howey prong.

The first characteristic is that an issuer is “responsible for the development, improvement (or enhancement), operation, or promotion of the network, particularly if purchasers of the digital asset expect an [issuer] to be performing or overseeing tasks.”¹⁵⁰ Platforms that issue f-NFTs may have this characteristic because they are responsible for promoting the f-NFTs on their platforms, bringing more buyers onto their networks, and improving their networks by offering more products such as f-NFT bundles or automatic royalties embedded in smart contracts.¹⁵¹ These development efforts can increase the value of the actual platform and thus increase the value of the f-NFTs traded on that specific platform. Also, if a platform markets f-NFTs as producing profit based on royalty payments or f-NFT bundles, purchasers may expect that the issuers are putting in some type of managerial efforts to oversee the asset and increase its value. The value of an f-NFT could come from the efforts of a person or entity promoting, selling, choosing, developing, and managing different f-NFT royalties or bundles. This is similar to the DAO Token Curators who managed different projects for investors to create returns by deciding what projects would be submitted to, voted on, and funded by DAO Token holders.¹⁵² DAO investors relied on the “managerial and entrepreneurial efforts” of the Curators to manage The DAO network and project proposals because the creators of The DAO represented that they “could be relied on to provide the significant managerial efforts required to make The DAO a success.”¹⁵³

The second characteristic is that the issuer performs essential tasks or responsibilities, as opposed to “an unaffiliated, dispersed community of network users (commonly known as a ‘decentralized’ network).”¹⁵⁴ This reference to a “decentralized” network may work against f-NFTs being deemed a security because they are inherently run on a “decentralized” network. One can argue that the blockchain technology, smart contracts, and digital ledger perform the “essential tasks or responsibilities” for f-NFTs as opposed to the issuer or platform. However, the DAO Token was still deemed a security even though it utilized blockchain technology, and smart contracts performed tasks for the usage of the DAO Tokens.¹⁵⁵ Although

f-NFTs are run on a “decentralized” network, issuers can perform essential tasks such as fractionalizing NFTs, using their expertise to bundle NFTs, or maintaining the network to ensure the f-NFTs are protected.

The third characteristic is that an issuer “creates or supports a market for, or the price of, the digital asset,” which can include (1) “control[ing] the creation and issuance of the digital asset,” or (2) “tak[ing] other actions to support a market price of the asset, such as by limiting supply or ensuring scarcity” through activities like buybacks.¹⁵⁶ Issuers of f-NFTs, such as Niftex and Fractional.art, may embody this characteristic because issuers set the original fixed price of an f-NFT when they initially fractionalize an NFT, and many f-NFT platforms have some type of “buyout” provision which lets f-NFT investors purchase the remaining shards to gain ownership of the full NFT.¹⁵⁷ This buyout provision is similar to a buyback because the original f-NFT issuer can buy back the whole NFT, which can subsequently support a market price of the f-NFTs. Also, as more NFTs are bought and sold on a platform, the rarity and scarcity of a specific NFT may increase, which then affects the price of that NFT.¹⁵⁸ Thus, if f-NFT platforms support the growth of their platforms to include more f-NFTs or other products, then these platforms can create a market for and support the price of f-NFTs. A counter argument is that an NFT’s lack of exchangeability with other NFTs impedes its ability to be classified as a security. Traditional securities increase their value from price fluctuation and exchangeability, but due to its uniqueness, an NFT only increases its value through profit increases and not exchangeability.¹⁵⁹ This issue may be limited with f-NFTs, whose value is tied to other types of price fluctuations.

The fourth characteristic is that the issuer has a “lead or central role in the direction of the ongoing development of the network or the digital asset.”¹⁶⁰ By simply maintaining the f-NFT network, these platforms are providing an active management role that contributes to the development and stability of f-NFTs and f-NFT networks. Since the actual NFT is typically hosted on external URLs or IPFS, some caution that NFT networks must be maintained to ensure that NFTs sold on the platform do not disappear, buyers do not lose their purchases, and NFTs do not lose their value. This dynamic can create a system in which “the value of the art is tethered to the value of the platform hosting it.”¹⁶¹ The managerial efforts of the NFT platforms would be directly tied to the value of the NFTs because if the NFT platforms are not run properly or are shut down, the value of the NFTs decreases or disappears altogether. An issuer can also take a lead role in continuously developing f-NFTs if the issuer is an artist, athlete, celebrity, or company, and the value of their f-NFT is tied to that specific issuer’s popularity or the efforts they undertake to grow their popularity. When buying an f-NFT, you are not buying the underlying artwork but instead are purchasing the right to gain profits from the increased popularity of the creator, whether it be an artist like Beeple or an athlete like Patrick Mahomes.¹⁶² People may invest in NFTs with the hope that the creator increases in fame, which can then increase the profits from the particular NFT. For example, many college athletes are creating their own NFTs, and as an athlete’s career progresses to professional sports, the value of that NFT could exponentially increase.¹⁶³ NFTs issued by corporations or influential public figures may also satisfy the “efforts of others” prong. For example, Nike recently announced its plan to sell “digital shoes,” which resemble an NFT for its iconic shoes; Martha Stewart also created an NFT collection consisting of digital art of her home décor.¹⁶⁴ Nike and Martha Stewart may have a central role in the ongoing development of their respective NFTs because as they put in effort to continuously grow the popularity and profitability of their brand, their NFTs may also grow in value. If an NFT is tied to a specific company or person, the NFT’s value relies on the efforts of that issuer to increase their popularity, which will in turn help develop the underlying NFT.

The fifth characteristic is that the issuer has “a continuing managerial role in making decisions about or exercising judgment concerning the network or the characteristics or rights the digital asset represents.”¹⁶⁵ Some examples of what constitutes a “managerial role” include: “determining whether and where the digital asset will trade,” having “responsibility for the ongoing security of the network,” and “making other managerial judgements or decisions that will directly or indirectly impact the success of the network or the value of the digital asset generally.”¹⁶⁶ The DAO Curators had a large managerial role over the DAO Token—and its potential value—because investors relied on the Curators’ expertise to monitor the operation of The DAO, safeguard their funds, and determine when proposed contracts should be put to a vote to fund projects.¹⁶⁷ F-NFT platforms may serve this “managerial role” through providing ongoing security for the network. For example, f-NFT platforms must manage their networks to prevent any hacking attempts or fraud that could steal funds during an NFT transaction or destroy the linkage to the underlying NFT.¹⁶⁸ This is similar to how The DAO and its Curators were relied on for “failsafe protection” and for protecting the system from “malicous [sic] actors.”¹⁶⁹ Current f-NFT platforms have yet to show how their managerial decisions can significantly impact the success of f-NFTs, given that they do not have Curator-type workers who actively control f-NFTs. However, if f-NFT platforms sold

f-NFT bundles, investors would have to rely on the platform’s judgment for what types of NFTs were being pooled together in a bundle and sold as

f-NFTs. The platform’s expertise may then affect the value of the f-NFT bundle, and it would be more likely that f-NFTs had continuous management from others.

The sixth characteristic is that “[p]urchasers would reasonably expect the [issuer] to undertake efforts to promote its own interests and enhance the value of the network or digital asset” where the issuer has a stake in the digital asset and can realize its own gain from the digital asset or monetize the value of the digital asset.¹⁷⁰ Issuers or creators of f-NFTs may satisfy this characteristic because they can program a smart contract to automatically charge a type of royalty or curator fee any time an f-NFT is resold or used in a specific way.¹⁷¹ This enables the issuer to monetize the value of the digital asset and promote its own interests in the digital asset. Some platforms such as Niftex have also automatically programmed their f-NFT smart contracts to set aside five percent of an NFT’s fractions for the artist.¹⁷² In this system, instead of the creator taking a cut every time a fraction is traded on the open market, they now get to share in the profits of just owning some of the shards. It seems that f-NFT issuers may promote their own interests and enhance the value of the digital asset, because the higher the value of the asset, the more money they can make off their own shards.

Whether or not f-NFTs satisfy the fourth Howey prong will once again come down to the specific facts of how the f-NFT is marketed to purchasers and the specific platform or issuer. However, given the various SEC characteristics taken together and their application to f-NFTs, there may be a good argument that f-NFTs can gain their value from the “efforts of others.” After analyzing f-NFTs under the four Howey prongs and comparing them to other established digital asset securities, f-NFTs can be considered securities.

IV. HOW CAN NFTS BE REGULATED?

Even if f-NFTs can satisfy all the Howey prongs and be classified as a security, the question still remains whether the SEC should regulate these digital assets and what regulatory framework should be adopted. The SEC cautioned that as financial technologies continue to innovate, there is a possibility that market participants (such as f-NFT buyers, sellers, and platforms) may be conducting activities that fall within the SEC’s jurisdiction in which their transactions, persons, or entities may be subject to registration, regulation, or oversight.¹⁷³ The SEC can regulate three different types of actors: (1) buyers of a security, (2) sellers or issuers of a security, and (3) platforms facilitating exchanges.¹⁷⁴ If designated as a security, buyers, sellers, or platforms of f-NFTs sold without registration may be subject to penalties, registration requirements, or filing periodic reports with the SEC.¹⁷⁵

The SEC needs to discover what types of regulations it can impose on buyers, sellers, and platforms of f-NFTs. This Part analyzes the risks and opportunities of regulating f-NFTs under the existing regulatory framework and how regulations can be applied to the three different actors within the NFT space to recommend a new, modified framework better suited for this digital asset.

A. REGULATION OF BUYERS

The SEC regulates buyers of securities by only allowing certain “accredited investors” to purchase unregistered securities, which typically are subject to fewer requirements and regulations.¹⁷⁶ SEC Regulation D (“Reg. D”) governs unregistered securities and explains the exemptions from being required to register with the SEC.¹⁷⁷ Under Rule 501(a) of Reg. D, accredited investors can be institutional investors and entities such as banks, mutual funds, insurance companies, or pension plans;¹⁷⁸ insiders within an issuer such as officers or directors of the issuer of the securities;¹⁷⁹ or wealthy natural persons such as those with a net worth of greater than $1 million, excluding primary residence and mortgage,¹⁸⁰ or those with an annual income of greater than $200,000 for the last two years ($300,000 if filing jointly with one’s spouse).¹⁸¹

The policy behind limiting buyers from purchasing certain securities through this regulation is to protect less-knowledgeable individual investors, who may not have the financial stability to absorb the high risks of investing in unregistered securities, while also promoting investments into risky entrepreneurial ventures. Accredited investors are treated differently from the general public because they are sophisticated enough to bear the risks, are more knowledgeable, or have the money to hire someone like a financial advisor to help them make informed decisions. Given that f-NFTs may be unregistered securities, the SEC could regulate f-NFT buyers by only allowing accredited investors to purchase them. However, it may be difficult to prevent people from buying a certain digital asset on a decentralized and easily accessible platform. This would mean that every time an f-NFT was created or sold, an issuer or platform would have to go through the

time-consuming and costly process of ensuring that every purchaser complies with the definition of an accredited investor. The whole purpose of fractionalizing NFTs was to make these digital assets more accessible to average investors. Thus, it seems counterintuitive to place a new barrier in front of average investors and their ability to participate in this emerging market. The accredited investor regulation is meant to protect average investors from more risky activities, but there may be other ways to prevent harm to less-knowledgeable investors than completely cutting them off from these new assets, such as requiring NFT platforms to provide easily accessible and relevant information regarding trading NFTs and maintaining certain security protocols to protect f-NFT investors and their funds. Thus, it is unlikely that the SEC could or should place any regulations on buyers of f-NFTs.

B. REGULATION OF SELLERS OR ISSUERS

The SEC may be able to place registration requirements on the initial creators or issuers of f-NFTs. Under section 5 of the Securities Act, any issuer offering or selling an unregistered security in interstate commerce must register non-exempt securities with the SEC.¹⁸² These registration requirements serve two main goals: (1) to provide investors with financial and other material information regarding the securities being offered or sold and (2) to prohibit and minimize fraud, deceit, misrepresentations, and other dangers in the sale of securities.¹⁸³ Requiring issuers to provide information regarding their assets to investors through the SEC increases the likelihood that investors will make well-informed decisions and provides a certain standard to minimize fraudulent sales. If f-NFTs are deemed to be securities, the individual or entity that initially fractionalizes the NFT and sells these

f-NFTs may be considered an issuer under section 5 and thus be subject to SEC requirements such as filing a registration statement and periodically disclosing material information.¹⁸⁴

The SEC has cracked down on digital assets and ICOs by bringing and winning enforcement actions against a variety of issuers who have offered and sold digital assets that are deemed securities and were not registered pursuant to the Securities Act.¹⁸⁵ In 2019, the SEC brought two high-profile enforcement actions against Kik Interactive Inc. (“Kik”) and Telegram Group Inc. (“Telegram”) arguing that the Kik and Telegram tokens were sold to investors as unregistered securities and thus violated federal securities law. The courts applied the Howey test and found that both tokens were securities because the funds from the token sale were used for operating the companies’ respective ecosystem and messaging apps, the tokens were marketed to prospective investors as a way “you could make a lot of money,” and the value of the investments depended on the companies’ respective efforts to develop their messaging apps.¹⁸⁶ While some issuers of digital assets like cryptocurrency were subject to registration requirements, other issuers of digital assets such as tokens for a membership rewards program (TurnKey Jet, Inc.) or tokens for video game currency (Pocketful of Quarters, Inc.) were given “no-action” letters from the SEC promising that the it would not take any enforcement action against these issuers for selling the digital assets without registration.¹⁸⁷ The SEC held that these rewards and video game tokens were not securities because none of the funds from the token sales were used to develop the issuer’s platform, the tokens were immediately usable for their intended functionality (purchasing air charter services or gaming) at the time they were sold, token transfers were restricted to only the company’s internal “wallets,” and the tokens were both marketed in a way that emphasized the functionality of the token for consumption.¹⁸⁸

Given the unchartered territory of f-NFTs, it is difficult to apply the regulation of issuers to the creators of f-NFTs. Although selling f-NFTs may look like a type of ICO, there may be policy reasons not to require registration every time creators wish to fractionalize their NFT. Registering the sale of an asset is a time-consuming and costly process, and it seems unnecessary to require extensive disclosures given that the costs of registration may outweigh the benefits of having an accessible f-NFT marketplace. The main goals of these registration requirements are to provide investors with sufficient information regarding the f-NFT and to prevent fraud.¹⁸⁹ However, f-NFTs’ blockchain and smart contract technology may satisfy these goals without the need for costly registration. Many platforms always display relevant information regarding an NFT right next to the image of the NFT. This information typically includes a description of the NFT, the total supply of fractionalized shards, the valuation, and some type of table showing all the transactions of that specific NFT, the date on which each sale occurred, the buyers and sellers for each sale, and the price at which it was sold.¹⁹⁰ Thus, potential purchasers can already easily see the relevant financial information regarding the assets to help them make an informed decision. Also, since each f-NFT has a digital ledger that automatically records every transaction and every buyer and seller of that f-NFT, it can be easier to fend off certain types of fraud and easily authenticate true ownership. F-NFTs’ blockchain technology, decentralized network, and easy authentication process can help satisfy the goals that registration requirements aim to reach.

One may argue that if an f-NFT is being sold by a specific entity, artist, or athlete, and the value of that f-NFT is tied to that entity or individual’s external success, then the issuer may need to provide disclosure regarding the entity or individual. For example, would a professional athlete’s f-NFT issuance require a registration statement about their professional sports career? Brands such as Nike and Martha Stewart have recently announced their digital asset plans such as Nike’s “digital shoes” and Martha Stewart’s NFT collection of digital images depicting her home decor and designs.¹⁹¹ Thus, if a company or brand is issuing an NFT or f-NFT, it seems more likely that the SEC may impose registration requirements and disclosures regarding that specific company or brand. Even if the SEC decides to impose registration requirements for the initial fractionalization of an NFT, there should be exemptions for small NFTs of little value or where there is a low number of shards in the initial fractionalization. For example, Reg. D under Rule 504 provides an exemption from registration requirements for companies that issue a small amount of securities, in which they are not allowed to sell more than $10 million worth of securities in any twelve-month period.¹⁹² This rule could easily be applied or adapted to fit small sales of f-NFTs such that issuers would not be required to register the sale of their f-NFTs if the total value of the sale was below a certain threshold. The SEC will need to balance the costs of the registration requirements for initial

f-NFT issuers with the need to promote or encourage new markets and assets and not stifle innovation and creativity.

C. REGULATION OF PLATFORMS OR EXCHANGES

Although it may be more difficult to regulate buyers or the initial creators of f-NFTs, it may be more reasonable to focus securities regulation on f-NFT platforms or networks that provide for the fractionalization of NFTs and manage the secondary market trading of these digital assets. If an f-NFT platform such as Niftex, Fractional.art, or DAOfi satisfies the definition of an “exchange” under Exchange Act Rule 3b-16(a)’s test, then these types of platforms will need to register with the SEC under section 6 of the Exchange Act as a national securities exchange or be exempt from registration, such as by operating as an alternative trading system (“ATS”) in compliance with Regulation ATS.¹⁹³ The registration requirements for exchanges apply regardless if the issuing entity is a decentralized autonomous organization as opposed to a traditional company, if purchased using virtual currencies as opposed to traditional paper currency, or if distributed through ledger technology as opposed to certificated form.¹⁹⁴

Under the Exchange Act Rule 3b-16(a), an entity is an “exchange” if it (1) “brings together orders for securities of multiple buyers and sellers,” and (2) uses “established, non-discretionary methods.”¹⁹⁵ The SEC clarifies this two-pronged functional test by stating that a system “brings together orders” “if it displays, or otherwise represents, trading interests entered on the system to system users” or “if it receives subscribers’ orders centrally for future processing and execution.”¹⁹⁶ The SEC also explains that a system uses “established, non-discretionary methods either by providing a trading facility or by setting rules governing trading . . . among the multiple buyers and sellers entering orders into the system.”¹⁹⁷ These methods include a computer system in which orders interact, a “trading mechanism that provides a means or location for the bringing together and execut[ing] of orders,” or rules that impose execution procedures or priorities on orders.¹⁹⁸

Recently, this test was applied to the EtherDelta, which is an online trading platform that allows buyers and sellers to trade digital assets such as Ether and ERC20 tokens in secondary market trading. The SEC entered an enforcement order arguing that EtherDelta violated section 5 of the Exchange Act because its digital token was a security and the EtherDelta platform was an unregistered “exchange” that was transacting in a security.¹⁹⁹ This enforcement action found that EtherDelta satisfied the criteria of an “exchange” under Exchange Act Rule 3b-16(a) because it (1) operated as a marketplace for bringing together the orders of multiple buyers and sellers of a digital asset that was considered a securities under the Howey test “by receiving and storing orders in token in the EtherDelta order book and displaying the top 500 orders (including token symbol, size, and price) as bids and offers,” and (2) “provided means for orders to interact and execute through the combined use of the EtherDelta’s website, order book, and pre-programmed trading protocols on the EtherDelta smart contract.”²⁰⁰ The EtherDelta website also had numerous features that were similar to online securities trading platforms, such as providing access to the EtherDelta order book, sorting the tokens by price and color, and providing account information, market depth charts, lists of user’s confirmed trades, daily transaction volumes per token, and fields for users to input deposits, withdrawals, and trading interests.²⁰¹ Many of these features are similar to the online trading platforms of f-NFTs. When applying this functional test to f-NFT platforms and comparing them to the EtherDelta, it seems like f-NFT platforms can satisfy Rule 3b-16(a)’s two requirements.

First, f-NFT platforms bring together multiple buyers and sellers onto a single network to transact orders of f-NFTs. f-NFT platforms satisfy the “multiple buyers and sellers” aspect since there is a wide variety of f-NFTs issuers and multiple buyers who can purchase these f-NFTs.²⁰² These platforms satisfy the aspect of “bringing together” people to “transact orders” because they not only provide a place to fractionalize NFTs but also create and maintain marketplaces for users to trade their f-NFTs. Platforms typically receive and store f-NFT orders in a ledger on the Ethereum blockchain that keeps track of all the transactions of a specific f-NFT, much like the EtherDelta order book.²⁰³ All of these orders and f-NFTs are easily displayed on f-NFT platforms where users can see any past f-NFT transactions and execute orders to buy or sell these digital assets. Similar to EtherDelta, f-NFT platforms like Fracitonal.art also display the top orders and include information such as the token name, number of fractions, and price.²⁰⁴

Second, f-NFT platforms use a decentralized network that acts as a trading facility and sets rules for any f-NFT transaction through the underlying smart contracts that these platforms embed in the f-NFTs.²⁰⁵ Like EtherDelta, current f-NFT platforms provide a network or trading facility for orders to interact and execute through their individual websites such as Niftex, Fractional.art, or DAOfi, their digital ledgers, and their pre-programmed smart contracts with embedded trading protocols.²⁰⁶ These websites provide the “means or location” for bringing together users and executing orders for f-NFTs.²⁰⁷ Also, smart contracts use execution procedures and priorities to impose rules and determine the terms for any

f-NFT transaction on the network.²⁰⁸ Smart contracts can confirm the validity of the transactions and set the conditions of the order by checking certain information, such as whether the f-NFT contains a valid cryptographic signature, if the f-NFT comes with some type of royalty, if there is a buyout option, or if there is some type of curator fee.²⁰⁹ These characteristics provide the “established, non-discretionary methods” that govern how f-NFT orders interact with each other.

If an f-NFT platform is considered an “exchange,” it could still escape registration requirements if it satisfies one of the exemptions in Exchange Act Rule 3a1-1(a). It is unlikely that an NFT trading platform would fall under the 3a1-1(a)(1) (exemption for an ATS operated by a national securities association) or 3a1-1(a)(3) (exemption for an ATS not required to comply with Regulation ATS pursuant to Rule 301(a) of Regulation ATS) exemptions.²¹⁰ However one could analyze whether an f-NFT trading platform could be considered an ATS that complies with Regulation ATS and thus fits into the 3a1-1(a)(2) exemption for ATSs. This exemption would allow f-NFT exchanges to register as a broker-dealer, which has lower regulatory costs and fewer notice and reporting requirements, instead of as a national securities exchange.²¹¹ Although operating under this exemption would still come with some notice, reporting, and recordkeeping requirements, it could prevent f-NFT platforms from spending even more time and money on registering as a national securities exchange and dealing with periodic disclosures.

Although digital asset trading platforms resemble traditional exchanges or alternative trading systems, regulators may need to adjust the regulatory framework, much like they did for ATSs, to account for differing characteristics of blockchain-based exchange platforms. Differences between digital asset exchanges and national securities exchanges can include transparency, fairness, and efficiency.²¹² The decentralized aspects of f-NFT platforms may provide their own form of protection that may be more or equally as transparent, fair, and efficient as the regulations the SEC would impose. Thus, the SEC could adopt another new regulatory framework for exchange platforms of digital assets such as f-NFTs that requires less registration or fewer requirements than a national securities exchange and recognizes the fraud and misrepresentation protection that a blockchain platform already affords.

A decentralized platform may be better than SEC-imposed regulation at detecting fraud and protecting users on these types of f-NFT platforms. First, these platforms’ “decentralized” and public nature provides fairness because no one entity controls the network, and therefore anyone can easily access and interact on the platform and all transactions are verified by others on the network. Second, “decentralized” exchanges provide efficiency because the blockchain technology allows them to easily show users “verified business logic [in a publicly verified smart contact],” which a centralized exchange could not do.²¹³ Third, f-NFT platforms provide transparency because while traditional exchanges hold your funds with an “exchange owner,” decentralized ones hold your funds through easily verifiable and public digital ledgers that also contain a list of all transactions for a specific f-NFT, including the buyer, seller, and price.²¹⁴ The cryptographics embedded in f-NFTs make everything in a sense “registered” through its digital ledger, and all transactions are verified through the whole blockchain network. Thus, the sale of these digital assets may not need SEC regulation.

Even in decentralized networks, there is still a chance of hacking, fraud, and loss that may be mitigated through government regulation. Just as the SEC modernized the regulatory framework to “better integrate alternative trading systems into the national market system,” the SEC may need to modernize the regulatory framework again to integrate NFT trading systems and digital asset sales.²¹⁵ For example, the SEC may adopt a new regulatory framework that requires an f-NFT exchange platform to provide or display either convenient one-time reports or costly regular reports on its security protocols and how it deals with bad actors such as hackers that manipulate code to steal the proceeds of an NFT sale.²¹⁶ The SEC may also implement a limiting framework, similar to how Regulation ATS requirements are limited to a subset of ATSs that occupy a certain large percentage of the total trading volume of any security.²¹⁷ For example, the SEC could only require registration for f-NFT exchanges that account for a large volume of the overall traded f-NFTs. This may ensure investor protection from large actors while still allowing for innovation through smaller actors. The SEC can also require platforms to comply with certain capacity, integrity, and security standards to ensure f-NFT investors’ funds and assets are protected, given that an f-NFT’s value may be tied to the platform’s ability to maintain and retrieve the NFT.

SEC Commissioner Hester Peirce’s proposal for a “safe harbor” for digital assets and exchanges shows a glimpse into the beginning of a new framework that can provide guidance for digital asset issuers and exchanges. Peirce proposed a regulation in which digital asset exchanges would be allowed to begin distributing their tokens broadly if they provide disclosures such as plans for the network and who is behind the network.²¹⁸ These exchanges would then have three years from a token’s initial distribution to develop the network before they would be subject to any securities laws.²¹⁹ This three-year safe harbor allows issuers of digital assets to be exempt from SEC regulation for a certain time period and prevents their digital asset from being immediately classified as a security. It also gives digital asset creators time to set up their networks without government regulation and establish whether their digital asset can be classified as a security. This framework may allow creators to innovate digital and financial assets while continuing to protect investors. At the end of the day, the SEC needs to balance “encourag[ing] market innovation while ensuring basic investor protections.”²²⁰

V. PRELIMINARY EXPLORATION OF EXISTING REGULATORY MODELS

The SEC has existing regulation for non-digital securitized products, such as traditional stocks in companies or REITs, which may be applicable to f-NFT products and provide regulators with a starting point from which to develop regulations specific to f-NFTs.

When an individual or entity initially fractionalizes and issues their

f-NFTs, it could be called an “Initial Fractionalization Offering,” or “IFO.” An IFO, in which the issuer sells multiple shards of the same NFT to multiple buyers, is similar to a type of IPO or ICO, in which the issuer sells multiple stocks or tokens of the same company to multiple buyers. F-NFTs can be treated as a stock in the original whole NFT, and the sale of these f-NFTs can be the same as selling a share in an individual company. Thus, instead of developing a whole new set of regulations for f-NFTs, regulators can just look at existing securities laws for traditional stock sales and apply them to f-NFTs sales. The rules governing traditional, non-digital securities such as stocks could be slightly modified to better apply to f-NFT sales. For example, f-NFT creators could be required to register their f-NFT sale or IFOs with the SEC by filing a modified Form S-1 that contains information regarding the past performance of the NFT such as its trading history, information regarding the performance of other similar NFTs if the NFT is part of a collection, or information regarding the company or individual creating the NFT.²²¹ Providing the financial disclosures required by traditional IPOs may be more difficult for traditional NFTs because there is not any managerial or financial information behind a regular NFT besides its intrinsic or artistic value. However, NFTs from a particular brand, celebrity, or company would have an easier time producing accurate managerial and financial disclosures or material information regarding an NFT because these brands and celebrities typically have established financials or data regarding their performance, such as how popular a brand is or the performance statistics of an athlete. For example, Martha Stewart could be required to disclose managerial and financial information regarding her retail company if she tries to issue another NFT collection of her home décor, or Patrick Mahomes could be required to disclose information regarding his football statistics or other brand deals if he issued more NFTs. Thus, traditional registration requirements for issuing stock could be particularly appropriate for a celebrity or company that issues f-NFTs or NFTs and uses the proceeds from the sales to develop their brand or business.

REITs are another securitized product with an established regulatory structure that can be applied to f-NFT regulation. REITs are entities that own and typically operate various “income-producing real estate or real estate-related assets,” such as office buildings, apartments, shopping malls, hotels, or warehouses.²²² In addition to other requirements, a REIT must have seventy-five percent of the entity’s total assets coming from real estate investment, be managed by a board of directors, and distribute at least ninety percent of its taxable income to shareholders annually in the form of dividends.²²³ REITs register and file reports with the SEC, can list and trade their shares on a public stock exchange, and allow investors to invest in and own shares of multiple large-scale, income-producing real estate properties without actually having to buy the real estate.²²⁴ In other words, REITs take a bunch of commercial real estate assets, bundle them together in one company, and then sell shares of that company to investors so they can reap the benefits of owning commercial real estate. Issuing shares of a REIT is like issuing fractional shares of a basket of NFTs. For example, one way to issue f-NFTs is to take multiple whole NFTs, bundle them together in one large NFT basket, and then sell f-NFTs or fractional shares of that basket

(“f-NFT bundles”) to investors so they can own shares in multiple NFTs. Just as REITs sell investors shares of a basket of real estate investment properties, f-NFT bundles sell investors shares of a basket of NFTs.

Given these similarities, securities regulations that apply to REITs may also translate and apply to f-NFTs. Most REITs are registered with the SEC and publicly traded on a stock exchange. Under the Securities Act, REITs are required to register their securities using Form S-11 to make disclosures regarding the REIT’s management team and other significant information and make regular SEC disclosures such as quarterly and yearly financial reports.²²⁵ Regulators could follow this existing regulatory model from REITs and impose similar requirements for fractional shares of NFT bundles. Form S-11 requires REIT issuers to disclose information detailing the price of the deal, how the REIT plans to use the proceeds, certain financial data like trends in revenue and profits, descriptions of the real estate, operating data, information on its directors and executive officers, and other data.²²⁶ These types of requirements could easily be adopted to regulate f-NFT bundles by creating a new, similar form to Form S-11 for issuers of f-NFTs to file with the SEC. For example, issuers of f-NFT bundles could be required to file a form like Form S-11 that discloses information like the price of each f-NFT; how the individual or entity issuing these f-NFT bundles plans to use the proceeds, such as to purchase more NFTs to add to the bundle; a description of the NFTs currently in the bundle as if they are part of a trending collection; certain financial data of each NFT, such as its past transactions or price; information on the individuals managing the bundle, such as their credentials or how they have managed digital assets in the past; and so forth.

However, REITs are different from f-NFTs in that REIT investors earn a share of the income produced through the rent or mortgage interests from the commercial real estate, while f-NFT investors can only earn a share of the increased value of the underlying NFT.²²⁷ It may be possible that an NFT’s smart contract could charge money to anyone who views the particular NFT and then automatically distribute these proceeds out to the

f-NFT investors as a type of dividend, but this has yet to be seen. Thus, REIT regulations may not translate perfectly to regulating f-NFT bundles since it is difficult to see how f-NFT bundles would file quarterly and yearly financial statements regarding just NFTs. REITs can provide financial disclosures regarding the profits and losses of their various real estate properties, but f-NFTs do not have similar financials beside the increase and decrease in value of the various NFTs within the bundle. However, as described above, there may be more financial information when f-NFTs are issued by specific celebrities or companies. Regulators will need to determine how f-NFTs can disclose financial information to best inform investors. Additionally, there has been a surge of investors buying Metaverse Real Estate, which is real estate in virtual worlds bought and sold using NFTs and cryptocurrency.²²⁸ People can now go onto virtual real estate platforms such as SuperWorld where they can buy a plot of land in the form of an NFT and then share in any of the commerce that happens on that piece of property.²²⁹ These types of real estate NFTs would be able to charge rent or gain interest on these virtual properties and thus could then distribute income out to the NFT owners, much like REITs, and be subject to similar regulation. This analysis is outside the scope of this Note, but it is a relevant issue that regulators will need to face in the future. Nevertheless, REITs can provide a baseline to help regulators analyze and develop ways to regulate different types of f-NFTs and NFTs.

CONCLUSION

Given the foregoing analysis, f-NFTs can be deemed an “investment contract” security under the Howey test, and the SEC may be able to regulate the issuers or exchanges that facilitate these fractionalization and trading.

F-NFTs satisfy the four Howey prongs because (1) f-NFT buyers make an investment using money in the form of cryptocurrency; (2) this investment is in a “common enterprise” where the fortunes of the buyer are tied to the successes of either other fractional investors of one NFT or the brand or celebrity that issued the NFT; (3) buyers have a “reasonable expectation of profit” because f-NFTs are traded on secondary markets and promoted as a unique liquidity opportunity; and (4) these financial returns are derived from the efforts of issuers to support the popularity and price of an f-NFT and platforms to maintain and develop f-NFT exchanges and marketplaces.

If f-NFTs or NFTs are deemed securities, the SEC can use the existing regulatory models of digital currencies, traditional stock, and REITs to create initial regulations of a continuously developing digital asset. Due to the wide variety of f-NFTs and the ways in which they are owned and operated, regulators will have difficulty developing one standard that applies broadly. However, by comparing issuers and exchanges of f-NFTs or NFTs to existing securitized products, one can apply slight modifications to established regulations and require disclosures such as an NFT’s transaction history or how an issuer and exchange will use the proceeds from the sale.

Hopefully, this analysis will appeal not only to the legal field and regulators but also to the average investor who is interested in buying, selling, or understanding new digital assets like NFTs. The legal field and the government must face the current issues with NFTs and their classification and regulation as a financial instrument in order to protect investors while also allowing for the innovation of new financial technologies.

96 S. Cal. L. Rev. 253

Download

Lauren Au*

* J.D., University of Southern California Gould School of Law, 2023. B.A., University of California, Los Angeles, 2019.

Ditching Daimler and Nixing the Nexus: Ford, Mallory, and the Future of Personal Jurisdiction under the Corporate Consent and Estoppel Framework

April 2023April 2023Shauli Bar-On*

While personal jurisdiction is intended to assess whether a defendant should be forced to defend a lawsuit in a location due to the defendant’s contacts with that forum, the doctrine has shifted to require the plaintiff to show a connection to the forum, even if the defendant otherwise has substantial contact with it. In its 2014 decision Daimler AG v. Bauman, the Supreme Court further limited the personal jurisdiction of corporate defendants in the spirit of curtailing forum shopping. But the Court’s 2021 decision concerning personal jurisdiction, Ford Motor Co. v. Montana Eighth Judicial District, and the Court’s granting of certiorari in Mallory v. Norfolk Southern Railway Co. cast doubt on the viability of Daimler. The 2021 Ford decision marks the beginning of an expansion of personal jurisdiction for corporate defendants. Justices Thomas, Sotomayor, and Gorsuch have expressed concerns over the protections afforded to corporate defendants under current doctrine. This Note elaborates on that skepticism. It traces the history of personal jurisdiction to reveal that the doctrine originates from the corporate consent and estoppel model—the very model at issue in Mallory. This Note argues that, absent guidance from Congress, courts must apply the original model—one that is inconsistent with Daimler and the nexus requirement. Finally, this Note argues that returning to the pre-Daimler and pre-nexus era produces favorable policy: it removes baseless corporate protections under the guise of the Fourteenth Amendment, clarifies the murky application of the doctrine in internet and stream of commerce cases, opens more fora for plaintiffs to allow free-market considerations to shape state law, and leaves the door open for Congress to legislate if it deems it necessary.

INTRODUCTION

Ask any athlete, and they will confirm the importance of home-field advantage. Over a large sample size, home teams win between 55% and 60% of National Football League games.¹ A similar phenomenon takes place in the Major League Baseball.² In the National Basketball Association, the numbers are usually higher at around 65% home-team wins.³ Needless to say, if offered a choice, teams would prefer to play at home. The same is true for litigants. The Constitution recognizes that a litigation forum, the location in which a lawsuit is permitted to take place, is limited.⁴ The limitation of where a defendant may be sued is known as where the defendant is subject to the court’s “personal jurisdiction.”

Traditionally, a defendant was subject to personal jurisdiction in a particular location if the defendant was “at home” in that location. But the definition of where a corporate defendant is “at home” has changed dramatically. Prior to 2013, corporate defendants were “at home” in any location in which they engaged in “continuous and systematic” contact.⁵ But under the Supreme Court’s 2013 decision Daimler AG v. Bauman,⁶ corporate defendants are now “at home” only in the locations in which they (1) maintain their headquarters or (2) are incorporated.⁷ Consequently, in order for a plaintiff to sue a corporate defendant outside of these two locations, the plaintiff must comply with a significantly more complicated framework, the most perplexing aspect of which is the “nexus” requirement: in order to sue a defendant away from the defendant’s “home” and ensure that the defendant’s due process rights are not offended, the plaintiff must show a connection between the selected location and the plaintiff’s lawsuit.⁸

This relatively new doctrine produces peculiar results. Masquerading as due process, the doctrine inordinately shields corporations from having to defend lawsuits in locations where they previously would have had to. For example, current doctrine forbids Michigan plaintiffs from suing a New York company in California but permits an identical lawsuit in the same venue for the same injuries based on the same conduct by California-residing plaintiffs.⁹ Moreover, the doctrine forbids a Florida-residing plaintiff from suing a Texas corporation in Florida, even though the corporation was registered to do business in Florida; had an agent for service of process in Florida, a distributor in Florida, and a plant in Florida; had been sued for similar claims in Florida; and had itself initiated lawsuits in Florida.¹⁰ In other words, in locations where the defendant is not “at home,” current doctrine erroneously assesses the plaintiff’s connection to the litigation forum in determining whether the defendant’s due process rights have been violated. The scenarios described above, and other recent Supreme Court decisions, illuminate how far astray from its origins personal jurisdiction doctrine has drifted.¹¹

In 2021, the Court handed down its decision in Ford Motor Co. v. Montana Eighth Judicial District Court,¹² which revealed that at least three sitting Supreme Court Justices¹³ are skeptical of the current personal jurisdiction doctrine, arguing that it provides too much protection for corporate defendants under the guise of the Fourteenth Amendment’s Due Process Clause. In April 2022, the Court also granted certiorari to address the corporate consent and estoppel model head on.¹⁴ This model, described in further detail below, suggests that if a corporation registers to conduct business in a forum, it implicitly consents to jurisdiction in that forum and is estopped from arguing otherwise. This Note expands on the justices’ concerns and offers a way forward consistent with the way personal jurisdiction has historically been understood.

This Note will illustrate that the modern personal jurisdiction doctrine—and the nexus requirement in particular—was improperly created to curtail forum shopping.¹⁵ It will then show that while Congress has passed statutes limiting or expanding jurisdiction in other contexts,¹⁶ and has narrowed jurisdiction of federal courts through venue statutes,¹⁷ it has not done the same to limit personal jurisdiction. Therefore, the sole consideration for personal jurisdiction is due process. And under the Due Process Clause, personal jurisdiction is based on the corporate consent and estoppel model, which inquires only into the corporate defendant’s contacts with the selected forum—it is not so concerned with the plaintiff’s connection to the forum. Accordingly, Daimler and the nexus requirement are inconsistent with this traditional model. This Note will also show how a reversion to this model of personal jurisdiction will clarify the doctrine’s application to cases involving internet sales and the “stream of commerce.”¹⁸

This Note begins by synthesizing the genesis and evolution of personal jurisdiction doctrine, discussing first the nineteenth century norms and moving into how Supreme Court jurisprudence has developed under the lens of the Fourteenth Amendment. The next Part of this Note narrows in on the relatively new distinction between general and specific personal jurisdiction¹⁹ and the “nexus” requirement that has attached to the latter. The Note continues by listing reasons the nexus requirement is troublesome and difficult to apply, given the narrowing of the “at home” definition for corporate defendants.²⁰ Finally, it ends with a preview of where the Court may be heading: given the granting of certiorari in Mallory, the Court appears to be in favor of reverting to the corporate consent and estoppel model and determining personal jurisdiction through assessing the defendant’s connection to the selected forum alone, consequently ditching Daimler and nixing the nexus requirement.

BACKGROUND
An Explanation of Personal Jurisdiction

Personal jurisdiction refers to the power a court has to make rulings relating to a party.²¹ Practically, it refers to the location in which a plaintiff may sue a defendant and hold the defendant to answer for that lawsuit. If a defendant is subject to personal jurisdiction in a particular location, known as a “forum,” the defendant must respond to the lawsuit, and any decision impacting the defendant can be enforced in other jurisdictions.²² If a case is in state court, personal jurisdiction answers the question “which state’s court system?” If the case is in federal court, personal jurisdiction answers the question “the federal court in which state?”

Personal jurisdiction analysis is twofold: statutory and constitutional.²³ States are free to pass statutes defining the personal jurisdiction of their state courts. These are referred to as “long-arm statutes,” as they extend or retract how far the “arm” of their court system can reach. Under Rule 4(k)(1)(A) of the Federal Rules of Civil Procedure, a federal court applies the long-arm statute of the state in which it is located.²⁴ In practice, a federal court in California will first determine whether there exists personal jurisdiction over a defendant under California’s long-arm statute. After making a determination under the long-arm statute, the court would turn to the constitutional analysis of personal jurisdiction.

The constitutional analysis of personal jurisdiction is based on the Due Process Clause of the Fourteenth Amendment. The analysis involves considerations of state sovereignty, federalism, and fairness. Because most states have long-arm statutes that permit personal jurisdiction to the limits of the constitution,²⁵ the personal jurisdiction analysis often blends into just a constitutional question. As such, courts²⁶ in states with to-the-limits-of-the-constitution long-arm statutes will only undertake a single analysis and have to answer one question: Is the exercising of personal jurisdiction in this forum consistent with the Due Process Clause? The remainder of this Note focuses only on the constitutional analysis of personal jurisdiction.

The Distinction Between General and Specific Personal Jurisdiction

Another concept crucial to the understanding of this Note is the distinction between two kinds of personal jurisdiction: “general (sometimes called all-purpose) jurisdiction and specific (sometimes called case-linked) jurisdiction.”²⁷ The former refers to a forum in which any plaintiff can bring any cause of action against the defendant.²⁸ The latter is a forum in which, under current doctrine, plaintiffs may only bring causes of action that “arise out of or relate to” the forum.²⁹ The specific personal jurisdiction requirement that the claim “arise out of or relate to” the forum is known as the “nexus” requirement because the plaintiff must show a “nexus” between the claim and the selected forum.

An example may help illustrate how the doctrine functions. Suppose a defendant is subject to general jurisdiction in Delaware. In that situation, a plaintiff from New York may sue the corporation in Delaware, even if there is no relation between the claim and Delaware—that is, even if the wrong alleged in the complaint took place in Maine. By contrast, suppose that the same defendant is not subject to general personal jurisdiction in Delaware. In the event that the wrong alleged in the complaint took place in Maine, the courts in Delaware would not have personal jurisdiction over the defendant and would not be able to adjudicate the dispute—this is because the plaintiff is unable to show a “nexus” between the claim and the selected forum in Delaware.

The Venue Statutes

Besides the due process requirements that plaintiffs must comply with in deciding where to file a lawsuit, Congress has acted to pass statutes narrowing potential venues for litigation. Specifically, Congress has outlined three locations in which a civil action may be brought: (1) where a defendant “resides”;³⁰ (2) where a “substantial part of the events or omissions giving rise to the claim occurred”;³¹ and (3) if there is no venue that fits (1) or (2), wherever the defendant is subject to the court’s personal jurisdiction.³²

In situations where a plaintiff files a lawsuit in a location in which the defendant is subject to personal jurisdiction, Congress permits defendants to nevertheless file motions to transfer venue or dismiss the case.³³ Congress envisioned two main reasons to permit a transfer of venue despite compliance with the requirements of personal jurisdiction. The first reason is when the plaintiff complies with the requirements of personal jurisdiction but does not comply with the requirements of the venue statute.³⁴ For example, suppose that a corporation is headquartered in San Francisco, California (which is in the Northern District of California) and finds itself to be the defendant in a federal-law dispute³⁵ with an employee over conduct that took place in San Diego, California (which is in the Southern District of California). If the employee files suit in the Central District of California, the defendant corporation may request a transfer to either the Northern or Southern District of California because, while the corporation is subject to personal jurisdiction in California, the Central District of California is an improper venue (it is not the venue where the defendant corporation is located,³⁶ and it is not the location where a “substantial part of the events or omissions giving rise to the claim occurred”).³⁷

The second reason is for convenience.³⁸ That is, even if a plaintiff complies with the requirements of personal jurisdiction and with the requirements of the venue statutes, a defendant may nevertheless request and be granted a motion to transfer venue if “the interest of justice” so demands.³⁹ In making the discretionary determination to transfer a case for convenience purposes, courts consider the following factors, among others: the relative ease of access to sources of proof, the cost of obtaining the attendance of required witnesses, administrative dealings of court congestion, and the local interests of having controversies decided where they took place.⁴⁰ For example,⁴¹ suppose a plane company is headquartered in Great Britain and flies a plane in Scotland. The plane’s parts were manufactured in Pennsylvania and Ohio. While the company was flying the plane in Scotland, it crashed and killed everyone on board. The heirs of the passengers sued the plane company in Pennsylvania.⁴² The Pennsylvania court could dismiss the case under a forum non conveniens theory, concluding that the case should be tried in Scotland.⁴³ In reaching this conclusion, the court would note that the crash had occurred in Scotland; the crash investigation had been conducted in Scotland; the witnesses are in Scotland; and the pilot’s estate, the plane’s owners, and the charter company were all located in Scotland.⁴⁴

The venue statutes supplement personal jurisdiction doctrine. Importantly, though, they are acts of Congress and not judge-made interpretations of the Due Process Clause of the Fourteenth Amendment. As for the forum non conveniens doctrine, there is a common law background to the doctrine, and it existed before the ratification of the Fourteenth Amendment.⁴⁵ This is crucial for originalist judges who believe that that Court should apply common law doctrines only if they existed at the time of the ratification of the amendment at issue. Of course, should Congress desire to narrow or expand the jurisdiction of federal courts and permit more or fewer fora for plaintiffs to file lawsuits, Congress is free to do so.⁴⁶

Recent Supreme Court Doctrine

The Supreme Court has historically been deadlocked in its personal jurisdiction doctrine. Justices seem to agree on dispositions but not the underlying reasoning for them.⁴⁷ In March 2021, the Court handed down its decision in Ford Motor Co. v. Montana Eighth Judicial District.⁴⁸ That decision doubled down on the Court’s previous personal jurisdiction decision, Bristol-Myers Squibb Co. v. Superior Court,⁴⁹ in which the Court required plaintiffs to show a nexus between their claim and the forum state in order to establish specific personal jurisdiction over a defendant corporation that has long been established in the forum selected for the litigation.⁵⁰ In both Ford and Bristol-Myers Squibb, the defendant corporation was not subject to general jurisdiction in the forum state despite its significant market presence there;⁵¹ in other words, the defendant corporation had purposefully availed itself of the forum state, arguably had continuous and systematic⁵² contact in the forum, but nevertheless was not “at home”⁵³ there. In Ford, a plaintiff purchased a malfunctioning car outside of Montana, yet she was permitted to sue Ford in Montana because Montana was the plaintiff’s home state.⁵⁴ In Bristol-Myers Squibb, a group of plaintiffs from Michigan was not permitted to sue in California (even though a group of California residents was permitted to sue there) because the Michigan plaintiffs had no connection to California.⁵⁵ The plaintiffs’ place of residency was thus determinative in failing to establish personal jurisdiction over the corporation defendant and offended the corporation’s right to due process under the Fourteenth Amendment.⁵⁶ Under both the first personal jurisdiction case since the passing of the Fourteenth Amendment, Pennoyer v. Neff,⁵⁷ as well as under the revamped “minimum contacts” test in International Shoe,⁵⁸ both Ford and Bristol-Myers Squibb would arguably be permitted to proceed in the selected fora. This Note will explain how the doctrine has evolved to the point of irreconciliation with these landmark cases.

THE STAKES AND HISTORY OF PERSONAL JURISDICTION
The Stakes of Personal Jurisdiction

Before delving into the history of personal jurisdiction and its development over the turn of two centuries, it is necessary to explain why it has been an area of such fierce contention. Personal jurisdiction is not about geography, not about which physical courthouse may entertain a controversy. Rather, it is about who adjudicates that controversy.⁵⁹

The Concern of State Judges and Congress’s Statutory Remedy

In most states, state judges are elected by the general public.⁶⁰ Accordingly, a case pending in state court is adjudicated by a judge subject to at least some public pressure. The case also has the potential of being tried before a jury composed of individuals from that state. These factors may create disadvantages for an out-of-state corporation, especially if the plaintiff is from the forum state.⁶¹ Hence the saying, “Though the courtroom be an adversarial arena, [the judge] is more than a referee . . . more than a linesman. [The judge] is the game.”⁶²

Congress has addressed the fairness concerns of defendants being sued in state courts outside their place of residence through the mechanism of federal court removal.⁶³ The process of removal, a product of congressional statute, allows defendants to move a case from state court—where judges are usually elected, and plaintiff-friendly state procedural law is likely to apply—to more defendant-friendly federal court if certain criteria apply.⁶⁴ One such criterion is when there exists “diversity jurisdiction.” Diversity jurisdiction occurs when the litigating parties are citizens of different states.⁶⁵ Corporations are citizens of the state in which they are incorporated and the state in which their headquarters is in.⁶⁶ Notably, though, diversity jurisdiction is permitted only in cases of “complete diversity,” which requires all parties on either side of the litigation “v” to be citizens of different states.⁶⁷ Given the complete diversity requirement, plaintiffs will oftentimes strategically sue along with a co-plaintiff from the same state as the defendant in order to preclude removal under diversity jurisdiction.⁶⁸ Congress has taken steps to address these concerns as well. In class actions, defendant-corporations rely on the Class Action Fairness Act, another congressional statute that allows defendants to remove a case to federal court so long as the amount in controversy exceeds $5 million and there is diversity of citizenship.⁶⁹ The Class Action Fairness Act does not require complete diversity.⁷⁰

The Concern of Forum Shopping and Congress’s Inaction

Then there is the issue of “forum shopping.” This term refers to plaintiffs seeking fora that offer the best choice-of-law and substantive law combinations to benefit their case.⁷¹ Plaintiffs also prefer to file claims in their hometown jurisdictions, where juries and judges are more likely to be sympathetic to the hometown plaintiff.⁷² Put another way, plaintiffs will choose to sue in locations where the law the court applies is most favorable to them and courtroom decisionmakers are more likely to favor them. One prominent example of the implications of forum shopping is the application of anti-SLAPP laws in various states and their availability in federal court.⁷³ SLAPP stands for “strategic lawsuits against public participation.” An anti-SLAPP motion is a state-law procedural rule available in many states that allows a defendant to repel and quickly dismiss lawsuits that threaten the defendant’s free-speech rights or matters of public concern. When this motion applies, the burden shifts to the plaintiff to show a likelihood of prevailing in the lawsuit. Without such a showing, the plaintiff’s case is dismissed. Because anti-SLAPP motions are not creatures of federal law, different circuits have different interpretations of when they can apply in federal court.⁷⁴ Some circuits permit the invocation of state anti-SLAPP motions in federal court in diversity jurisdiction cases while others do not.⁷⁵ The difference in these circuits could mean extra litigation costs and a higher potential for settlement.⁷⁶ Accordingly, the location of where a lawsuit is filed is a crucial strategic decision plaintiffs make.

Congress has not fully addressed forum shopping concerns by statute. While Congress has required certain claims to be litigated exclusively in federal court,⁷⁷ Congress has few guidelines about which federal court plaintiffs are required to file in.⁷⁸ This is where personal jurisdiction comes in. Personal jurisdiction’s roots are grounded in the Constitution alone, but its newfound application is in part to curtail forum shopping. The tension between personal jurisdiction doctrine’s roots and its modern significance, along with Congress’s inaction to curtail forum shopping, is the premise of this Note.

The History of Personal Jurisdiction
The Consent and Estoppel Model for Corporations

In the nineteenth century, corporations were subject to personal jurisdiction only in the state in which they were incorporated because they did not have the privilege to exist in other states.⁷⁹ Other states could agree to recognize a corporation by a process called comity.⁸⁰ As part of comity, states could require corporations to consent to being subject to the personal jurisdiction of the state in which they are licensed to conduct business.⁸¹ Accordingly, the estoppel model took form: if a corporation exercised corporate privileges in a state, it would be estopped from arguing that it was not subject to the personal jurisdiction of that state.⁸²

The history⁸³ of this model arose in the 1800s to address the “injustice”⁸⁴ that would result if a corporation could not be subject to suit in a forum where it does business but nonetheless is not headquartered. States passed statutes that required corporations to consent to being sued in the state in exchange for the privilege of doing business in the state. One of the first cases to recognize this model was Ex parte Schollenberger.⁸⁵ The Pennsylvania statute at issue in Schollenberger required corporations to appoint an agent to receive service that would have “the same effect as if served personally on the company within the State.”⁸⁶ The statute in question did not explicitly grant jurisdiction, but the Court held that

if the legislature of a State requires a foreign corporation to consent to be ‘found’ within its territory, for the purpose of the service of process in a suit, as a condition to doing business in the State, and the corporation does so consent, the fact that it is found gives the jurisdiction, notwithstanding the finding was procured by consent.⁸⁷

A few years later, the Court explicitly held that this model was constitutional.⁸⁸

Importantly, the consent and estoppel model did not originally require a nexus between the litigation and the forum. Instead, courts have held that the corporation’s consent to be sued subjects the corporation to general jurisdiction in the forum. For example, in Pennsylvania Fire Insurance Co. v. Gold Issue Mining and Milling Co.,⁸⁹ an insurance company based in Pennsylvania conducted business operations in Missouri and, as required by Missouri law, appointed a Missouri in-state agent for service of process.⁹⁰ The insurance company contracted with an Arizona company to insure its buildings in Colorado.⁹¹ After the Colorado property was struck by lightning and significantly damaged, the Arizona company sued the Pennsylvania insurance company in Missouri over the Colorado contracts.⁹² The Pennsylvania insurance company argued that it was not subject to personal jurisdiction in Missouri because the contracts did not involve Missouri whatsoever; that is, there was no “nexus” between Missouri and the plaintiff’s claim.⁹³ The Court disagreed, explaining that “the construction of the Missouri statute thus adopted hardly leaves a constitutional question open.”⁹⁴ The appointment of an agent to receive service in Missouri, the Court held, showed the insurance company’s consent to be sued in Missouri.⁹⁵ This line of reasoning continued in at least three other cases.⁹⁶

The Erosion: Shift from Corporate Consent to Corporate “Presence”

The explicit corporate consent model could no longer hold up after the Court, in International Textbook Co. v. Pigg,⁹⁷ held that states could not impede interstate commerce by denying out-of-state corporations from exercising corporate privileges in their states. Put another way, the Court forbade states from denying corporations permission to conduct business within their borders. As such, corporations no longer affirmatively consented to being subject to the personal jurisdiction of states in which they engaged in business activities.⁹⁸ To remedy the doctrine, the Court, in International Harvester Co. v. Kentucky,⁹⁹ held that when a corporation was “present” in a jurisdiction, it was subject to the personal jurisdiction of that forum through, presumably, an implied consent.¹⁰⁰

Corporate “presence” proved to be a tricky term to define.¹⁰¹ Nevertheless, the remnants of the consent model held up well.¹⁰² Pennsylvania maintained its consent-by-jurisdiction framework and was the only state to explicitly inform corporations of what they were agreeing to by doing business in the state. Under Title 42, Section 5301(a) of the Pennsylvania Consolidated Statutes, registration to do business in Pennsylvania—which foreign corporations are required to do—constitutes consent to general jurisdiction in Pennsylvania courts.¹⁰³ For a time even after International Shoe, courts continued to enforce the consent and estoppel model in Pennsylvania. For example, the Third Circuit in Bane v. Netlink¹⁰⁴ held that there was no need to conduct a personal jurisdiction analysis (that is, to assess whether the defendant had systematic and continuous contact in the forum) because the defendant corporation consented to being subject to general personal jurisdiction in the state by virtue of the Pennsylvania statute.¹⁰⁵ The court distinguished that situation from another Third Circuit case, Provident National Bank v. California Federal Savings and Loan Association,¹⁰⁶ where the defendant had not registered to do business in Pennsylvania.¹⁰⁷

However, in late 2021, the Pennsylvania Supreme Court struck down the law requiring out-of-state corporations to submit to jurisdiction as a requirement of registering to do business in the state, finding that the statute is incompatible with the Fourteenth Amendment, as interpreted in Daimler.¹⁰⁸ The most recent Pennsylvania Supreme Court decision highlights the split over the constitutionality of such statutes. A number¹⁰⁹ of other state high courts have reached similar conclusions, rejecting the constitutionality of jurisdiction-by-consent statutes.¹¹⁰ And a number of other state high courts have reached the opposite conclusion, finding that such statutes are constitutional.¹¹¹ Other state high courts have used state law to reach conclusions in this area of the law.¹¹² In April of 2022, the Supreme Court granted certiorari over the Pennsylvania Supreme Court’s decision.¹¹³

But what about states that do not explicitly by statute inform defendant corporations that they would be subject to general personal jurisdiction in the state? Nearly every state requires foreign corporations to appoint an agent to receive service of process in the state.¹¹⁴ Courts were split as to whether these schemes subjected corporations to general personal jurisdiction in the state. Minnesota, for example, has a statutory scheme that allows service of process over a foreign corporation through service on the Minnesota Secretary of State.¹¹⁵ In that situation, though, the service is valid “only when based upon a liability or obligation of the corporation incurred within this state or arising out of any business done in this state by the corporation prior to the issuance of a certificate of withdrawal.”¹¹⁶ Various Minnesota state and federal courts have interpreted these statutes as creating consent to general jurisdiction for registered foreign corporations.¹¹⁷ In Knowlton v. Allied Van Lines,¹¹⁸ the Eighth Circuit held that the Minnesota statute requiring a registered agent within the state creates general jurisdiction in that state when service is processed on that agent.¹¹⁹ Particularly, the court noted that “[t]he whole purpose of requiring designation of an agent for service is to make a nonresident suable in the local courts,” and, as such, “appointment of an agent for service of process . . . gives consent to the jurisdiction of Minnesota courts for any cause of action, whether or not arising out of activities within the state.”¹²⁰

A nearly identical phenomenon has occurred in Iowa. Iowa federal courts, relying on Knowlton, found that an Iowa statute is “almost identical to that of Minnesota.”¹²¹ As such, even though it does not explicitly address jurisdictional consequences of registration, the statute confers general jurisdiction in Iowa courts.¹²² The same has been held to be true in Kansas¹²³ and New Mexico.¹²⁴ The Georgia Supreme Court reaffirmed the concept as well.¹²⁵ And even after International Shoe fundamentally changed the personal jurisdiction analysis, several circuit courts continued to hold that consent by registration obviated the due process analysis and that states could exercise general jurisdiction based on that consent.¹²⁶ This is not to say that there are no federal circuits holding to the contrary. While six circuits have found jurisdiction-by-consent statutes to be constitutional,¹²⁷ five circuits reached the opposite conclusion.¹²⁸ And two circuits avoided the constitutional question.¹²⁹ These decisions are all in flux, given the Supreme Court’s decision in 2022 to grant certiorari and review the Pennsylvania statute.¹³⁰

But “presence” and “consent” are two distinct ways of submitting to jurisdiction. Putting aside the question of whether a corporation “consents” through registering to do business—the question that the Supreme Court will aim to answer in Mallory—there is a simpler way to determine the existence of personal jurisdiction: assessing whether the corporation has engaged in systematic and continuous contact in the forum state. The Supreme Court’s guidance in Ford sheds light on where the Court may be heading on the “presence” front. The hallmarks of due process in the context of the consent and estoppel model are reciprocity and fairness.¹³¹ Ford seemed to reiterate the underlying theme of “reciprocal obligations” between a defendant and the forum as the basis for what makes the exercise of personal jurisdiction “fair.”¹³² In that case, because Ford Motor Company enjoyed “the benefits and protections” of state law while doing business in the forum, “allowing jurisdiction in these cases treats Ford fairly.”¹³³

The Fourteenth Amendment: Personal Jurisdiction and the Current Doctrine

With the passing of the Fourteenth Amendment in 1868, the Supreme Court saw it proper to provide guidance on personal jurisdiction under a now-federalized due process standard. In Pennoyer,¹³⁴ the Court held that a court may exercise personal jurisdiction over a party only if that party was served with process in the state seeking to adjudicate the controversy.¹³⁵ As explained above, this ruling was consistent with the consent and estoppel model and the subsequent corporate presence model.

Despite Pennoyer’s overruling by International Shoe, the Court remained true to the spirit of the corporate consent and estoppel model. Pennoyer was overruled and substituted with the “minimum contacts” test in International Shoe.¹³⁶ Under the new International Shoe standard, a defendant becomes subject to the personal jurisdiction of a state with which it engages in “minimum contacts.”¹³⁷ The test was later refined in Hanson v. Denckla¹³⁸ to define “minimum contacts” as contacts that demonstrate a defendant’s “purposeful availment” of the jurisdiction.¹³⁹ In other words, a corporate defendant becomes subject to the personal jurisdiction of a forum if it takes a purposeful action to benefit from the privilege of doing business in that forum.¹⁴⁰ Similarly, under World-Wide Volkswagen Corp. v. Woodson,¹⁴¹ the foreseeability of causing injury in a particular location was held not to be enough to subject a corporation to the personal jurisdiction of the courts in that location.¹⁴² Therefore, while the International Shoe test, along with its refinements in Hansen and World-Wide Volkswagen, departed from the Pennoyer service-of-process test, it remained consistent with the consent and estoppel model and the corporate presence model. “Minimum contacts” and “purposeful availment” became the tests to determine whether a corporation was “present” in a forum such that it should be subject to the personal jurisdiction of the forum. Foreseeability of injury, on the other hand, is not synonymous with corporate presence and therefore was not a basis for personal jurisdiction, just as a corporation cannot be “present” in a location based on foreseeability of injury alone and cannot be said to have “consented” to jurisdiction based on foreseeability of injury alone. Applying the new test, the Court in McGee v. International Life Insurance Co.¹⁴³ found that a California court could subject a Texas insurance company to its personal jurisdiction, even though the insurance company had a single policy contract with a California resident.¹⁴⁴ The corporation was found to have been present in California because it entered into a contract directly in California.¹⁴⁵

The adherence to the origins of the personal jurisdiction consent and estoppel and corporate presence model did not last. In Burger King v. Rudzewicz,¹⁴⁶ the Supreme Court subtly revised its test for personal jurisdiction beyond McGee and bifurcated what was previously a one-step “minimum contacts” test.¹⁴⁷ The Court fractured the original intention of International Shoe, holding that personal jurisdiction can be established if two elements are met: (1) the defendant engaged in minimum contacts/purposeful availment of the forum state; and (2) the subjugation of personal contacts does not offend “traditional notions of fair play and substantial justice.”¹⁴⁸ The Court, in a split decision,¹⁴⁹ later created five factors by which to determine whether establishing personal jurisdiction over an out-of-state defendant would violate “traditional notions of fair play.”¹⁵⁰

Two concurrences are most telling in just how far this doctrine has gone adrift. Justice Brennan’s concurrence in Asahi v. Superior Court argued that a defendant’s placing of a product into the stream of commerce may very well satisfy the minimum contacts prong but that it would not satisfy the “fair play and substantial justice” prong. That is, showing minimum contacts is not enough.¹⁵¹ Justice John Paul Stevens, in concurrence, agreed that jurisdiction would be “unreasonable and unfair,” but he did not join Justice O’Connor’s opinion, in part because the Court should not have even considered minimum contacts. He wrote that “it is not necessary to the Court’s decision. An examination of minimum contacts is not always necessary to determine whether a state court’s assertion of personal jurisdiction is constitutional.”¹⁵² Minimum contacts, however, is the key framework under which corporate presence is determined.¹⁵³

III. THE HISTORY OF THE NEXUS REQUIREMENT

Under current doctrine, a defendant is subject to the specific personal jurisdiction of a forum if the controversy “arises out of” or “relates to” the defendant’s contact with the forum state. This was first hinted at in Shaffer v. Heitner,¹⁵⁴ in which the Court held that in rem jurisdiction—jurisdiction based solely on the presence of a defendant’s property in the forum—is insufficient on its own to establish personal jurisdiction.¹⁵⁵ In Shaffer, plaintiffs filed a shareholder derivative suit against a corporation and corporate executives.¹⁵⁶ The basis for personal jurisdiction in the selected forum was the defendant’s property in the forum. The Court held the following:

The presence of property in a State may bear upon the existence of jurisdiction by providing contacts among the forum State, the defendant, and the litigation, as for example, when claims to the property itself are the source of the underlying controversy between the plaintiff and defendant, where it would be unusual for the State where the property is located not to have jurisdiction. But where, as in the instant quasi in rem action, the property now serving as the basis for state-court jurisdiction is completely unrelated to the plaintiff’s cause of action, the presence of the property alone, i.e., absent other ties among the defendant, the State, and the litigation, would not support the State’s jurisdiction.¹⁵⁷

The Court further explained that

although the presence of the defendant’s property in a State might suggest the existence of other ties among the defendant, the State, and the litigation, the presence of the property alone would not support the State’s jurisdiction. If those other ties did not exist, cases over which the State is now thought to have jurisdiction could not be brought in that forum.¹⁵⁸

In making its determination, the Court acknowledged that it was backtracking from the “long history of jurisdiction based solely on the presence of property in a State”¹⁵⁹ by now requiring a “relationship among the defendant, the forum, and the litigation” in order to establish personal jurisdiction.¹⁶⁰ Therefore, the Court engaged in a policy analysis in its departure from the traditional doctrine. It did so presumably to curtail the shareholders’ forum shopping, despite the fact that the defendant corporation was “present” in Delaware due to its property in the state. As such, the Court looked beyond the original understanding of personal jurisdiction. Under the “long history of jurisdiction,” personal jurisdiction could be established “based solely on the presence of property in a State.”¹⁶¹ Even under the “minimum contacts” test from International Shoe, if a defendant owns property in a state, then that defendant has the minimum contacts necessary to subject it to personal jurisdiction in that forum. Given that Congress has provided no guidance on jurisdiction besides the venue statutes, the proper remedy for defendants faced with lawsuits in locations they prefer not to litigate in is to seek to transfer the case to a more appropriate venue.¹⁶²

In future cases, the Supreme Court attempted to assert that the nexus requirement is, in fact, rooted in the original understanding of the Fourteenth Amendment’s due process standard. In Daimler AG v. Bauman,¹⁶³ the Court explained that the concept of “reciprocal fairness” between corporations and the states in which they conduct business implies a nexus requirement. The Court never attempted to argue that the nexus requirement is rooted in the Pennoyer test, but the Court quoted a passage in International Shoe in support of its argument:

The exercise of th[e] privilege [of conducting corporate activities within a State] may give rise to obligations, and, so far as those obligations arise out of or are connected with the activities within the state, a procedure which requires the corporation to respond to a suit brought to enforce them can, in most instances, hardly be said to be undue.¹⁶⁴

But the reliance on International Shoe for this proposition is not entirely accurate. While International Shoe blessed the exercise of jurisdiction in cases where the suit arose out of the defendant’s contact with the state, it explicitly left open the possibility of the exercise of jurisdiction without such a nexus requirement:

While it has been held in cases on which appellant relies that continuous activity of some sorts within a state is not enough to support the demand that the corporation be amenable to suits unrelated to that activity, there have been instances in which the continuous corporate operations within a state were thought so substantial and of such a nature as to justify suit against it on causes of action arising from dealings entirely distinct from those activities.¹⁶⁵

Historically, regarding the personal jurisdiction of corporations, there were instances in which a nexus requirement was explicitly rejected, that is, situations in which the exercise of jurisdiction was upheld despite the lawsuit not arising from the defendant’s contact with the forum.¹⁶⁶

While the origins of the nexus requirement have to do with the defendant’s presence connecting with the litigation filed against it, the nexus requirement has now shifted to require the plaintiff’s connection with the forum state as well. The case cited by recent decisions for this proposition is Helicopteros Nacionales de Colombia, S.A. v. Hall.¹⁶⁷ But importantly, Helicopteros’s understanding of “general jurisdiction” differs from what the term means today. Heliopteros specifically maintains that

[e]ven when the cause of action does not arise out of or relate to the foreign corporation’s activities in the forum State, due process is not offended by a State’s subjecting the corporation to its in personam jurisdiction when there are sufficient contacts between the State and the foreign corporation.¹⁶⁸

The Court cited Perkins v. Benguet Consolidated Mining Co.¹⁶⁹ for this proposition. In Perkins the Court found that a foreign corporation not incorporated or headquartered in Ohio could be subject to general jurisdiction in Ohio in a suit filed by a nonresident of Ohio when the cause of action did not arise out of or relate to the forum because “the foreign corporation, through its president, ‘ha[d] been carrying on in Ohio a continuous and systematic, but limited, part of its general business,’ and the exercise of general jurisdiction over the Philippine corporation by an Ohio court was ‘reasonable and just.’ ”¹⁷⁰ In other words, Helicopteros does not require a nexus between the litigation and the forum so long as there is a “continuous and systematic” existence of the corporation in the forum.¹⁷¹ Presumably, then, Helicopteros only requires the plaintiff to show that the litigation “arises out of” or is “related to” the forum in situations where the defendant is not “continuos[ly] and systematic[ally]” present in the forum.¹⁷²

The root of the confusion regarding the nexus requirement is that it was created before the concepts of “specific” and “general” jurisdiction existed or were properly defined. The Helicopteros court, relying on Perkins, held that if a corporation’s presence was “systematic” and “continuous” in a forum, then it would be subject to general jurisdiction in that forum such that no nexus is required at all.¹⁷³ This is no longer the case today. A major reason why this is no longer the case is because of a prophetic article written by two Harvard Law School professors, which influenced the Court significantly.¹⁷⁴ These professors have dubbed the terms we currently refer to as “general jurisdiction” and “specific jurisdiction,” while also defining the two to their near identical meanings in the current doctrine.¹⁷⁵ The Court in Daimler adopted the policy proposed by the article, holding that a corporation is subject to general jurisdiction only in its place of corporation and its principal place of business.¹⁷⁶ There is one crucial problem with the article, however: it is not premised on the Due Process Clause of the Fourteenth Amendment; instead, it is premised on creating the best policy for which to adjudicate matters and includes forum shopping and convenience for the parties as some of its major supporting propositions. But the underlying reasoning for personal jurisdiction is not convenience or effective policy—these are considerations Congress ought to consider in statutes dictating proper venues for litigation. The sole consideration in personal jurisdiction jurisprudence is due process.

After the terms general and personal jurisdiction were given their current definition, the Court in Bristol-Myers Squibb applied the Helicopteros rule without consideration of the Helicopteros Court’s understanding of general jurisdiction.¹⁷⁷ As a result, it muddied the waters significantly. In Bristol-Myers Squibb, a group of medicine users sued in California state court a corporation that manufactured the drugs in California.¹⁷⁸ Some of the plaintiffs were not California residents.¹⁷⁹ The Court held that the non-California plaintiffs could not sue in California because there was no nexus between their litigation and the forum; the non-California plaintiffs’ claims did not “arise out of” or “relate to” California.¹⁸⁰ Put another way, the Court held that it violated the defendant’s due process right to be sued in California by one group of plaintiffs but not another group of plaintiffs for the same cause of action and the same set of events, and the differentiating factor was the plaintiffs’ place of residency¹⁸¹ This analysis of the plaintiffs’ connection to the forum is the current understanding of personal jurisdiction, specifically the nexus requirement.

Notably, for the sake of judicial economy, the Bristol-Myers Squibb litigation was consolidated through multi-district litigation, commonly referred to as “MDL,”¹⁸² and pretrial proceedings for both groups of plaintiffs took place jointly in California. Through the MDL process, the two groups of plaintiffs could litigate only pretrial issues together in California without regard to personal jurisdiction. Courts have struggled with the application of personal jurisdiction to MDL proceedings.¹⁸³ While personal jurisdiction in MDL is outside the scope of this Note, this set of events illustrates that courts take no issue with altering personal jurisdiction doctrine to promote judicial economy and the MDL process, yet they will continue to unnecessarily protect corporate defendants by rigidly upholding the nexus requirement in cases that are not large enough to consolidate through the MDL process.¹⁸⁴

ISSUES WITH THE CURRENT DOCTRINE

Based on the above explanations, personal jurisdiction doctrine has strayed away from its original roots of the Fourteenth Amendment and has drifted into a way of curtailing forum shopping. In many circles, this reason alone is enough to demand alteration.¹⁸⁵ However, as I explain below, not only is the current doctrine inconsistent with the original understanding of personal jurisdiction, but it also causes complications in the context of internet sales and stream of commerce cases. In this section, I detail the current doctrine’s shortcomings. In the following section, I preview a direction the Court may be heading: a reversion to the original understanding of personal jurisdiction based on the corporate consent and estoppel model.

It Provides Corporations with Protections They Are Not Entitled to

While the inconsistency with prior case law and the historical application of personal jurisdiction doctrine are by themselves sufficient to question the nexus requirement in its current form, the present standard is also problematic from a policy perspective. It provides corporations with additional protections not mandated by the Constitution—and nonexistent under statute—under the guise of due process.

Take Ford as an example. The Court held that the Ford Motor Company can be subject to personal jurisdiction in Montana for a case involving Ford Explorer vehicles because it sells Ford Explorers in Montana such that it “cultivated a market” there.¹⁸⁶ But, presumably, if Ford sold different models in Montana and did not sell Explorers, there would be no jurisdiction over the plaintiff’s case in Montana because requiring Ford to answer a complaint in Montana under those circumstances would violate the Due Process Clause. This framing of the “market” being “cultivated” is shaky at best. Does it matter which year Ford started selling the Explorer in Montana? What if the model in question was older than the models Ford has sold in Montana? Does the trim of the model matter? What about the model’s color?¹⁸⁷

The Court also held that the plaintiffs’ contacts with Montana are determinative.¹⁸⁸ The Court held that if Ford sells Explorers in Montana, then Montana can decide any case involving an Explorer accident within its borders, regardless of how it got there, so long as the plaintiff has a connection to Montana.¹⁸⁹ So, no matter how extensive Ford’s contacts with Montana might be, the determinative factor is the plaintiff’s connection with the forum. But what difference does it make to Ford whether the Explorer crash took place in Montana or in Idaho? If Ford already has significant contact with Montana such that it has cases pending in Montana, Ford would not be required to conduct any additional expenses to defend itself in Montana. Requiring Ford to defend one lawsuit in Montana while allowing Ford to dismiss an identical lawsuit solely on the basis of the plaintiff’s place of residency and connection to Montana is perplexing. Requiring Ford to defend the first lawsuit is no more a violation of Ford’s due process rights than it is to require Ford to defend against the second lawsuit.

Similarly, Bristol-Myers Squibb emphasized that the Michigan plaintiff’s suing the defendant corporation in California violated the defendant’s due process rights because the Michigan plaintiffs “did not ingest Plavix in California.”¹⁹⁰ Nevertheless, a group of plaintiffs from California was permitted to sue in California for the same cause of action relating to the same drugs. The only difference between the two groups of plaintiffs is where they ingested the drugs. But that fact should not have been determinative. What difference does it make if a Texan brings his pills on a California vacation and ingests them there or if the Texan ingested the pills in Texas? ¹⁹¹ It is odd to argue that these hypothetical cases, as opposed to the ones previously before the Court, would not violate the defendant’s due process rights by allowing jurisdiction in each of these otherwise-identical cases. Figures 1 through 4 below illustrate how the doctrine plays out.

Figure 1. Nexus with Forum State Through Plaintiffs’ Residence

Figure 2. Nexus with the Forum State Through Plaintiffs’ Vacation

Figure 3. No Nexus Despite Defendant’s Continuous

Figure 4. Scenarios Analyzed Under the Current Doctrine
	Scenario #1	Scenario #2	Scenario #3
Did the Manufacturer purposefully avail itself of CA?	YES. It sold pills in CA.	YES. It sold pills in CA.	YES. It sold pills in CA.
Do the plaintiffs have a “nexus” to CA?	YES. They bought and ingested the pills in CA.	YES. They ingested the pills in CA.	NO. They did not buy or ingest the pills in CA.
Conclusion	CA courts have personal jurisdiction over CA plaintiffs’ claims.	CA courts have personal jurisdiction over TX plaintiffs’ claims.	CA courts do not have personal jurisdiction over TX plaintiffs’ claims.

The Ford decision also raises questions about the general jurisdiction framework. The Court seems to erode that concept, perhaps unintentionally. If Ford can “cultivate a market” in a forum, then it can be subject to personal jurisdiction for claims relating to that forum, so long as the plaintiff has a connection to the forum as well. As explained above, the “market” being “cultivated” can prove to be a difficult term to define. And requiring the plaintiff’s connection to the forum results in illogical and arbitrary grants and denials of jurisdiction, as illustrated in the above figures.

Under current general jurisdiction jurisprudence, a corporation is subject to general jurisdiction wherever it is “at home,” which has been held to mean its place of incorporation and headquarters.¹⁹² But why is it any more compliant with due process for a plaintiff residing in Idaho to sue General Motors (incorporated in Delaware and headquartered in Michigan) in Delaware and Michigan as opposed to Texas, where the company has had a factory and has done business since 1954?

The Court’s attempt at showing that Ford cultivated a market in Montana begins to bleed into the general jurisdiction framework. It would be a much simpler and more predictable test to ask whether Ford has “minimum contacts” such that it “purposefully availed” itself of the privilege of doing business in Montana, and consequently, it is subject to personal jurisdiction in Montana.

Justice Sotomayor pointed this out in her Daimler concurrence, noting that limiting general justification to a corporation’s principal place of business and its place of incorporation would lead to “deep injustice.”¹⁹³ She pointed out that “the majority’s approach unduly curtails the States’ sovereign authority to adjudicate disputes against corporate defendants who have engaged in continuous and substantial business operations within their boundaries.”¹⁹⁴ She then called into question the special protections corporations would be receiving under the newly defined due process requirements:¹⁹⁵ “Put simply, the majority’s rule defines the Due Process Clause so narrowly and arbitrarily as to contravene the States’ sovereign prerogative to subject to judgment defendants who have manifested an unqualified ‘intention to benefit from and thus an intention to submit to the[ir] laws.’ ”¹⁹⁶

There is some indication based on Ford, the Court’s personal jurisdiction case from 2021, that at least some of the Justices are questioning the existing precedent. In particular, Justice Gorsuch’s Ford concurrence, which was joined by Justice Thomas, expressed skepticism of the “at home” test for corporations regarding general jurisdiction.¹⁹⁷ He wrote, “[I]t seems corporations continue to receive special jurisdictional protections in the name of the Constitution. Less clear is why.”¹⁹⁸

Difficult Application to Internet Sales Cases

The current doctrine does not adequately address how courts should apply it to cases involving internet sales. When it comes to determining purposeful availment, courts look to whether online conduct was purposefully directed at the forum state.¹⁹⁹ Courts also use a sliding scale to determine whether the contacts constitute purposeful availment.²⁰⁰ For example, if a website is passive because it only advertises or posts information without any option for users to interact with it, the website may not provide a basis for personal jurisdiction. On the other hand, if the website involves making transactions or entering into contracts through knowing and repeated transmission of files over the internet, personal jurisdiction seems more likely. If the website falls in between these two categories of interactivity, the level of the interactivity and the nature of the website must be examined. In other words, the greater the commercial nature and interactivity associated with the website, the more likely the website operator engaged in purposeful availment of the forum state.

As a corollary to the sliding scale, courts have also recognized that tortious conduct that takes place online can subject a defendant to personal jurisdiction.²⁰¹ If the defendant’s actions were intentional, uniquely or expressly aimed at the forum state, and caused harm in the forum state, personal jurisdiction is proper there, because the defendant is said to have “purposefully directed” actions at the forum state.²⁰² A refined test examines whether the defendant knew and intended the consequences of its actions to be felt in the forum state, not just that the defendant knew where the plaintiff lives.²⁰³ That is, if the mention of the state is incidental and not included for the purposes of having the consequences felt in the forum state, there is likely no personal jurisdiction there.

For example, suppose that an Idaho newspaper, which distributes only in Idaho and the bordering towns in Washington State, publishes a story defaming a California celebrity. Can it be said that the newspaper intended the consequences of its story to be felt in California, given that it does not distribute in California? The newspaper has no contacts with California, so how can it be said that the newspaper purposefully availed itself of the privilege of doing business in California?

The Ford case adds an additional element that muddies the water even more. What if a corporate defendant “cultivates a market” in a forum? Under Ford, plaintiffs would be permitted to sue in that forum so long as they have a nexus to that market. In the above hypothetical, would the California celebrity be permitted to sue in Washington State because of the market the newspaper cultivated there? Also, as explained above, it is difficult to define the product that a company cultivates a market for, and framing the market being cultivated is highly malleable. For example, does Amazon cultivate a market for delivery in California? Or is the cultivated market analyzed by specific products, as it was in Ford? Assuming the latter, what is the justification of looking at the plaintiff’s connection to the forum to assess the due process rights of the defendant?

Difficult Application to Stream of Commerce Cases

There is no agreed-upon framework by which to address stream of commerce cases. A “stream of commerce” case refers to a situation where a manufacturer sells products to a regional distributor and the regional distributor sells the products elsewhere. For example, assume that a car company manufactures its cars in China and then sells the fully manufactured cars to a distributor in California and no distributors in Oregon. Then assume that the California distributor sold the cars to a dealership in Oregon, and an Oregon resident bought a car from that dealership. If the car malfunctions, may the Oregon resident sue the manufacturer in Oregon? The question is whether the car manufacturer engaged in minimum contacts or purposefully availed itself of doing business in Oregon through the stream of commerce that brought its product to Oregon.

Justice White, in dicta in in World-Wide Volkswagen, suggested that there may exist personal jurisdiction over a manufacturer in a forum even if the manufacturer itself did not sell in that forum; he wrote that personal jurisdiction would exist in such a situation only if the sale in the forum was “not simply an isolated occurrence, but arises from the efforts of the manufacturer or distributor to serve, directly or indirectly the market for its product.”²⁰⁴

The Court has been unable to agree on what these instructions practically mean. Justices Breyer and Alito understand this to refer to the number or substantiality of the sale in the forum.²⁰⁵ Justice O’Connor’s plurality opinion in Asahi held that there would be personal jurisdiction over a defendant manufacturer only if Justice White’s criteria were satisfied and the manufacturer engaged in “additional conduct . . . [that indicates] an intent or purpose to serve the market in the forum state.”²⁰⁶ This could include designing the product for the forum state, advertising the product in the forum state, or establishing channels for providing regular contact to consumers in the forum state. Justice Brennan, writing for the split court in Asahi, indicated that if the maker foresees and benefits from the contact with the forum, personal jurisdiction is satisfied, even without an intentional act targeting the forum.²⁰⁷

The Court’s split continued in McIntyre, in which the Justice Kennedy plurality held that a foreign manufacturer that sold products to a U.S. distributor was not subject to personal jurisdiction in the states the distributor subsequently distributed to.²⁰⁸ Justice Ginsberg dissented and wrote that, in her view, there is personal jurisdiction in a state when a manufacturer chooses a distributor who distributes to the entire United States.²⁰⁹ Justices Breyer’s and Alito’s concurrence explained that personal jurisdiction should be dependent on the number of products sold in the state.²¹⁰

Figure 5 synthesizes current steam of commerce doctrine:

Figure 5. Current Stream of Commerce Doctrine

The Ford case presented a slightly nuanced version of the hypothetical discussed above. In Ford, the vehicle that malfunctioned was designed, manufactured, and sold outside of Montana.²¹¹ Later resells and relocations by consumers brought the vehicle to Montana, where it malfunctioned.²¹² As such, it was only through the “stream of commerce” that the particular vehicle at issue was brought to Montana.²¹³ The Court united in its holding that Ford’s advertising in and manufacturing in Montana constituted sufficient purposeful availment, and held that the plaintiffs had a sufficient nexus to the forum simply because the car malfunctioned in Montana, even though they did not purchase the vehicle in Montana.²¹⁴ However, if, through the stream of commerce, the plaintiffs were in the neighboring state of Idaho, then presumably there would be no nexus and no personal jurisdiction in Montana, even though the facts—and Ford’s purposeful availment of the Montana forum—would be identical. It is unclear why Ford’s due process rights would be violated if an Idaho plaintiff sues Ford in Montana but would not be violated if a Montana plaintiff who purchased Ford’s product in Wisconsin and drove to Montana sues in Montana.

Figure 6. The Ford Litigation

WHERE THE COURT IS HEADED AFTER FORD

A reversion to the constitutional underpinnings of personal jurisdiction doctrine means removing the corporate protections available under the guise of the Fourteenth Amendment. The case is easier for removing plaintiffs’ requirement to show a nexus to the litigation when suing a corporation in a forum in which the corporation has systematic and continuous contact. As explained above, the nexus requirement came into being with the explicit understanding that it is a requirement only if the corporate defendant has no systematic and continuous contact with the forum. In other words, the case is easy for overruling Daimler and removing the narrow understanding of general jurisdiction, finding instead that general jurisdiction exists wherever corporations implicitly consent to personal jurisdiction through systematic and continuous contact.

However, some Justices may propose going a step further and removing the distinction between specific and general jurisdiction as it is inconsistent with the Court’s original understanding of personal jurisdiction. Accordingly, plaintiffs would be permitted to pursue causes of action in any forum in which a corporation engages in minimum contacts sufficient to constitute purposeful availment without showing a nexus to the litigation. Corporate defendants would be permitted to transfer cases under the venue statutes alone.

In situations where a corporation had purposefully availed itself of a forum in a previous one-off occurrence, the claim brought in that forum must allege conduct that took place during the purposeful availment of the selected forum.²¹⁵ That is, a corporation would not be able to retroactively cease purposeful availment.

I have already explained why this model is consistent with the original understanding of personal jurisdiction. In many circles, this reason alone would be sufficient to adopt it. However, in this Part I detail why a reversion to this original understanding is good policy as well. It increases fora for plaintiffs, makes for a more predictable personal jurisdiction doctrine (especially in cases involving internet sales and the stream of commerce), and leaves room for Congress to act should it find the need to.

Removing the Distinction Between General and Specific Personal Jurisdiction

One avenue of development post-Ford envisions significantly expanding general jurisdiction in the way it is understood today. This would mean overruling the holding in Daimler, which permits general jurisdiction over a corporation only in its principal place of business and place of incorporation.²¹⁶ As explained above, the case is easy to revert to the pre-Daimler jurisprudence, where general jurisdiction existed in each location where a corporation engaged in continuous and systematic contact with a forum, because that was the original understanding of personal jurisdiction. Nothing in the original understanding of personal jurisdiction, early cases dealing with the doctrine, or the Fourteenth Amendment compels affording corporations protections from being forced to defend lawsuits in fora besides their place of incorporation and headquarters. Such protections limiting personal jurisdiction can only come from statutes, and Congress has not legislated in the arena of personal jurisdiction.

However, assessing personal jurisdiction solely through the lens of purposeful availment reveals that the concept of general jurisdiction is unnecessary, especially after it was eroded in Ford. Ford held that if a corporation systematically serves a market, and the plaintiffs are from that forum state, it is as if there is general jurisdiction for those specific plaintiffs in the forum.²¹⁷ But if a corporation is already prepared to defend against lawsuits in a particular jurisdiction, it does not offend due process rights to require the corporation to defend against all lawsuits in that jurisdiction, subject to transfer of venue “in the interest of justice.”²¹⁸

Furthermore, as Douglas D. McFarland points out in his scholarship,

The original, unpolished International Shoe test is a one-step, unitary test. A court is not required to find “minimum contacts” and “fair play and substantial justice.” Neither is a court required to find “minimum contacts” or “fair play and substantial justice.” The opinion requires a court find “minimum contacts with [the state] such that the maintenance of the suit does not offend “traditional notions of fair play and substantial justice.”²¹⁹

Given this understanding, it becomes clear that it is more consistent with the Due Process Clause and International Shoe to allow jurisdiction for all claims in a forum where the defendant corporation engaged in “minimum contacts.”²²⁰ There need not be a difference between the type of claim permitted to originate in that forum. In other words, personal jurisdiction in a forum should be defined by defendant, not by claim. If a defendant is subject to personal jurisdiction in a particular location, then that defendant should be subject to personal jurisdiction in that location for all claims and should be permitted to transfer cases under the guidelines provided by Congress alone. The following figure is identical to Figure 4 above, referencing the three scenarios in Figures 1, 2, and 3, but it includes an extra row showing how the ultimate conclusion would change should Daimler be overruled.

Figure 7. Scenarios Analyzed Under the Consent and Estoppel Model
	Scenario #1	Scenario #2	Scenario #3
Did the Manufacturer purposefully avail itself of CA?	YES. It sold pills in CA.	YES. It sold pills in CA.	YES. It sold pills in CA.
Do the plaintiffs have a “nexus” to CA?	YES. They bought and ingested the pills in CA.	YES. They ingested the pills in CA.	NO. They did not buy or ingest the pills in CA.
Conclusion under current doctrine	CA courts have personal jurisdiction over CA plaintiff’s claims.	CA courts have personal jurisdiction over TX plaintiff’s claims.	CA courts do not have personal jurisdiction over TX plaintiff’s claims because there is no nexus.
Conclusion without Daimler	CA courts have personal jurisdiction because the manufacturer purposefully availed itself of California law.	CA courts have personal jurisdiction because the manufacturer purposefully availed itself of California law.	CA courts have personal jurisdiction because the manufacturer purposefully availed itself of California law.

Clarifying the Doctrine for Internet Sales Cases

A straightforward and predictable test for personal jurisdiction solves issues relating to internet sales cases. Internet sales would be analyzed in the same way as all other sales cases: if the seller does business in the forum, then the plaintiff should be permitted to sue the seller in that forum. Doing business means selling a product in that forum. If a seller wants to avoid being subject to personal jurisdiction in a particular forum, then it can choose not to sell in that forum.

Take, for example, a 2021 Third Circuit case²²¹ involving a lawsuit against Imgur and Reddit, two internet companies, alleging that the companies were compliant in an authorized use of the plaintiff’s likeness when a photo of her in a convenience store began circulating on these websites in an advertisement for erectile dysfunction and dating websites. Living in Pennsylvania, the plaintiff decided to sue Imgur and Reddit in Pennsylvania, despite knowing neither the convenience store’s location nor how the image was posted online. Both companies conceded that they had purposefully availed themselves of the privilege of doing business in Pennsylvania.²²² They nevertheless argued that their minimum contacts with Pennsylvania were not related to the litigation—in other words, they argued that there was no nexus between the plaintiff’s claim and the forum. The Third Circuit agreed with the District Court’s dismissal for lack of personal jurisdiction.²²³

There are troubling implications with this holding. First, a plaintiff is required to do additional research before the opening of discovery to determine where online harm originated. The court found unconvincing the argument that personal jurisdiction is proper in Pennsylvania because that is where the harm took place. Second, the court’s attempted distinction²²⁴ from Ford draws an arbitrary line. Just as in Ford, in which the motor company “systematically served a market in Montana and Minnesota for the very vehicles that the plaintiffs allege malfunctioned and injured them in those States,”²²⁵ here, the internet companies systematically served a market for the very product that was used to cause the harm—the platform in which the unauthorized posting of the plaintiff’s photo took place.

In other words, distinguishing the type of product that the market was “systematically served” with is unpredictable and malleable. A much more straightforward approach would be to look only at whether Reddit and Imgur continuously and systematically served the market, which they in all likelihood did. And even if they did not continuously and systematically serve the market, they certainly had minimum contacts with Pennsylvania such that they purposefully availed themselves of the state. The burden would then be on the defendant corporations to move for a transfer of venue. The presumption should be that due process is not violated because of the companies’ purposeful availment within the forum. If the corporate defendants seek to transfer venue, they would need to provide evidence for why there is a more suitable venue.

Accordingly, removing the nexus requirement and analyzing personal jurisdiction solely through purposeful availment—and assessing whether the online activity is in fact a purposeful availment—resolves the issue. To be clear, a company may “cultivate a market” in a forum without ever stepping foot into that forum.²²⁶ As such, in the case described above, the defendants Imgur and Reddit would be subject to personal jurisdiction in the selected forum because of their admitted purposeful availment and presence in the forum.

This approach is consistent with other cases that have analyzed the issue of internet sales: under current doctrine, a corporate defendant can expect to be subject to personal jurisdiction in a venue in which a “substantial number of copies are regularly sold and distributed.”²²⁷ In Keeton v. Hustler Magazine, Inc.,²²⁸ the Supreme Court upheld the exercise of jurisdiction in New Hampshire over a nonresident magazine publisher defendant.²²⁹ The Court reasoned that although the magazine publisher had a nationwide audience and had not targeted the forum particularly, it should reasonably anticipate an action “wherever a substantial number of copies are regularly sold and distributed.”²³⁰ The same should be true when it comes to internet sales. Therefore, if a corporation wants to avoid being subject to personal jurisdiction in a particular location, it may cease its business operations in that state.²³¹

Clarifying the Doctrine for Stream of Commerce Cases

A reversion to the original understanding of personal jurisdiction would simplify the analysis in stream of commerce cases. Removing the nexus requirement shifts the analysis solely to determining whether the defendant purposefully availed itself of a forum. Courts would look not at whether the plaintiff’s alleged harm has a connection to the forum, since these are venue concerns, not due process concerns.

Instead, courts would assess, as the Court did in Ford, whether the manufacturer “cultivated a market” in the forum state such that it is fair and just to require the defendant to defend a lawsuit in that jurisdiction. As with internet sales, if a manufacturer does not want to be subject to personal jurisdiction in a particular state, it may direct its distributor not to distribute products into that state.²³² Without such instruction, and if the distributor supplies products in a state, the manufacturer would have minimum contacts with that state that constitute purposeful availment. Under the original understanding of personal jurisdiction, any plaintiff would be permitted to sue the manufacturer in that state, irrespective of whether the plaintiff’s cause of action arises from the manufacturer’s contacts. The manufacturer would then be permitted to transfer the case using the venue statutes.

Addressing Forum Shopping

It is clear that reverting to the previous personal jurisdiction doctrine would pave a path to forum shopping.²³³ As an initial matter, one must ask whether the negative effects of forum shopping warrant such significant constitutional maneuvering to counter the practice. Perhaps a free market that permits forum shopping is beneficial, as some scholars have argued.²³⁴ Forum shopping may cause beneficial competition among states to alter their laws if they want to stimulate businesses.²³⁵ Just as a company considers taxes, state law, and other benefits, so too should companies consider being subject to personal jurisdiction in a state if they want to maintain a presence in that state.²³⁶

Some scholars have pointed out that the possibility of forum shopping provides judges with incentives to make the law more pro-plaintiff and that these judges’ actions have the possibility of creating wide-ranging effects, given that their courts will likely attract a disproportionate share of cases.²³⁷ Professor Dan Klerman points to several examples of this phenomenon taking place, the most prominent being the patent-plaintiff-friendly Eastern District of Texas and plaintiff-friendly mass tort jurisdictions such as Madison County, Illinois. Both of these venues have seen a dramatic uptick in the number of claims filed there.

As a result of these observations, scholars conclude that “[c]onsideration of forum selling helps justify constitutional constraints on personal jurisdiction. Without constitutional limits on jurisdiction, some courts are likely to be biased in favor of plaintiffs in order to attract litigation.”²³⁸ However, these policy considerations are for Congress to consider. The solution to these concerns is not judge-made constitutional limits on jurisdiction, because the Constitution is silent on forum shopping.²³⁹ Instead, the solution may be statutory limits on jurisdiction.

It goes without mentioning that parties engage in forum shopping in drafting forum-selection and choice-of-law clauses, which require any dispute arising from a transaction to be filed in a particular location and apply particular law.²⁴⁰ If the Constitution prohibits forum shopping, it presumably prohibits forum shopping no matter the context and whether both parties engage in it. Given that courts have continuously upheld forum selection and choice-of-law clauses,²⁴¹ it cannot be said that forum shopping is per se unconstitutional.

More fundamentally, it is important to remember that personal jurisdiction is rooted in due process. Those who argue that it is the Court’s, rather than Congress’s, job to curtail forum shopping assert that impartial judging is a core concept of due process and, as such, personal jurisdiction is the proper route to address these concerns. However, this argument fails to consider that corporations have the option to remove themselves from being subject to personal jurisdiction wherever they feel the judging would not be impartial.²⁴² Therefore, so long as the defendant purposefully availed itself of a forum, the defendant should be prepared to face a lawsuit in the forum, irrespective of whether the Constitution permits forum shopping.

Furthermore, should forum shopping cause such significant burdens, or should the public demand reform, Congress has authority to act. Congress’s venue statutes currently permit parties to transfer venues in cases of forum shopping,²⁴³ and Congress has permitted removal to federal courts specifically to address bias in state courts.²⁴⁴ In cases where the defendant is subject to personal jurisdiction in a forum, the defendant may, if it is more convenient for witnesses or collecting evidence, move to transfer the case to a different jurisdiction.²⁴⁵ Various articles have also proposed statutes that codify personal jurisdiction.²⁴⁶

Miscellaneous Considerations

There is merit to the argument that corporations should be permitted to organize their business strategically to avoid lawsuits in unfavorable locations. This is especially true in situations where the removal statutes do not permit a corporate defendant to remove a proceeding to federal court.²⁴⁷ The way the Court is heading comports with the notion of strategic business organization. Corporations may choose to engage in business in locations by considering whether the risk of liability is worth the profits of doing business in the forum. Just as corporations assess tax, employment law, and various other factors, so too should personal jurisdiction be another factor. This potential future course undoubtedly increases the fora where a business may be sued, and it may encourage states to pass laws that are more plaintiff friendly. The free market should correct any radical laws because corporations can choose whether to engage in commerce in a particular forum based on the laws that forum passes.

True, without Daimler, corporations that engage in internet sales would be subject to personal jurisdiction in many more locations than they otherwise would have been. But these corporations can decide as a matter of corporate policy not to sell to individuals located in a certain jurisdiction for lack of desire to be forced to defend a lawsuit there.²⁴⁸ To be clear, a corporation would not need to suspend access to its passive website in certain locations to avoid being subject to personal jurisdiction there. Making a website available solely for consumer browsing (not purchases) in a certain location would not constitute “purposeful availment” because the website would be passive in nature²⁴⁹ and the corporation’s contact with website visitors would be unilateral action on the part of the website browsers, which the Court has already ruled is insufficient to constitute “purposeful availment.”²⁵⁰ For similar reasons, a corporate defendant would not be subject to personal jurisdiction in a forum if, by the stream of commerce, one of its products makes its way into a state where the corporation does not “serve [the] market.”²⁵¹ Therefore, to avoid being subject to personal jurisdiction in certain locations, a corporation can decide not to cultivate a market in the locations where it wants to avoid defending lawsuits.

Another consideration is the discretionary nature of transfer of venue and choice of law. Review of personal jurisdiction is a matter of law that is conducted de novo.²⁵² By contrast, transfer of venue is discretionary and is conducted under an abuse of discretion standard.²⁵³ But appellate courts have not been shy to tell the district courts they have abused discretion when it comes to motions for transfer of venue. In other words, plaintiffs would be encouraged to forum shop and choose venues that are less willing to transfer cases out of their jurisdiction. While a valid concern, it is not one that should factor into a constitutional analysis of personal jurisdiction. Congress may feel compelled to alter the venue statutes. Even so, despite the discretionary nature of venue transfer, courts have not been afraid to reverse denials of transferring venue, even under the abuse of discretion standard.²⁵⁴

CONCLUSION

This Note began with an analogy to sports teams preferring to play in front of their home crowds. There is no question that teams have such a preference. But the defiance of this preference does not constitute a violation of rights. Surely the Los Angeles Lakers, because the team plays in the National Basketball Association, must play away from home across the nation, including in front of less-than-welcoming Boston fans when they face the Celtics.

When a corporation conducts business in a particular location, it avails itself of that location. Under the traditional corporate consent and estoppel model, the privilege of conducting business creates a reciprocal obligation on the corporation to subject itself to the jurisdiction of that location, irrespective of who sues it there.

While personal jurisdiction purports to assess whether a defendant should be forced to defend a lawsuit in a forum due to the defendant’s contacts with that forum, the doctrine has shifted to requiring the plaintiff to show a connection to the forum, even if the defendant has substantial contact with the forum. This Note has explained the history and development of personal jurisdiction doctrine and showed how the Court has narrowed where corporate defendants are “at home.” Consequently, the Court requires the plaintiff to comply with the nexus requirement when suing in locations besides the corporation’s “home.” In doing so, this Note revealed that the evaluation of personal jurisdiction doctrine is a diversion from the traditional corporate consent and estoppel model and is a result of the Court substituting its judgment for Congress’s regarding the need to curtail forum shopping. It offered a prediction of where the Court may be headed: toward an expansion of corporate personal jurisdiction—by ditching Daimler and nixing the nexus requirement.

96 S. Cal. L. Rev. 207

Download

Shauli Bar-On*

* University of Southern California Gould School of Law, 2023. B.A., University of Southern California, 2020.

Toward a New Fair Use Standard: Attributive Use and the Closing of Copyright’s Crediting Gap

April 2023April 2023John Tehranian*

A generation ago, Judge Pierre Leval published Toward a Fair Use Standard and forever changed copyright law. Leval advocated for the primacy of an implicit, but previously underappreciated, factor in the fair use calculus—transformative use. Courts quickly heeded this call, rendering the impact of Leval’s article nothing short of seismic. But for all of its merits, Leval’s article failed to acknowledge or consider the salience of another largely underrecognized and heretofore unnamed factor: attributive use.

This Article attempts to address this oversight, particularly when viewed in light of the current law of crediting in the twenty years since Dastar Corp. v. Twentieth Century Fox Film Corp., the Supreme Court’s decision to permanently foreclose the most common method by which creatives had previously vindicated their crediting interests—the Lanham Act’s prohibition on false designations of origin. After assessing the recent body of empirical work highlighting both the quantitative and qualitative importance of attribution to authors and the value of crediting to consumers, investors, and the broader public, the Article scrutinizes the current state of attribution rights to argue that, post-Dastar, the remaining legal mechanisms for securing crediting, including private contracting, have proven insufficient.

To address this crediting gap in the law, the Article considers, but rejects, calls to overturn Dastar or enact an independent general attribution right under the Copyright Act. Instead, I propose a more modest solution that needs no congressional action. Like transformative use, attribution promotes progress in the arts by motivating and incentivizing authorial production. Moreover, as this Article’s careful exegesis of the relevant case law demonstrates, issues of crediting have long shaped the contours of the fair use defense. As such, I advocate for the formal adoption of attributive use as an express consideration in the fair use calculus. The Article therefore builds on Leval’s influential work and calls for the formulation of a new fair use standard that more closely calibrates the defense with the utilitarian goals of our copyright regime.

INTRODUCTION

A. Pierre Leval’s Toward a Fair Use Standard and Copyright’s Crediting Gap

Thirty years ago, Pierre Leval penned¹ what would become one the most influential pieces of legal scholarship of the past generation.² As a federal judge who had then served for twelve years on the Southern District of New York, Leval crafted Toward a Fair Use Standard after he had watched two of his copyright decisions, including his finding that a biographer had made fair use of J.D. Salinger’s unpublished letters, eviscerated by the Second Circuit.³ In reflecting upon these repudiations, Leval critiqued his erstwhile approach to the fair use calculus as excessively ad hoc and sought, instead, to fashion a series of governing principles to guide application of the doctrine in future cases.⁴ In so doing, Leval scrutinized the metes and bounds of the copyright monopoly holistically and posited that infringement claims and fair use defenses both serve the overarching utilitarian goal of the copyright regime, which “stimulate[s] activity and progress in the arts for the intellectual enrichment of the public.”⁵ With the premise that fair use is not an exception to copyright protection but, rather, a part of the design of copyright law to encourage creativity, he then identified transformative, or productive, use—use that “employ[s] the [original] matter in a different manner or for a different purpose”⁶—as a driving⁷ concern⁸ in the calculus.⁹

guarantee success in claiming fair use. The transformative justification must overcome factors favoring the copyright owner. A biographer or critic of a writer may contend that unlimited quotation enriches the portrait or justifies the criticism. The creator of a derivative work based on the original creation of another may claim absolute entitlement because of the transformation. Nonetheless, extensive takings may impinge on creative incentives. And the secondary user’s claim under the first factor is weakened to the extent that her takings exceed the asserted justification. The justification will likely be outweighed if the takings are excessive and other factors favor the copyright owner.

Id. at 1111–12.

Toward a Fair Use Standard quickly precipitated a sea change in the way courts approached application of the fair use doctrine. Only a few years after its publication in the Harvard Law Review, the Supreme Court drew heavily on Leval’s article in famously holding that the transformative nature of 2 Live Crew’s unauthorized parody of Roy Orbison’s Pretty Woman insulated the song from infringement liability under the fair use doctrine.¹⁰ In the process, the Supreme Court elevated the standing of Leval’s work, enshrining it as a seminal tome on copyright law—one that took a rightful place right beside the actual text of section 107 of the Copyright Act in guiding fair use determinations. Toward a Fair Use Standard continues to enjoy a prized place in the copyright firmament. In 2021, in its first fair use pronouncement since Campbell v. Acuff-Rose, the Supreme Court liberally sprinkled citations to Leval’s article through the course of its opinion determining that Google’s exploitation of Oracle’s copyrighted Sun Java application programming interface (“API”) constituted fair use.¹¹ Transformation once again lies at the heart of the analysis, as the Court posited that Google’s actions helped “expand the use and usefulness of Android-based smartphones. . . . [by] creat[ing] a new platform that could be readily used by programmers” to develop new programs in the Android environment.¹²

It should come as no surprise then that, in the words of one observer, “Leval’s commentary on the centrality of transformativeness in interpreting fair use decisively changed the way the copyright doctrine was interpreted. He leveraged the forum successfully to accomplish what he had been unable to accomplish thereto in judicial decision-making.”¹³ This view is not merely anecdotal or impressionistic. The rapid rise of transformation as a crucial, if not decisive, factor in fair use decisions due to Leval’s article is nothing short of stunning. A recent empirical study determined that almost ninety percent of cases now ultimately turn, at least in part, on determinations of transformative use.¹⁴

Thus, with Toward a Fair Use Standard, Leval achieved what most authors of law review articles can only dream of. Of course, his deserved reputation as a thoughtful jurist no doubt assisted in propelling his proposal, and his article’s placement in the venerable Harvard Law Review did not hurt either. But, above all, his prescient thoughts on the limitations on copyright protection embodied in the fair use doctrine made eminent sense in any era when courts were just beginning to grapple with the digital implications of a Copyright Act written before the advent of the modern internet.

To be sure, Leval’s work is not without its critics—in industry, on the bench, and in the bar. These interventions have largely questioned the primacy that Leval’s article and interpreting courts have given to transformative use.¹⁵ Yet for all of its merit, Leval’s article wholly ignored one area of grave importance in both the utilitarian logic of copyright law and, implicitly, the extant jurisprudence on fair use: attribution. Crediting serves as a prime motivator for authorial production, goes to fundamental issues of equity in our copyright regime, and has enjoyed a tacit (but not entirely express) role within the fair use calculus. Nevertheless, it finds no place in Leval’s article, a fact that Leval’s critics have ignored as well. Indeed, even though Leval dedicated a portion of his article to pondering (and rejecting) the value of “other” fair use factors not expressly detailed in section 107’s text—including good faith, artistic integrity, and privacy—he never expressly discusses or even implicitly addresses the issue of crediting.¹⁶ In short, attribution appears to play no role in Leval’s analysis of fair use.

Of course, crediting was not Leval’s focus. Nevertheless, this Article attempts to address and assesses this oversight, particularly when read in light of the current law of crediting in the twenty years since the Supreme Court announced its decision in Dastar Corp. v. Twentieth Century Fox Film Corp.,¹⁷ which permanently foreclosed the most common method by which creatives had previously vindicated their crediting interests—the Lanham Act’s prohibition on false designations of origin. Specifically, this Article proposes to supplement Leval’s work—which lead to the formal adoption of transformative use as a critical part of the first factor in the fair use analysis—by advancing a proposal for the explicit introduction of attributive use¹⁸ to the fair use balancing test.

B. Giving Credit: Toward an Attributive Fair Use Standard

Give credit where credit is due. It is a principle widely embraced in our social norms.¹⁹ But like many of the things we learned in kindergarten, adherence to the precept is far from perfect. Moreover, while the exhortation remains a universal aspiration, it enjoys little legal bite. Indeed, the lack of development in the law of crediting is nothing short of surprising. As Jane Ginsburg has argued, “Of all the many counter-intuitive features of US copyright law—and they abound—the lack of an attribution right may present the greatest gap between perceived justice and reality.”²⁰

If the Copyright Act and our broader intellectual property regime seeks to serve its constitutionally mandated purpose—to promote progress in the arts—by incentivizing the creation of works of authorship, it should ideally respond to what actually motivates creators. To be sure, the exclusive rights of reproduction, distribution, public performance, public display, and derivatization secured for authors under section 106 of the Copyright Act appeal to authorial incentives in at least two ways. First, they serve utilitarian interests by providing monetary rewards to creators by necessitating licenses for the exploitation of their work. Second, they promote natural law and the dignitary interests of authors by enabling them to decide whether (and under what terms) their works are made available to the public at all.²¹ But control over reproduction, distribution, public performance, public display, and derivatization are not the only rights that galvanize creators. Specifically, authors gain value—both monetary and otherwise—through other mechanisms. For example, building one’s brand and reputation for creative excellence—achieved only through attribution—is a powerful means toward earning long-term economic rewards and satisfying the dignitary interests that can also motivate authors. As a result, traditional copyright enforcement is not necessarily profit maximizing for creators, and there is often a disconnect between how creators feel about the unauthorized exploitation of their work and how distributors/publishers might feel about it. Meanwhile, authors who may value attribution over enforcement of their copyrights are not necessarily immune to the temptations of the marketplace or more noble than the rest of us. To be sure, some authors may create solely to fulfill their own needs or to edify, amuse, or impact others, and they may merely seek recognition rather than profit. But there is also a monetary component to proper attribution. In the long run, attribution promotes one’s name and its standing, a phenomenon that eggs on economic demand in a variety of forms—whether it is for further creative production, appearances, endorsements, or ancillary activities. Indeed, attribution is part and parcel of the economic equation of copyright and its incentive structure.

With all of this in mind, if our intellectual property regime serves to encourage progress in the arts by motivating and incentivizing authors, the absence of attribution rights would appear to leave the regime wanting. The Lanham Act, which for a time served as a powerful vehicle for protecting attribution rights, can no longer do so as a result of the Supreme Court’s decision in Dastar two decades ago. Meanwhile, for a variety of reasons we shall explore, alternative legal theories for protection have proven inadequate to provide for general crediting rights. Thus, current law provides little protection for crediting. And it is this crediting gap—what it means, how it came into existence, and how it might be solved—that is the focus of this Article.

Before proceeding further, two important caveats bear mentioning. As a preliminary manner, it is important to lay out what this Article means when it talks about giving credit. Put in the traditional parlance of moral rights, crediting issues can take two general forms. First, there is the positive right of attribution—the ability to have one’s name associated with one’s work. Second, there is the negative right of attribution—the ability to prevent having the work of another falsely attributed to you.²² The former right is about giving credit when credit is due and forms the subject matter of this Article. The latter phenomenon, while important and rife for further analysis, is about misattribution and therefore falls outside of the scope of this analysis.

In addition, as its title suggests, this Article seeks to directly build on Pierre Leval’s influential article. Indeed, it is no less ambitious than Leval’s piece and aims to highlight the fact that considerations of attributive use already permeate (with good reason) the jurisprudence applying and interpreting section 107 of the Copyright Act and seeks to alter the way that courts formally frame the fair use calculus going forward. At the same time, however, the author also understands that he is no Pierre Leval and, as detailed infra,²³ is prone to delusions.

With these caveats in mind, the Article’s analysis begins by scrutinizing the value of crediting. Rather than resting on the mere intuition that attribution matters, Part I delves into both the quantitative and qualitative literature on crediting to determine just how and to what extent crediting fuels authorial motivations and serves broader societal interests. We start with an anecdote to illustrate how authorial reactions to infringement both with and without attribution can differ radically. In the process, we identify and critique the peculiar disconnect between our current legal regime—which fetishizes protection against infringement over failure of attribution—and the economic and dignitary interests of at least a sizeable percentage of creators. Next, we examine the burgeoning scholarship in law, economics, psychology, and organizational behavior to assess and interrogate the value of attribution to creators. As we see, a growing body of empirical work supports the intuition that crediting matters—a lot—and, in fact, authors are often willing to forgo substantial amounts of compensation in return for securing attribution. As such, crediting can and does play a primary role in motivating authorial production. At the same time, a fulsome attribution regime would not merely serve authorial interests. As I argue, it would also inure to the benefit of consumers, investors, and society at large by promoting the efficiency of resource allocation in intellectual property-driven fields (thereby benefiting investor welfare and broader economic interests in optimized markets operating with superior information), reducing consumer search costs, advancing the organizational integrity and coherence of literary and artistic endeavors, and even enhancing public support for the protection of intangible rights such as copyright by bringing the legal regime governing creative works in greater harmony with norms of equity and by humanizing creativity-driven products.

Having established the value of crediting to both authors and the public, this Article turn its attention to assessing the current state of attribution law. Part II therefore begins by exploring the rationale and implications of the Dastar holding and detailing the ways in which the decision effectively ended the ability of creators to bring attribution-related reverse passing off claims under the Lanham Act. Next, we identify the crediting gap left in Dastar’s wake by examining what alternative theories of liability remain to vindicate crediting interests post-Dastar and how said theories have fared in the intervening two decades. In the process, we scrutinize the extant jurisprudence on false advertising claims under the Lanham Act,²⁴ attribution claims under the Visual Artists Rights Act (“VARA”),²⁵ falsification and removal/alteration of Copyright Management Information (“CMI”) claims,²⁶ state unfair competition law, and private contracting. As this Article’s analysis suggests, these theories are insufficient to protect the crediting rights of the vast majority of creators. False advertising claims can, at best, only provide relief to famous authors and only in circumstances of material reliance by consumers in purchasing decisions. VARA claims suffer from myriad subject matter constraints that makes the protections available to only a small corner of the creative universe (certain types of visual art works that are not works made for hire, only in originals or prints of two hundred or less, potentially only in digital form). Claims pertaining to falsification, removal, or alteration of CMI have a high double-scienter requirement that has made relief unlikely. Meanwhile, the statutory scheme regarding CMI primarily serves the goal of infringement prevention rather than the protection of any independent interests that authors may have in crediting. Finally, state unfair competition laws have suffered either from federal preemption under Dastar or from the fact that they are viewed as coterminus with Lanham Act protections.

In the end, therefore, we are left only with private contracting for relief. And while a few notable industries (such as Hollywood and academia) have implemented meaningful attribution regimes, private ordering suffers from the leverage and bargaining disparities inherent to contractual solutions. Indeed, as we demonstrate, the history of private crediting systems is riddled with instances where power dynamics trump actual origination.²⁷ Drawing on several notable examples where crediting abuses fell on racial, gender-based, and socioeconomic fault lines, I argue that continued reliance on such systems could have particularly deleterious implications for social justice issues in intellectual property. In short, therefore, there exists a sizeable crediting gap—a vast disconnect between the high value of attribution to authors and the public and the low value given to it by way of legal protection. Moreover, without the legal vesting of more stout attribution entitlements, ongoing reliance on the pure operation of the marketplace for crediting determinations could continue to have vexing consequences.

Part III considers potential reform to the current state of affairs. Although I caution that social value should not always translate into legal mandate and that good norms do not always make good law, the particular inadequacies of the extant crediting regime and the social and economic (rather than private or familial) interests at play warrant examination of potential legal solutions. To that end, I evaluate but reject two of the most significant mechanisms for change: reversal of the Dastar holding by legislation amending the Lanham Act and passage of an affirmative cause of action under the Copyright Act to provide for crediting rights. As I argue, despite its shortcomings, Dastar revealed the poor fit that the Lanham Act—with its focus on consumer confusion—ultimately provided for attribution protection. Moreover, even if an amendment to the Lanham Act were limited to situations involving works still under copyright protection (so as to avoid the issue of erstwhile rightsholders with expired copyrights attempting to extend their monopoly over creative works through trademark law), it would still raise significant concerns about the potentially onerous scope of crediting requirements and the fine line between providing proper attribution and triggering false endorsement claims. Meanwhile, amendment of the Copyright Act to provide for an affirmative crediting claim has its own shortcomings. In particular, this Article examines the way in which the allocation of a formal crediting entitlement could stifle licensing efforts in the marketplace, a result exacerbated by the combined impact of the endowment and creative effects—behavioral phenomena that have caused economists to question traditional neoclassical assumptions in entitlement allocations in recent years. Finally, as a practical matter, I also observe the particular difficulties in pursuing a legislative change.

Instead, I propose a more modest solution, and one that I argue is already an implicit part of the existing jurisprudence: formal accounting of attributive use as a part of fair use calculus. In conducting a careful exegesis of the extant case law on fair use, I argue that courts have often woven consideration of attributive use into the first, fourth, and “fifth” factors—a move that the important Second and Ninth Circuits have blessed.²⁸ Further, I argue that the implementation of an attributive use subfactor makes doctrinal sense given both the utilitarian and equitable functions of the fair use doctrine and that such a consideration is strongly supported by our existing copyright clearance norms. Thus, just as Pierre Leval identified transformative use as a critical but underappreciated consideration in the fair use calculus, I make a similar argument with respect to attributive use. In the process, I call for crediting to take its place alongside commercial and transformative considerations in courts’ assessment of the first fair use factor—the purpose and character of use. In this way, I advance an incremental, but important, step toward recognition of the value of crediting while also avoiding some of the broader concerns that a general right of attribution—whether achieved under the Lanham Act or the Copyright Act—might present.

I. WHY CREDITING MATTERS

A. One Author, Two Moments

I begin my assessment of the importance of attribution by reflecting upon the perspective of one author—this author—on what motivates creative enterprise. Such a focus is admittedly biased and may be completely unrepresentative. But it illustrates how the current state of affairs in copyright law—where infringement receives stiff punishment but failure to credit receives none—can be inadequate to protect the motivating interests of at least some creatives. Specifically, two recent incidents involving the use of my published work provoked strong, but diametrically opposing, reactions within me. And while I do not claim that my attitude toward these events reflects on how typical authors might respond, my contemplations are nonetheless instructive as to how some authors might experience issues related to infringement and crediting.

Not long ago, while doing some research on the dirty underbelly of the piratical dark web, I came across a site that resembled a veritable Library of Babel,²⁹ providing free access to a remarkable collection of digital books to all comers. While publishers would not hesitate to characterize this “celestial jukebox”³⁰ of books as a cesspool of wanton infringement, I felt compelled, as a good academic, to investigate further before drawing any definitive conclusions. So, in an act of curiosity and thorough vanity, I punched my own name into the site’s search box. To my surprise, a beautiful e-book edition of one of my tomes popped up, available freely to all who had interest in it. I can neither confirm nor deny that I immediately downloaded a copy, but some context might help explain why I may have made a decision to do so.

A number of years earlier, I had posed what I thought was an innocent request of the publisher of one of my forthcoming books: I sought a final PDF copy of the work for my records and personal use. The response was rapid and reproachful. “We don’t do that, John.” The implication was clear: they had to protect against piracy, even if it meant denying authors copies of their book in digital format. Instead, they offered me a compromise: the first three chapters. Since they made it clear this was not a negotiation, I took what they gave me. Fast forward a decade and it should be easy to understand why I may have been elated when this website offered me what my own publisher had denied me: a final, electronic form of my book without any encryption or digital rights management (“DRM”) associated with it.

My book’s appearance on the website also pleased me for an entirely different and more fundamental reason. As a delusional academic, I dream of my ideas getting attention, impacting the way people might approach or think about an issue and, ultimately, influencing policy. So I naturally fantasized about individuals (at least one or two!) potentially stumbling on this website, finding my book, and then reading it when they otherwise may have never known about my work or ponied up the cash necessary to buy it. The royalties I see from my writing are trivial and economically irrelevant. Instead, I want my books to reach as many people as possible (that is, more than my mother) and I want to maximize their exposure. Whether that is accomplished through sales or piracy, I care not. After all, as science fiction writer and librarian Eric Flint once put it, “The real enemy of authors— especially midlist writers—is not piracy . . . It’s obscurity.”³¹

So far from making me angry, the discovery of my book pirated online for anyone to read resulted in nothing but sheer delight. Yes, this was infringement of my copyright, pure and simple. But I was all smiles.

Around the same time, I had another experience related to one of my publications—but, this time, what occurred was far less welcome. One day, I received an email from a local law school about an upcoming distinguished lecture. While I might usually give only fleeting attention to such a notification, this one caught my eye because of the topic, which just so happened to be the exact subject matter of my first book, which I wrote in 2008. So I naturally took an interest and thought about attending. But as I read further, my curiosity turned to disappointment. The talk was about a recently released book whose description was, almost word for word, based on the book I had written. And then I saw the name of the author, which was one I recognized.

I had met the author a decade ago when she was a graduate student attending a talk I had given about my own book. I distinctly remember her chatting with me after my lecture and expressing how much she had enjoyed and appreciated my book. It turns out those comments weren’t mere puffery. She had proven that by writing her own version. As I read the full description of her lecture and then found her recently released book, I was struck by how the summation literally encapsulated my own book in its entirety.

Imitation is the sincerest form of flattery, I told myself in a failed attempt to downplay the anger I felt. But more than flattery, I wanted acknowledgement. Of course, no one’s work is wholly original. But her book came uncomfortably close to mimicry of mine. It was not just drawing on or borrowing ideas to build and expand on my book; it was literally taking my entire work and rehashing it as purportedly original material. Specifically, and most egregiously of all, the use of my work was wholly without proper credit. And, to add insult to injury, as the email before me indicated, she was now giving a distinguished lecture at a local university that had never shown the least bit of interest in my work.

Even if I were inclined to pursue some kind of legal remedy against her, there was none readily forthcoming. Because of the nature of the use, an infringement claim would be difficult to make. Meanwhile, current law provides no general right of crediting or attribution. Admittedly, I did have a form of extralegal relief; if I wanted to pursue the matter, university policies against plagiarism and the failure to properly credit sources offered some remedies. Certainly, I could have notified the author’s publisher and her university-employer to trigger potential investigations. But, at the end of the day, such an effort would be purely punitive and would not undo the real damage I had already suffered; the book, after all, was already published. So, in the end, I concluded that, instead of going through the pain of a vindictive letter-writing campaign that would only waste my own time, I would work out my issues far more constructively: by writing a law review article.

These two incidents—close in time—provided a remarkable study in contrasts.³² In the first matter, I encountered the wholesale piracy of my work, and I found myself not merely indifferent but hopeful. After all, I had received credit for my work and the work’s unauthorized distribution helped disseminate my ideas more broadly. I would take whatever boost I could get. The website in question had undoubtedly infringed my work, but I was perfectly content to let that happen. In the second instance, while it was arguable whether the subsequent author had infringed my work, she had indisputably failed to give me proper credit for my work—upon which she had indisputably and heavily drawn—and had, in my view, violated basic norms of attribution. I was disturbed and troubled by what had happened.

While I found the second incident far more offensive than the first, the law saw things differently. I possessed a colorable claim for infringement if I were inclined to fight the piracy of my book. By contrast, I had little hope of a legal remedy for my fellow academic’s abysmal failure to provide me appropriate attribution. In short, our copyright regime provided no shortage of remedies for an injury that I cared little about—infringement. By sharp contrast, it provided no remedy for something that more directly motivates my production of content—crediting and recognition.

As a result of these two incidents, I began to wonder whether I was the kind of author the Copyright Act wanted to encourage in the first place.³³ As a writer, I meet the definition of what the Framers referred to as “authors” in the Intellectual Property Clause of the Constitution, and my work comes under the subject matter of the Copyright Act. But I am also not the kind of author who makes a living (or even seeks to make a living) on the sale of my works. Consequently, my incentives might be quite different from someone whose income solely or largely comes from authoring works—the kind of author who might care substantially more about piracy.

That said, however, the vast majority of authors make little to no money from their work. Some, of course, may still pursue the craft for (in part) future potential riches. But, for many, remuneration is far lower on the list of their motivations than other factors, such as attribution and recognition. As Laura Heymann points out,

[F]or many creators, particularly individual creators, the profit motivation is not paramount. Rather, the creator is motivated most by the public knowledge that she is the creator—by attribution of the work to her. Indeed, as others have noted, such creators value wide dissemination of their work over compensation, and so benefit from the fair use doctrine and, even, the movement of their work to the public domain, both of which ensure that their work reaches as large an audience as possible.³⁴

As my reaction to the two incidents indicates, I belonged to this class of authors. And, by failing to reflect the importance of attribution and recognition as a motivating factor in the production of creative content, it appeared that the existing copyright regime did not know members of this class very well and did not appear fully responsive to their incentives and needs. For a utilitarian regime dedicated to progress in the arts, this curious result begs further investigation.

B. The Empirics of Attribution

There is no doubt that our moral sensibilities strongly support the practice of proper attribution, and common sense tells us that authors value crediting and recognition as well. As Heymann has posited, “[I]t seems safe to conclude that the two things that virtually all creators desire is to receive credit when appropriate and to eliminate the suggestion of association when it is not.”³⁵ But before taking these assumptions to heart based on mere intuition, it is worth scrutinizing them more closely. While the value of attribution has traditionally received scant attention in the academic literature and little empirical testing, all of that has changed in recent years as an emerging body of data and experimental work has provided overwhelming support for the notion that attribution serves a vital role in motivating and incentivizing creatives.³⁶

One of the largest innovations and behavioral experiments to ever take place in the creative world occurred with the launch of the Creative Commons some twenty years ago. Founded by law professor Larry Lessig, computer scientist Hal Abelson, and literary advocate Eric Eldred, the Creative Commons sought to give creators the ability to opt out of the protection-heavy default rules of copyright, which automatically vest in authors³⁷ the exclusive right to control reproduction, distribution, public display, public performance, and derivatization of their works³⁸ for a period of their lifetime plus seventy years after their death.³⁹ Such rights spring into existence for all original works of authorship fixed in a tangible medium, regardless of formalities. In subverting these default protections and the “permission culture”⁴⁰ that they serve and support, Creative Commons allowed authors to make their works available to the public to promote educational access and spur further creativity by increasing the pool of works from which others can freely build without the need for costly licenses. By ceding their works to the Creative Commons, creators opt into a different regime, where all rights are not reserved. Thus, under various Creative Commons licenses, they can make work available for use without payment for noncommercial purposes—to create new derivative works or for any purpose whatsoever.

The notable success of the Creative Commons and the particular manner in which it has operated illustrates two important points. First, millions of creators have deeded hundreds of millions of creative works to the Creative Commons.⁴¹ As Eric E. Johnson has put it, this fact illustrates “the contemporary existence of an attitude held by at least a significant number of people that the full panoply of copyright entitlements is not important to them.”⁴² Second, while many, but not all, authors want to stop infringement of their works, virtually all authors want attribution and the operation of the Creative Commons provides empirical support for this view. As the data collected over the past twenty years show, authors putting their work on the Creative Commons almost always choose to condition any use on one requirement: proper attribution.⁴³ For at least a certain set of creators, therefore, the right of attribution trumps the right of exploitation and the ability to receive license fees from the use of one’s works.

Even aside from the Creative Commons, the widespread sharing, rather than exclusive reservation, of intellectual property rights in many sectors illustrates the strong incentive social validation can play in promoting creative enterprise. Though long underappreciated in the intellectual property literature,⁴⁴ this widespread “non-market form of exchange,” characterized by sharing, is particularly attractive for an enormous body of works that may not enjoy clear-cut commercial profitability but are also not entirely valueless.⁴⁵ In these sharing regimes, such as open-source software licensing pools and microstock photography collections, pecuniary gain is largely forgone but, quite notably, attribution is retained and reputational satisfaction constitutes a key part of the value proposition for creators, as they derive “a feeling of satisfaction and a sense of social connectedness out of sharing.”⁴⁶

The thriving of the “sharing” economy and of the Creative Commons—where millions of creators are eager to opt out of the default protections of the Copyright Act, but only as long as they continue to receive recognition for their creative efforts—should come as no surprise. Indeed, recent experimental work has validated the intuition and experience that suggests that creators place significant weight on crediting and recognition. Notably, researchers Christopher Sprigman, Christopher Buccafusco, and Zachary Burns have conducted a series of empirical tests in mimic conditions of real-world bargaining meant to put a tangible monetary value on attribution rights. In the first experiment, they found that 180 casual photographers were, in the aggregate, willing to receive far less payment for publication of their work when it came with, rather than without, attribution.⁴⁷ These findings were even more pronounced in their second experiment, which involved professional and advanced amateur photographers.⁴⁸ In short, these tests produced robust results, leading Sprigman, Buccafusco, and Burns to conclude that, on average, creators actually value attribution and the receipt of recognition for their work more than getting paid and that they are “willing to sacrifice financial benefits to obtain [attribution].”⁴⁹

Beyond the valuable case study provided by Creative Commons and the experimental evidence that has quantitatively established the worth of crediting to authors, important qualitative ethnographic and observational work has also supported the stock that creators put in attribution. For example, in her comprehensive qualitative study of innovation, in which she conducted dozens of interviews with creatives and intellectual property professionals across a wide variety of industries,⁵⁰ Jessica Silbey concluded that attribution serves as a primary motivator for creative enterprise.⁵¹ As she notes, “[T]he interviews are replete with expressions of how attribution and integrity are crucial to the work’s optimal promotion and dissemination, whether or not for profit, because they safeguard and manage the development of professional identity and audience.”⁵² Similarly, in her sweeping survey on the legal and normative standards of attribution across a wide range of industries—including Hollywood, journalism, political speechwriting, software, advertising, graphic design, science, and medicine—Catherine Fisk has also documented the significant value creators place on crediting.⁵³ As she puts it, “Attribution is foundational to the modern economy” and, as such, “greater legal recognition of attribution rights is desirable.”⁵⁴

Finally, recent literature in the field of organizational behavior⁵⁵ and psychology⁵⁶ has emphasized the crucial role of crediting in nurturing innovation and promoting perceptions of fairness in creative environments. For example, Teresa Amabile, a leading theorist on creativity and innovation, has highlighted how proper credit allocation can motivate employees to work harder and enhance productivity.⁵⁷ In short, a burgeoning body of work in the social sciences has strongly supported the intuition that crediting matters—a lot.

C. The Societal Value of Attribution

But in focusing largely on the impact of attribution on creatives—both in the way that crediting incentivizes innovation and how it serves the dignitary interests of authors—this emerging literature has actually understated the case of attribution rights. Quite critically, attribution does not merely serve authorial interests. Rather, it also benefits other players in the marketplace for creative works and advances broader societal interests.⁵⁸

First, crediting advances the efficacy of the marketplace, particularly in an information economy dominated by the production of intellectual property. Generally speaking, free and open exchange of relevant information facilitates the optimal functioning of markets by improving the efficiency of allocation decisions. Information about inputs, such as labor, guides the dedication of scarce resources. Crediting provides actionable data about labor involved in the production of intellectual property—data that are often onerous to divine elsewhere or without substantial additional cost. Indeed, “because it is difficult to measure worker knowledge directly in the way that the ability of the typists and machinists of the industrial economy could be tested simply by watching them perform a task,”⁵⁹ credit is particularly valuable in an information economy. A reliable, accurate, and comprehensive crediting regime can therefore dramatically advance interests in the efficient allocation of resources in creative enterprises. Crediting, after all, provides vital information to financiers of those enterprises about the nature and quality of a particular author’s work.⁶⁰

Secondly, crediting advances the interests of those who consume creative works and other forms of intellectual property. Authorship represents a form of branding akin to trademark, and accurate authorship labeling helps promote many of the basic goals of the trademark regime, which serves consumers (and not just authors) by “reduc[ing] the customer’s costs of shopping and making purchasing decisions” and “help[ing] assure a producer that it (and not an imitating competitor) will reap the financial, reputation-related rewards associated with a desirable product.”⁶¹ Heymann has highlighted the value that a meaningful crediting regime provides to even nonauthors. Instead of calling for an attribution right that recognizes an inherent moral right authors might have in proper attribution, she calls for what she dubs “authornymic” attribution, the recognition of crediting for the sake of “organizational integrity”—a “reader-centered” law that ensures that “reader responses [to creative works] will be informed and minimizes the likelihood of confusion a consumer of creative commodities might otherwise experience.”⁶² In this way, she argues, a law of attribution is vital to supporting “efficient literary consumers” who can have “some confidence that the works that we read—and later draw on for our own creative activity—are situated within a coherent literary structure.”⁶³

Thirdly, an attribution regime can also promote public respect for intellectual property law. It does so in at least two different ways. As Stephanie Plamondon Bair argues in her study of the role of fairness in copyright law, when we align copyright law more closely with public perceptions of equity, we heighten the regime’s legitimacy in the eyes of society.⁶⁴ Given the strong popular support for the norm of attribution, a copyright system that protects crediting rights bolsters respect for the regime itself. Separately, the act of putting a real face (or, at least name) behind creative works humanizes them and can help buttress support for the intellectual property rights that protect them. As Catherine Fisk explains, such a task is particularly important “[i]n a world of corporate production, and in particular skepticism about corporate production.”⁶⁵ After all, it is no secret why corporate interest groups bring relatable artists to the forefront when making pitches for greater protection and rights enforcement, especially in the war on piracy—even when those artists are not the real rightsholders.⁶⁶ When the music industry sought to apply pressure to Google to provide more favorable use fees for the exploitation of music on YouTube, it had the likes of Taylor Swift and U2 sign off to an open letter that was used to drum up public support for the cause and to lobby Congress for reform.⁶⁷

[The DMCA] was written and passed in an era that is technologically out-of-date compared to the era in which we live. It has allowed major tech companies to grow and generate huge profits by creating ease of use for consumers to carry out almost every recorded song in history in their pocket via a smartphone, while songwriters’ and artists’ earnings continue to diminish.

Anthony Ha, Taylor Swift and Other Big Names Join the Music Industry’s Campaign Against YouTube, TechCrunch (June 20, 2016, 2:33 PM), https://techcrunch.com/2016/06/20/taylor-swift-dmca-letter [https://perma.cc/JSU6-9NUG] (quoting the letter). And when the Motion Picture Association (“MPA”) sought an alternative to an unpopular litigation campaign against piracy, it put together testimonial advertisements that highlighted the ways in which piracy hurt those people whom we only know as lines at the end of the credit roll.⁶⁸ Thus, attribution promotes the very operation of the intellectual property regime by giving it a human face that legitimizes the sometimes impersonal and intangible rules it enforces. In an era where digital technology has made mass piracy on a global scale all too easy, this function is perhaps of greater value now than ever before.

All told, therefore, our common sense tells us that crediting is deeply important to authors, a position backed by the emerging social science literature on the subject. Meanwhile, a proper attribution regime also has critical benefits to the efficient functioning of the marketplace for creative works and thus has strong benefits for consumers and investors as well. Despite all of this, however, as we have alluded to, the law provides shockingly little protection for crediting rights. This state of affairs that has grown particularly dim in the past two decades in the wake of the Supreme Court’s decision in Dastar, a subject to which we now turn.

II. THE LAW’S SIZEABLE CREDITING GAP

A. Dastar and the Decline of Crediting Law

Although we have established the important value of attribution—to creators, investors, and the public as a whole—we are left with a strange conundrum: the law of crediting is surprisingly thin and underdeveloped. Indeed, it is counterintuitively so, as the wholesale absence of any broad law of attribution runs counter to the assumptions of many in the creative community. As Silbey reported for her survey of artists and authors, “Many interviewees were stunned to learn that copyright law does not require attribution or prohibit misattribution.”⁶⁹

That said, for a period of time in the recent past, rightsholders enjoyed one particular means of crediting protection: a direct vehicle for legal redress when their creative works were being used by others without proper attribution. Specifically, a line of case law had emerged that considered improper crediting of someone else’s work as one’s own to constitute a “false designation of origin . . . or false or misleading representation” actionable under section 43(a) of the Lanham Act.⁷⁰ In these cases, courts found that, in the words of Thomas McCarthy, the Lanham Act “has progressed far beyond the old concept of fraudulent passing off, to encompass any form of competition or selling which contravenes society’s current concepts of ‘fairness.’ ”⁷¹ Such a capacious reading of the Lanham Act allowed for recognition of a cause of action for reverse passing off—when someone passes off the goods or services of another as their own—and, therefore, provided a viable claim for their failure to provide credit.

In 1981, this reading of the Lanham Act received the blessing of the Ninth Circuit for the first time, a move that propelled it to widespread acceptance. In Smith v. Montoro, the Ninth Circuit held that the failure to credit an actor for his role in the movie Convoy Buddies (and, in fact, the substitution of his name with that of another actor in both the film credits and advertising material), constituted reverse passing off under section 43(a).⁷² As a matter of public policy, the court opined, such conduct was “wrongful” in that “the originator of the misidentified product is involuntarily deprived of the advertising value of its name and of the goodwill that otherwise would stem from public knowledge of the true source of the satisfactory product.”⁷³

[B]ig box office names are built, in part, through being prominently featured in popular films and by receiving appropriate recognition in film credits and advertising. Since actors’ fees for pictures, and indeed, their ability to get any work at all, is often based on the drawing power their name may be expected to have at the box office, being accurately credited for films in which they have played would seem to be of critical importance in enabling actors to sell their “services,” i.e., their performances.

Id. The Montoro decision proved widely influential, and within a few short years, federal courts throughout the country were entertaining attribution-related claims under section 43(a) for reverse passing off.⁷⁴ But all of that changed in 2003 when the Supreme Court announced its decision in Dastar.⁷⁵

It was in the shadow of Montoro and its progeny that the Dastar controversy began. To commemorate the fiftieth anniversary of the ending of World War II, Dastar Corporation had decided to put out a new video set titled World War II Campaigns in Europe. The collection made extensive, but unauthorized, use of a television series based on President Dwight Eisenhower’s book, Crusade in Europe.⁷⁶ Twentieth Century Fox had owned the copyrights to this program until it had inadvertently forgotten to renew them and the show fell into the public domain.⁷⁷ As a result, Fox could not sue Dastar for infringement of its Crusade in Europe television series to prevent publication and distribution of World War II Campaigns in Europe.⁷⁸ So, like many other entities who have lost their erstwhile rights in one of our intellectual property regimes, Fox turned to a neighboring intellectual property regime upon which to rest its claims.⁷⁹ It made a Lanham Act claim instead.

The procedural posture of the case was unusual and suggested something significant was afoot by the time it got to the Supreme Court. Both the district court and Ninth Circuit upheld the reverse passing off claim. Indeed, the Ninth Circuit thought so little of the issue’s weight overall significance that the decision was unpublished. The Supreme Court, of course, typically grants certiorari to only a tiny fraction of cases; so, it certainly raised eyebrows when the Supreme Court granted certiorari to a seemingly routine and mundane decision that the Ninth Circuit did not even bother to designate for publication.⁸⁰ The action presaged the Court’s view that the unpublished decision from the Ninth Circuit missed something fundamental and significant, about which the Court appeared ready to opine.

In its decision, the Supreme Court unanimously reversed the Ninth Circuit and rejected Fox’s attempt to use a Lanham Act claim for false designation of origin as a means of preventing Dastar’s reproduction of an audiovisual work (to which Fox had previously owned the copyright) that had fallen into the public domain.⁸¹ In so holding, the Court warned against the risk of creating a “species of mutant” intellectual property protection that would impede the public’s right to make unfettered use of creative works that no longer enjoy copyright protection.⁸²

As the old saw goes, hard facts make bad law. Fox’s gambit to eschew the “limited times” requirement in copyright law by ginning up trademark claims against Dastar struck a nerve with the Court, and the case came before it at a particularly opportune time (as far as Dastar was concerned). As Justin Hughes points out, the close proximity of the Dastar decision to the holding in Eldred v. Ashcroft⁸³ suggests that that the former may have intentionally served as a “2003 Term counterweight” to the latter,⁸⁴ which rejected concerns about the public domain in declining to find a twenty-year extension of copyright terms unconstitutional.⁸⁵ Indeed, concerns about aggrieved former rightsholders, like Fox, attempting to circumvent copyright’s careful calibrated balance between private protection and public access expressly animated the Dastar decision. Specifically, the Court sought to thwart future efforts by lapsed copyright holders to make disingenuous use of trademark law to assert monopolistic control over the exploitation of works that had fallen into the public domain, in contravention of the very intent of the copyright regime and its (constitutionally mandated) policy of allowing ownership over creative works to eventually expire so that the public may make free use of them.⁸⁶

Thus, under Dastar, the Supreme Court found that reference to “origins of goods” in the Lanham Act could not be read to mean the authorial origins of a work; instead, it referred only to the physical source of the embodiment of that work in tangible products.⁸⁷ As the Court rationalized, the reference to “origin of goods” in section 43(a) was “incapable of connoting the person or entity that originated the ideas or communications that ‘goods’ embody or contain. Such an extension would not only stretch the text, but it would be out of accord with the history and purpose of the Lanham Act and inconsistent with precedent.”⁸⁸ So, even if Dastar had failed to give proper credit to the intellectual source(s) of the materials contained in its video collection, this did not, and could not, constitute a violation under the Lanham Act. All that mattered for the purposes of the section 43(a) was that there was not false designation of the origin of the actual physical video collection. Since Dastar literally published and distributed the video collection, self-attribution was entirely proper as far as the Lanham Act was concerned. As such, Fox had no actionable claim for false designation of origin.

At the same time, however, the Court’s holding reached broader than necessary to achieve the laudable goal of protecting the public domain. By grounding its ruling in a reading of the Lanham Act that definitively excluded the intellectual wellspring of a product from the meaning of “origin,” the Court precluded attribution claims under section 43(a) for all creative works. Thus, in the past two decades, courts have generally rejected all such claims, whether they apply to public domain works (as in Dastar) or works still under copyright protection (unlike Dastar).⁸⁹ In the process, therefore, Dastar eliminated relief for those seeking remediation of a harm quite distinct from unlawful reproduction, distribution, display, or performance of a creative work: the act of not giving credit to its original author. In one fell swoop, “the Court swept away close to twenty-five years of precedent that held that failure to give credit to an entertainment product such as a film or song, or providing misleading credit, was a violation of trademark law.”⁹⁰

In part, two other concerns can explain and warrant broader application of the holding to all creative works, not just ones in the public domain. First, the Court noted the difficult position that an attribution-related reverse passing off claim could put manufacturers of products containing creative works. “On the one hand,” notes the Court, “they would face Lanham Act liability for failing to credit the creator of a work on which their lawful copies are based; and on the other hand they could face Lanham Act liability for crediting the creator if that should be regarded as implying the creator’s ‘sponsorship or approval’ of the copy.”⁹¹ In other words, if Dastar had put out its video set and kept the original credits to Fox, Fox could have sued Dastar for violating the Lanham Act for direct passing off by suggesting that Fox sponsored or approved Dastar’s product. Meanwhile, because it had removed Fox’s name, Dastar now faced a claim for failure to attribute under a theory of reverse passing off. If the Court had affirmed the availability of an attribution-related passing off claims, the resulting quagmire could stifle the use of works—both those in the public domain (for which no licensing is required) and for those still under copyright protection (when lawful copyright clearance might leave a licensee subject to exposure for a Lanham Act violation).

Second, the Court raised its concern that an attribution requirement could leave distributors of copyright content with a duty to credit that might grow impossibly burdensome and impractical. The opinion put a fine point on the scope of crediting that a broader reading of section 43(a)’s “designation of origin” reference would compel by assessing the type of attribution that might be required to distribute the film Carmen Jones. As the Court posited, to avoid liability for reversing passing off under Montoroand its progeny, a distributor might have to give attribution “not just to MGM, but to Oscar Hammerstein II (who wrote the musical on which the film was based), to Georges Bizet (who wrote the opera on which the musical was based), and to Prosper Mérimée (who wrote the novel on which the opera was based).”⁹² Determining origin could amount to a complicated task. To illustrate this point, the Court turned no further than the case at hand, opining that

[w]hile Fox might have a claim to being in the line of origin, its involvement with the creation of the television series was limited at best. Time, Inc., was the principal, if not the exclusive, creator, albeit under arrangement with Fox. And of course it was neither Fox nor Time, Inc., that shot the film used in the Crusade television series. Rather, that footage came from the United States Army, Navy, and Coast Guard, the British Ministry of Information and War Office, the National Film Board of Canada, and unidentified ‘Newsreel Pool Cameramen.’ If anyone has a claim to being the original creator of the material used in both the Crusade television series and the Campaigns videotapes, it would be those groups, rather than Fox.⁹³

Interestingly, the Court’s language on this issue referred only to the context of uncopyrighted works, noting that “[w]ithout a copyrighted work as the basepoint, the word ‘origin’ has no discernable limits.”⁹⁴ But unless the reference to uncopyrighted works meant works that had never enjoyed copyright protection in the first place, it is unclear why this problem would be greater with once-copyrighted works that have fallen out of the public domain as opposed to works still under copyright protection.

While these rationales offer substantive justification to eliminate attribution-related reverse passing off claims through the Lanham Act in all instances—not just claims relating to public domain works—the holding in Dastar was not without its significant problems. First, Dastar suffered a seemingly significant incongruity with the purpose of the federal trademark regime. If the goal of the Lanham Act is, indeed, consumer protection, the Supreme Court’s central holding in Dastar—that the Lanham Act’s reference to origin means the source of an actual physical product and not the wellspring of the idea or intellectual property embodied in a particular product—fails to reflect the reality of what factors animate consumer behavior, particularly with respect to intellectual property. Crediting is not just important to authors; it is vital the public’s decision-making process when it comes to consuming entertainment content. As Mary LaFrance has pointed out, contrary to the ultimate thrust of Dastar, which held that trademark law only protects against misidentification of the maker of the actual product rather than the ideas behind it,

in the case of literary works or entertainment works, the identity of the actual author, performer, or creative overseer may frequently be more crucial to the consumer’s purchasing decision, than the identity of the party that manufactured the physical embodiment [because] the identity of key creative participants is often viewed as a source indicator that is an important predictor of the quality or content of the goods.⁹⁵

Indeed, Dastar creates an unusual result for physical products containing intellectual property, as it provides protection to the designation of origin about which consumers arguably care the least. To put a finer point on it, consumers do not care if the movie they are watching was printed on Kodak film or released by Warner Brothers; they care about the fact that it was directed by Martin Scorsese or written by Charlie Kaufman. Readers do not care about whether Random House or Harper Collins was responsible for the paper and ink on which a book appears; they care about whether the book was written by J.K. Rowling or Thomas Pynchon. Music listeners do not care if the album was issued by SubPop or Merge Records; they care about whether it contains performances by Spoon or The Mountain Goats. The disconnect between the law’s protections and this reality could not be more stark or problematic.

Most importantly, for a large swath of creatives, Dastar all but eliminated hope for securing crediting rights through legal claims.⁹⁶ Admittedly, the Supreme Court took pains to caution that its decision had not necessarily eliminated all means to vindicate attribution rights and that Dastar did not speak to alternative causes of action to enforce crediting under common, state, and federal law, including other theories (such as false advertising) available under the Lanham Act. But as one practitioner euphemistically noted in the wake of Dastar, the remaining options relied on “creative lawyering.”⁹⁷ This turned out to be shorthand for shots in the dark that have little chance of working. For as we shall analyze in great detail infra, Dastar marked a significant inflection point in the state of attribution rights—significantly curtailing (if not altogether eliminating) the ability of most creators to receive credit under the law.⁹⁸

B. The State of Crediting Rights in the Two Decades Since Dastar

In his post-Dastar assessment of the state of attribution rights written in 2007, Justin Hughes argued that the crediting gap left by Dastar was not as wide as commonly believed. “[I]f we work through all the possibilities, the practical hole created by Dastar may be operatively modest,” he contended.⁹⁹

Dastar creates a gap in protection for those works and circumstances where there is a failure of appropriate attribution and no cause of action under VARA, under state moral rights laws, under 17 U.S.C. § 1202 for failure to include copyright management information, or under state unfair competition laws in states where the courts hold that Dastarshould not control, and where contract law does not establish a framework to protect attribution.¹⁰⁰

Hughes’s assessment has proven excessively sanguine, unfortunately. Although the legal theories to which he cited as alternative bases for protection may be numerous in quantity, they are qualitatively impoverished and provide scant (if any) relief in the vast majority of situations. Moreover, in the nearly two decades since Dastar, the significant size of the credit gap has become manifest as the jurisprudence of the intervening years has made clear how little bite these alternative legal theories provide for the vindication of crediting interests.

1. False Advertising Claims Under the Lanham Act

The very cause of action to which the Supreme Court cited as continuing to provide attribution protection post-Dastar—the Lanham Act’s prohibition on false advertising or misrepresentations of fact—has proven feeble in this regard. While Dastarexpressly foreclosed the possibility of attribution-related claims under section 43(a)(1)(A), it did not altogether eliminate the ability to vindicate crediting rights under the Lanham Act. Since the Dastar holding only opined as to the meaning of “origin” in the statute (which is invoked in section 43(a)(1)(A), referring to “confusion . . . as to the origin”),¹⁰¹ remaining provisions of the Lanham Act that did not employ that word could still have application to attribution-related issues. This was true for the Lanham Act’s cause of action for false advertising that, under section 43(a)(1)(B), created liability for anyone who “misrepresents the nature, characteristics [or] qualities . . . of . . . goods, services, or commercial activities.”¹⁰² In fact, Dastar expressly pointed to this provision as one ground for relief that may still be possible following the decision. As the Court noted,

If, moreover, the producer of a video that substantially copied the Crusade series were, in advertising or promotion, to give purchasers the impression that the video was quite different from that series, then one or more of the respondents might have a cause of action—not for reverse passing off under the “confusion . . . as to the origin” provision of § 43(a)(1)(A), but for misrepresentation under the “misrepresents the nature, characteristics [or] qualities” provision of § 43(a)(1)(B).¹⁰³

That said, such a path has proven less than promising and the Court’s supposition that such relief might be forthcoming has proven too optimistic, at best—or disingenuous, at worst. First, despite Dastar’s seemingly express exhortations to the contrary, subsequent courts have found that the holding in Dastar actually prevents both section 43(a)(1)(A) and section 43(a)(1)(B) claims on similar facts.¹⁰⁴ Second, and more fundamentally, false advertising claims face additional hurdles not present in an attribution claim under section 43(a)(1)(A). These impediments would be difficult for most plaintiffs seeking vindication of an attribution right to clear. For example, many courts require competitor standing to bring a false advertising suit. Until the Supreme Court recently broke a circuit split, false advertising claims were, in many circuits, per se limited to commercial “actual” (direct?) competitors.¹⁰⁵ Even now, standing remains a significant issue. As the Supreme Court noted in Lexmark International, Inc. v. Static Control Components, Inc., false advertising plaintiffs must show that they “fall within the zone of interests” protected by the statute and must have suffered a harm proximately caused as a result of the act of false advertising.¹⁰⁶ But to fall within the zone of interests, the plaintiff must “allege an injury to a commercial interest in reputation or sales.”¹⁰⁷ Since consumers typically do not lose sales or suffer an injury to reputation, the new standard makes it exceedingly unlikely that consumers can bring a false advertising claim. While plaintiffs need not be direct competitors anymore to bring a claim under section 43(a)(1)(B), they still generally need to be competitors of some sort.

Finally, false advertising is actionable under section 43(a)(1)(B) if and only if the statement is false on its face or the misrepresentation is material, that is, relied upon in consumers’ purchasing decision.¹⁰⁸ This consumer reliance requirement makes eminent sense for false advertising claims, but it makes less sense when dealing with issues of attribution which should be, first and foremost, about vindicating the rights of authors to receive credit for their works rather than rights of the public from being deceived in material consumption decisions. Moreover, while the most famous and acclaimed of authors may survive such a materiality requirement, the vast majority will have a far more difficult time.

2. Attribution Claims Under the Visual Artists Rights Act

On the surface, VARA would appear to provide significant protection for the attribution rights of authors. Codified in section 106A of the Copyright Act, VARA offers creators an independent cause of action “to claim authorship of [their] work,” and “to prevent the use of his or her name as the author of any work of visual art which he or she did not create.”¹⁰⁹ VARA claims are eligible for recovery of both statutory damages and attorneys’ fees and, to make matters even better for putative plaintiffs, unlike for infringement claims, an author does not even need to timely register the work in question as a condition for these remedies.¹¹⁰ Thus, a cursory examination of VARA might elicit hope for the vindication of crediting. But a closer look reveals just how profoundly limited the rights under VARA are.

First, as the very name of the legislation makes clear, VARA’s attribution rights only encompass works of visual art.¹¹¹ As such, the Act fails to apply to large swaths of subject matter otherwise protectible under the Copyright Act, including writings, music, and other important works. But the limits do not end there, as the attribution right does not even attach to all forms of art that might be characterized as visual in nature. Rather, the statute covers only paintings, drawings, prints, sculptures, and photographs created for exhibition purposes only.¹¹² It therefore excludes the most commercially important of visual art—film.¹¹³ It also does not apply to any “poster, map, globe, chart, technical drawing, diagram, model, applied art, . . . book, magazine, newspaper, periodical, data base, electronic information service, electronic publication, or similar publication” or any “merchandising item or advertising, promotional, descriptive, covering, or packaging material or container.”¹¹⁴ In addition, all works made for hire fall entirely outside of VARA’s protections.¹¹⁵ Finally, for the narrow category of visual art works to which VARA might apply, the attribution right only attaches to original versions of those works or limited editions thereof issued in sets of “200 copies or fewer that are signed and consecutively numbered by the author.”¹¹⁶ At the end of the day, therefore, VARA’s attribution right only applies to a limited set of visual art works that are not prepared as works made for hire. In short, VARA provides no crediting protection for the vast majority of authors.

3. Falsification and Removal/Alteration of Copyright Management Information Claims Under the Digital Millennium Copyright Act

Introduced into law with the passage of the Digital Millennium Copyright Act in 1998 (“DMCA”), the provisions of the Copyright Act that make it unlawful to falsify, alter, or remove copyright management information, which includes any authorship and copyright ownership data accompanying a work,¹¹⁷ would seemingly serve as a powerful vehicle to vindicate attribution rights. But while these provisions—codified in 17 U.S.C. § 1202 (“section 1202”)—constitute the sole protection granted to authorship information in all (rather than VARA’s narrow subset of) copyrighted works, their reach is deliberately constrained. Among other things, the structure of the two causes of action provided under section 1202—a claim for falsification of copyright management information (“CMI”)¹¹⁸ and a claim for removal or alteration of CMI¹¹⁹—makes clear that the protections therein are subservient to the goal of fighting infringement and not any inherent value that may come from crediting. In other words, the guiding principle behind section 1202 is preventing further infringement, not vindicating an author’s very real, but potential separate, interest in crediting. As such, section 1202 fails to provide a meaningful right to crediting for authors.

Specifically, a claim for falsification of CMI requires that plaintiffs show that defendants “knowingly and with the intent to induce, enable, facilitate, or conceal infringement . . . provide[d] copyright management information that is false.”¹²⁰ Similarly, a claim for removal/alteration of CMI requires that plaintiffs show that defendants “intentionally remove[d] or alter[ed] copyright management information . . . knowing, or . . . having reasonable grounds to know, that it will induce, enable, facilitate or conceal an infringement.”¹²¹ Thus, both falsification and removal/alteration claims have a strict double scienter requirement that necessitates plaintiffs demonstrate that defendants acted with a particular mens rea—that is, knowingly and with intent to facilitate infringement.

This onerous scienter requirement is significant in at least three ways. First, it contrasts markedly from the complete absence of any scienter requirement in matters of direct copyright infringement.¹²² Specifically, infringement has always been a strict liability tort,¹²³ where a defendant’s state of mind is wholly irrelevant to the issue of liability.¹²⁴ By sharp distinction, to prevail on an attribution claim through section 1202, a plaintiff must meet not one but two (if not three¹²⁵) showings on the defendants’ state of mind.

Second, by conditioning attribution relief on an intent to facilitate infringement, section 1202 firmly grounds its protections in service of the fight against infringement rather than any broad vindication of crediting rights. This position is further buttressed by the fact that section 1202 only protects CMI that is “conveyed in connection with copies or phonorecords of a work or performances or displays of a work.”¹²⁶ Furthermore, some courts have even read the legislative history and intent behind section 1202 to preclude application of falsification and removal/alternation claims to nondigital works.¹²⁷ According to the logic of these courts, section 1202’s primary purpose—fighting the scourge of piracy in the online environment because of the unique ease of digital infringement—compels such a limitation on section 1202 claims. Such a position, however, leaves attribution rights outside of the digital environment unaddressed.

Third, and relatedly, the dual scienter requirement makes it extraordinarily difficult to prevail on a section 1202 claim. To state a cognizable removal/alteration claim, for example, a plaintiff must demonstrate that either a work “came into Defendant’s possession with CMI attached, and Defendant intentionally and improperly removed it” or a work “came into Defendant’s possession without CMI attached, but Defendant knew that CMI had been improperly removed, and Defendant used the [work] anyway.”¹²⁸ A plaintiff may be unable to show how or in what form a work came into the defendant’s possession in the first place¹²⁹ and, even if they can, it is rare to have sufficient evidence showing that the removal/alteration was specifically with the intent to facilitate infringement. For example, an erroneous belief about the copyright status of an image can preclude a finding of the knowledge required to state a claim under section 1202.¹³⁰ Meanwhile, even intentionally cropping out a copyright notice from an image is insufficient to meet the intent to facilitate requirement.¹³¹

Not surprisingly, therefore, CMI claims are frequently adjudicated as a matter of law based on the failure to adequately make even a threshold showing of knowledge and intent.¹³² Consider, for example, the difficulties that an author might face in bringing a section 1202 claim even against someone who both knowingly and intentionally crops an image to cut out the authorship information. Even assuming such authorship information qualifies as actionable CMI, there are myriad reasons (that may have nothing to do with the concealing of infringement) to crop out such authorship information. Among other things, the person making use of the image could claim to have cropped the images for aesthetic purposes, because of inherent space limitations for the usage, or without any idea that they were removing CMI.¹³³ In all of these instances, authors may have legitimate, if not strong, interests in seeing uses of their work include attribution. Yet they would be unactionable under section 1202.

4. Attribution Rights Under State Unfair Competition Law and Other Common Law Theories.

Although there was initially some optimism about attribution rights remaining available under state law post-Dastar, such hopes have proven misplaced. First, in many states, such as California, courts have interpreted unfair competition protections as coextensive with the Lanham Act. Thus, if Dastar renders attribution claims no longer viable under the Lanham Act, such claims must necessarily also fail under state unfair competition law.¹³⁴ Second, in the wake of Dastar, both Tom Bell¹³⁵ and Michael Landau¹³⁶ suggested that copyright preemption issues raised by the decision could preclude use of state or common law theories to protect attribution rights. These predictions turned out to be correct, as courts have regularly read Dastar in such a manner.¹³⁷ In fact, even prior to Dastar, some courts viewed state unfair competition claims seeking credit as preempted.¹³⁸

The absence of clear legal protections for crediting has led some plaintiffs to rely (often futilely) on a veritable smorgasbord of common law theories in an attempt to cobble together some basis for relief. For example, when a Cornell graduate student (Antonia Demas) sued a member of her advisory committee (Professor David A. Levitsky of the School of Human Ecology) for improperly taking credit for research she had conducted into the nutritional habits of elementary schoolchildren, using that research to obtain a significant grant without her name, and then actively and publicly rebuffing her allegations of wrongdoing,¹³⁹ she did not bring a claim under the Lanham Act for misattribution.¹⁴⁰ Instead, she was left reciting the common law’s greatest hits in her complaint by claiming liability for misappropriation, fraud, breach of contract, breach of fiduciary duty, negligence, tortious interference with prospective economic advantage, defamation, and intentional infliction of emotional distress.¹⁴¹ While circumstances may make it possible to prevail on one of these theories, victims of crediting abuse face an uphill battle in meeting all of the required elements of such common law claims.¹⁴²

5. Private Contracting: The Promise and Perils

With false advertising, VARA, CMI falsification/removal/alteration and unfair competition claims providing little relief, creators are left with private contracting to do the work of crediting.¹⁴³ There is no doubt that, in some industries, private contracting and even social norms have gone a long way toward ensuring proper crediting. But significant lacunae remain and, even where private contracting and norms do provide for crediting, it is not always reflective of authorial contributions. As such, reliance on private systems to govern crediting is insufficient to provide for appropriate attribution rights.

There are some fields where private contracting has given rise to deeply nuanced and vigorously patrolled crediting requirements. Two paradigmatic examples are Hollywood and academia.¹⁴⁴ In the movie industry, collective bargaining has helped level the playing field between the studios and talent, and the operative guild agreements have insisted on getting even the most minute of credits done correctly.¹⁴⁵ Though it is not without its flaws,¹⁴⁶ the system has worked relatively well.¹⁴⁷ And even if it might seem a tad onerous to any member of the public who has sat through the credits of a motion picture, those credits instill industry professionals with a sense of pride over their brief moment of acknowledgement on the silver screen. Just as importantly, by making the contributions of industry professional publicly legible in databases such as IMDb.com,¹⁴⁸ the regime also ensures that those individuals can reap the reputational and economic benefits of their credits,¹⁴⁹ or, to give a notable example, help avoid the ruin that might come from an unfair attribution. To wit, from 1968 through 2000, the Directors Guild of America allowed aggrieved directors who believed a studio or other producer had butchered their movie in unimaginable ways to petition to have their directorial credit replaced with the fictional “Alan Smithee” pseudonym, lest the final product sully the real director’s good name.¹⁵⁰

In the Academy of Motion Picture Arts and Sciences, strong attribution norms have given rise to anti-plagiarism codes, which have the bite of law and are frequently enforced against offenders in university disciplinary proceedings. Though not perfect,¹⁵¹ the carrot of the norm and the stick of disciplinary proceedings have served to ensure generally robust crediting practices.

But in other industries without collective bargaining or finely tuned attribution codes, where crediting is just as important and billions of dollars are on the line, there are no such formal crediting regimes. In such endeavors, crediting decisions are often left to general norms and individual negotiations. As a result, crediting often becomes more about power than actual contribution. As Catherine Fisk points out about most fields of entertainment, “Apart from the guild-controlled screen credit system, the credit system for other creative and technical people in entertainment seems to be more governed by norms, charity, and power than by law.”¹⁵² One notable example of this is producer credits, which are not governed by collective bargaining. As a result, producing credits are notoriously corrupt, and a veritable “prestige market” for production credits exists. Meanwhile, even in the guild crediting systems of the Screen Actors Guild-American Federation of Television and Radio Artists, the Writers Guild of America, and the Directors Guild of America, power relations frequently trump creative contributions in determining attribution rights. As Fisk notes,

Because the guild agreements limit the number of people who can be credited in some roles on any one film, power relations among various possible contenders for credit affect who is listed. Individual workers with significant bargaining power (actors, directors, writers, and producers) negotiate for specific treatment on each project, which may or may not reflect the same level of artistic contribution as compared to others who receive a similar type of credit on a different film or who receive the same credit (or no credit) on the same film.¹⁵³

The absence of legal protection for attribution rights outside of private contracting has profound consequences for distributive justice. When viewed through the prisms of race, gender, or socioeconomic disparities, crediting practices have a particularly troubling history. Simply put, those who are not white, male, or wealthy have far too often struggled to receive credit, even when they indisputably authored work. This is because crediting is as much (if not more) about power dynamics and contractual leverage as it is about origination. As K.J. Greene has poignantly noted,

Top directors, such as a Spike Lee or Steven Spielberg, will have no problem obtaining credit [by exercising their bargaining power in negotiations to contract for it], but anyone else dependent on a contract to secure credit will likely lose out. . . . [W]hile Dastar, on its face, seems completely neutral on the subordination issue, it actually promotes greater subordination; despite the Oprah’s and Denzel’s of the world, Blacks, women, and other minorities still occupy the bottom of the totem pole in entertainment hierarchies, making them the most vulnerable to misattribution abuses.¹⁵⁴

Greene’s concern is not speculative or hypothetical. Unfortunately, it is widely reflected in the history of scientific and creative enterprise.

Consider, for example, the systematic undervaluing and underrecognition of innovations by women. In the sciences, the phenomenon even has its own term—the Matilda effect¹⁵⁵—and it is no less prevalent in the world of arts and letters. To take a few illustrative examples, Margaret Keane was the actual painter of the “big-eyed waifs” long credited to her husband, Walter;¹⁵⁶ Elizabeth Magie created the game of Monopoly, not Charles Darrow;¹⁵⁷ and although attributed to Marcel Duchamp, The Fountain—the infamous urinal that rocked the art world at the 1913 Armory Show—was likely the work of Elsa von Freytag-Loringhoven.¹⁵⁸ In short, crediting is often about who has the leverage (and, in the cases of some swindlers, the gall) to claim authorship, not who really created a work.

The dogged persistence of disparities in attribution has far-reaching consequences, exacerbating existing gender gaps in a number of professions, including the law. For example, Jordana Goodman’s empirical study of crediting practices for patent attorneys, which examined a set of over 200,000 patent applications and office action responses before the United States Patent and Trademark Office from 2016–2020, found an alarming divergence between “attribution and presence” for female patent attorneys, even when accounting for nongendered partner-associate power differentials, years of practice, and other relevant experience.¹⁵⁹ In the field of computer software, for instance, Goodman estimates that female attorneys suffered a thirty-one percent shortfall in crediting.¹⁶⁰ As she concludes, the “lack of equitable attribution perpetually disadvantages women, negatively impacts their career progression, and likely creates an insurmountable chasm between their capabilities and their prestige.”¹⁶¹ Ultimately, such practices “contribute[] to women’s systemic underrepresentation at top leadership levels throughout the United States,”¹⁶² a state of affairs presided over by current private ordering regimes such as the workflow structure of modern law firms.¹⁶³

The problematic dynamics in leaving crediting to private contracting are on full display in the music industry. As Fisk points out, “in music there is a not uncommon practice of people who do not contribute to the writing of a song being ‘cut in’ on songwriting credit.”¹⁶⁴ The practice is not always nefarious, of course. Peter Jackson’s Beatlesdocumentary, The Beatles: Get Back,¹⁶⁵ provides a notable example. As the film’s exhaustive studio footage capturing the crafting of the title song makes crystal clear, the work was the singular product of Paul McCartney’s musical ingenuity. But the song’s writing credits—“Lennon/McCartney”—tell a very different tale. In this case, a desire to keep an uneasy (though ultimately unsustainable) peace and to honor the duo’s (soon-to-be dissolved) songwriting partnership came at the expense of accuracy. Less innocuously, however, there are myriad instances where crediting practices reflect power more than creative contribution and cut along disturbing gender or racial fault lines. To take one example, Little Richard coauthored the classic Tutti Frutti with Creole songwriter Dorothy LaBostrie.¹⁶⁶ However, as if it were not bad enough that handlers cajoled him into selling his publishing rights to his record company for a proverbial song (a meager fifty dollars), he also provided songwriting credit to a party that likely had nothing whatsoever to do with the authorship of the song¹⁶⁷—one that may have been the be pseudonym for the owner of Richard’s record label (who reaped the royalties, which continue to be earned on the song to this day).¹⁶⁸ As Greene documents, Richard’s experience was no outlier; it was par for the course. And as he observes, “The fact that minority artists received less protection—or in many cases no protection—for their compositions undermines the incentive theory of intellectual property laws. Many Black artists received little no economic reward for their creations. Others certainly received less than what they should have.”¹⁶⁹

Even beyond issues of race, gender, and socioeconomic status, crediting often reflects relational and power dynamics that may have nothing whatsoever to do with real creative contributions, such as the tenured professor receiving sole authorial credit for a work that includes substantial contributions from graduate students, the law firm partner who has no problem enjoying attribution for the work of a junior associate, or the senator whose “words” are actually those of a speechwriter. Indeed, Spenser Clark’s deep dive into the crediting practices on Hold Up, one of the songs from Beyoncé’s acclaimed concept album Lemonade, illustrates this point in the world of pop music.¹⁷⁰ As Clark explains, Hold Up’s title and some of its lyrics come from a line that Ezra Koenig, the lead singer of Vampire Weekend, had once tweeted (which, itself, was based on a lyrics from Maps, a song by indie rockers the Yeah Yeah Yeahs) and subsequent lyrics Koenig had developed in the studio with Beyoncé’s noted producer, Diplo, while they were working with a loop from an Andy Williams song.¹⁷¹ Beyoncé ultimately gave Koenig and the Yeah Yeah Yeahs songwriting credit on Hold Up and Diplo a producing credit, but, notably, Williams received no credit at all—either as a songwriter or producer.¹⁷² As Clark concludes,

Oftentimes [artist crediting] choices are not based in law, but rather more intangible considerations like the desire to maintain relationships with creators they wish to work with in the future. Andy Williams’ song, for example, was released in 1963, and therefore [Beyoncé] Knowles was probably less concerned with that relationship as she was with other, more relevant artists.¹⁷³

Notably, in the music industry (just as in some other creative fields), crediting is not only of reputational or ethical significance; it also determines payment of royalties related to the exploitation of sound recordings and musical compositions.

Finally, besides the power dynamics inherent in the private negotiation of credits, the fundamental constraints of contracting also limit how far it can go in ensuring proper attribution. Crediting claims that rest on negotiated obligations require privity for enforcement, and the realities of the marketplace dictate that not all uses of one’s work will be by individuals or entities with whom an author could or would contract.¹⁷⁴ Indeed, this is precisely why we do not leave protection against unauthorized reproduction or other unlawful uses of copyrighted works to contracts and, instead, have infringement claims available that require only access and substantial similarity—no privity and, indeed, no knowledge. All told, therefore, while contracting, especially via collective bargaining, has enjoyed some success in certain industries, private law has not proven sufficiently robust to ensure that crediting rights are adequately protected in the many areas of the information economy where they matter vitally to authors, investors, and consumers.

III. REFORM

A. Questioning Attribution Rights: Why Good Norms Do Not Necessarily Make Good Law

Our legal regime’s present crediting gap—the yawning chasm between the high value of attribution and the surprising absence of safeguards in our existing system to protect the practice—would seem to suggest a manifest need for reform. But before wholeheartedly embracing the adoption of some kind of credit-mandating legal regime, it is worth pausing to consider that best practices do not always translate into righteous laws. In other words, while giving credit might be the right thing to do, that does not necessarily mean we should legally require it, either broadly or in limited contexts. To put it bluntly, not all norms need the bite of law. For example, compliance with some norms—like thanking a gift giver—is more meaningful when it results from volition rather than compulsion. Moreover, to limit the scope of potential government intrusion into personal affairs, we do not want the law to microregulate every aspect of human existence. Thus, it is important to approach any effort to expand the law to regulate behavior that was previously not squarely within the aegis of our legal regime with a healthy amount of skepticism.

For example, despite moral entreaties against them, prevarications mostly lie¹⁷⁵ outside of the scope of legal regulation—and with good reason. As Judge Alex Kozinski explained in one of the most mordant and entertaining paragraphs ever to appear in the Federal Reporter, such a societal choice honors the freedom of human expression, regardless of moral valence, and serves greater First Amendment interests. “Living means lying,” Kozinski famously posited (in words that are part of the public domain and thereby forgiving of extended quotation):

Self-expression that risks prison if it strays from the monotonous reporting of strictly accurate facts about oneself is no expression at all. Saints may always tell the truth, but for mortals living means lying. We lie to protect our privacy (“No, I don’t live around here”); to avoid hurt feelings (“Friday is my study night”); to make others feel better (“Gee you’ve gotten skinny”); to avoid recriminations (“I only lost $10 at poker”); to prevent grief (“The doc says you’re getting better”); to maintain domestic tranquility (“She’s just a friend”); to avoid social stigma (“I just haven’t met the right woman”); for career advancement (“I’m sooo lucky to have a smart boss like you”); to avoid being lonely (“I love opera”); to eliminate a rival (“He has a boyfriend”); to achieve an objective (“But I love you so much”); to defeat an objective (“I’m allergic to latex”); to make an exit (“It’s not you, it’s me”); to delay the inevitable (“The check is in the mail”); to communicate displeasure (“There’s nothing wrong”); to get someone off your back (“I’ll call you about lunch”); to escape a nudnik (“My mother’s on the other line”); to namedrop (“We go way back”); to set up a surprise party (“I need help moving the piano”); to buy time (“I’m on my way”); to keep up appearances (“We’re not talking divorce”); to avoid taking out the trash (“My back hurts”); to duck an obligation (“I’ve got a headache”); to maintain a public image (“I go to church every Sunday”); to make a point (“Ich bin ein Berliner”); to save face (“I had too much to drink”); to humor (“Correct as usual, King Friday”); to avoid embarrassment (“That wasn’t me”); to curry favor (“I’ve read all your books”); to get a clerkship (“You’re the greatest living jurist”); to save a dollar (“I gave at the office”); or to maintain innocence (“There are eight tiny reindeer on the rooftop”).

. . . .

Even if untruthful speech were not valuable for its own sake, its protection is clearly required to give breathing room to truthful self-expression, which is unequivocally protected by the First Amendment. . . . If all untruthful speech is unprotected, as the dissenters claim, we could all be made into criminals, depending on which lies those making the laws find offensive. And we would have to censor our speech to avoid the risk of prosecution for saying something that turns out to be false. The First Amendment does not tolerate giving the government such power.¹⁷⁶

We not only insulate certain forms of morally suspect speech from legal liability, but also certain acts. So while we believe cheating on a spouse is repugnant to one’s martial vows, in most states no civil liability attaches to unfaithfulness, and it is largely irrelevant in most divorce proceedings. Thus, while we may have a broad societal consensus that some actions constitute moral wrongs, we do not necessarily criminalize, or impose civil liability on, all of those wrongs.

That said, crediting directly ties to a matter of significant public, rather than solely private or familial, interest. As we have detailed, attribution rights not only strike at the core of the utilitarian function of the copyright regime—advancing progress in the arts by incentivizing the production of creative work—but also other social benefits tied to economic and cultural interests. Indeed, in the conclusion to her exhaustive survey of crediting regimes in a wide variety of industries involved in scientific and cultural production, Fisk concludes that, while private law and norms have provided some protection for attribution rights, the current state of affairs warrants, if not compels, some type of legal intervention.¹⁷⁷ But Fisk also cautions that any reform should supplement, but not supplant, existing practices.¹⁷⁸ Since private ordering and norms have functioned with some success, there appears to be wisdom in approaching attribution rights with a disinclination to implement any new regime that is overly onerous or excessively undermines flexibility. With that caveat in mind, we turn to assess several proposals for reform.

B. The Problem with Overturning Dastar and Amending the Lanham Act

The most immediate and obvious reform measure for addressing the crediting gap created by Dastar and its progeny would involve overturning Dastar’s core holding. For example, as Justin Hughes has suggested, such an action could occur through legislation that amends the Lanham Act to define origin as including the intellectual source of creative works that have not yet fallen into the public domain.¹⁷⁹ In other words, congressional action could restore the availability of attribution-related claims for reverse passing off for works still under copyright protection. But such legislation would also respect the Supreme Court’s rightful concern about limiting erstwhile rightsholders with expired copyrights from attempting to perpetuate their monopolistic stranglehold on the exploitation of creative works by turning the Lanham Act into a “species of mutant copyright law that limits the public’s ‘federal right to “copy and to use” ’ expired copyrights”¹⁸⁰ However, even if the legislation is carefully crafted to apply the holding of Dastar only to works with expired copyrights, such a proposal might create more problems than it solves. Among other things, the Lanham Act is a poor fit for the vindication of attributive interests in creative works, and, even prior to Dastar, those inadequacies and fissures showed.

The ability of litigants to vindicate attribution rights through the vehicle of the Lanham Act has always been less than ideal—the Ninth Circuit’s Montoro decision and its progeny notwithstanding. Indeed, Bobbi Kwall argued this very point in 2002—just before the Dastar ruling—when she highlighted at least three ways in which the extant jurisprudence of the time stunted attribution claims, even under the Lanham Act.¹⁸¹ First, competing interpretations of section 43(a) by the federal courts in different jurisdictions had created a patchwork of inconsistent requirements that hampered the viability of reverse passing off claims for misattribution.¹⁸² Second, courts had sometimes even found such claims preempted under section 301 of the Copyright Act.¹⁸³ Finally, and potentially most problematically, section 43(a)’s ultimate focus on consumer confusion and the prevention of deception¹⁸⁴ led courts to “become preoccupied with different manifestations of ‘falsity’ at the expense of [protecting] an author’s personality and reputational interests.”¹⁸⁵ Thus, dignitary injuries to an artist from a lack of attribution, or even speculative injuries as to the future harm that a lack of recognition may bring, are not cognizable under a section 43(a) claim. So, for example, in 1999, the Fifth Circuit affirmed summary judgment for a record label on a section 43(a) claim for reversing passing off based on the record label’s alleged failure to credit the authors of a digital sample of the authors’ work.¹⁸⁶ Even though it acknowledged the lack of attribution, the Fifth Circuit still denied the claim on the basis that the plaintiffs could not demonstrate a genuine issue of likelihood of confusion.¹⁸⁷ Read strictly, section 43(a)’s requirement of a showing of likelihood of consumer confusion would threaten most attribution claims, especially those that stem from smaller uncredited uses of a work¹⁸⁸ and even larger uses of works by authors that are not sufficiently well-known so as to meet the threshold of consumer confusion necessary to sustain a claim under section 43(a).¹⁸⁹ In other words, the Lanham Act’s conditionality of liability on consumer confusion inherently and significantly narrows the breadth of any protection for attribution it might otherwise provide. Overruling Dastar would do nothing to address this issue.

Meanwhile, although restoration of attribution-related reverse passing off claims for works still under copyright protection might address the Supreme Court’s concern about the potential private recapture of public domain works, it would not address another problem that undergirded the rationale of Dastar: the catch-22 of crediting. As the Dastar Court pointed out, attribution rights can mire users of copyrighted works in a damned if you do, damned if you don’t scenario. On one hand, if they do not provide credit, they might face claims for failure to attribute. On the other hand, if they do attribute, they might face accusations of a type of passing off—effectively engaging in a form unwanted attribution that the attribute regards as connoting sponsorship, endorsement, or affiliation with their product. This, in turn, can produce liability under the Lanham Act.¹⁹⁰

Meanwhile, though the anticompetitive implications of reverse passing off claims related to attribution are most pressing when a work is otherwise in the public domain, the ability of such a cause of action to stifle legitimate uses of works under copyright protection also bears consideration. Specifically, the Supreme Court’s anxieties about a mutant form of copyright law apply more broadly than the re-copyrighting of works that have fallen into the public domain; they apply with equal force to how a crediting regime could entangle and ensnare all sorts of unwitting users of copyrighted works, including properly licensed ones, for failure to make proper crediting. Attribution requirements can be onerous, particularly if we return to the pre-Dastar state of affairs under the Lanham Act, where it was unclear just how much crediting might be required to avert potential reverse passing off claims. Indeed, in the unanimous Dastar opinion, Justice Scalia cited the “serious practical problems” that would result from an attribution requirement without carefully circumscribed limits.¹⁹¹ After detailing the exhaustive list of potential credits that Dastar would have had to give¹⁹² if the Court had found that the Lanham Act’s reference to “origin” required attributions to all of the originators of “the ideas or communications that ‘goods’ embody or contain,”¹⁹³ Scalia quipped that it made no sense to interpret the Lanham Act as requiring a search “for the source of the Nile and all its tributaries.”¹⁹⁴

A restoration of the pre-Dastar state of the law for works still in copyright could adversely impact the rights of legitimate users of copyrighted works and enable a similar type of result as Fox sought to achieve in pursuing its claims in Dastar. An example illustrates this point. If a producer-rightsholder grants a distributor rights to its work and then the distributor properly sublicenses those rights to an exhibitor, there is no issue of copyright infringement and, under our current regime, the exhibitor would feel secure in exploiting the work. But if attribution-related reverse passing off claims are restored under the Lanham Act, all manner of mischief could result in undermining the exhibitor’s properly granted exploitation rights if a purported “source” does not receive the attribution they believe they deserve in conjunction with the exploitation. The broad scope of who or what might constitute a “source”—as illustrated by Dastar’s infamous passage about the “Nile and all its tributaries”—makes this clear.

Thus, it may be with good reason that the period of time during which courts recognized an attribution-related claim for “reverse passing off” was relatively short. Although there were occasional outlier decisions in the distant past, “[w]idespread acceptance of [such] a cause of action began around 1980”¹⁹⁵—meaning that creators enjoyed access to such a claim for less than a quarter century and questions about the practice abounded during that era. Several theorists, for example, argued that use of the Lanham Act in this (admittedly sympathetic) context stood on shaky, if not wholly unjustifiable, legal grounds. Although he acknowledged that the right of attribution was “a commercially valuable right,”¹⁹⁶ Randolph Stuart Sergent asserted that such a claim failed to serve the Lanham Act’s purported goals and was inappropriate under section 43(a).¹⁹⁷ In presciently anticipating Dastar, he bemoaned the power of Montoro-like claims to serve as “a tool for controlling the sale of the underlying product [in a manner that would] reduc[e] marketplace competition . . . to the immediate detriment of consumers.”¹⁹⁸ Meanwhile, John Cross argued that, although “[a]llowing [a] plaintiff to recover for reverse passing off certainly ‘feels’ right,”¹⁹⁹ it is worth noting that “vague feelings of impropriety . . . are not enough to justify a cause of action.”²⁰⁰ In his analysis, imposing liability under the Lanham Act for reverse passing off failed “to prevent or cure any meaningful consumer deception” and undermined the delicate balance between encouraging innovation and promoting competition by allowing original sources to monopolize works that are either ceded to or eventually fall into the public domain by operation of copyright and patent law.²⁰¹ These concerns remain for any effort to overturn Dastar.

In short, even before Dastar, the Lanham Act simply did not provide consistent protection for authorial crediting. As such, simply reforming Dastar does not really get us a proper fix for vindicating attribution rights. Although there is much to criticize about Dastar—the void it has created in the law of crediting and its shaky factual premise—there are also compelling reasons to leave the primary holding of Dastar undisturbed and to eschew reliance on the Lanham Act as a means to vindicate attribution rights. Indeed, if Dastar achieved any good, perhaps it was in taking the issue of authorial attribution out of the scope of the Lanham Act, where it represented a square peg being forced into the proverbial round hole.

C. The Challenges with Creating an Independent Attribution Claim Under the Copyright Act

Other scholars have considered whether it might make sense to amend the Copyright Act to provide for a general attribution right.²⁰² Jane Ginsburg, for one, has advanced such a proposal.²⁰³ While the idea certainly has a great deal of merit, it also suffers from some significant shortcomings. On the positive side, Ginsburg’s proposal seeks to resolve this surprising lacuna in American intellectual property jurisprudence by finally granting creators a general right of attribution. Meanwhile, Ginsburg advocates the duration of the attribution right to match the copyright term—thereby averting instances of attribution liability for the use of public domain works. For reasons that we have also advocated,²⁰⁴ she also recognizes the importance of taking attribution rights outside of the Lanham Act given that attribution should be recognized regardless of proof of economic harm or consumer confusion.²⁰⁵ She also attempts to address potential issues regarding the unwieldy and uncertain scope of attribution obligations pre-Dastar by limiting the affirmative right of attribution to just legal authors and performers.²⁰⁶

But Ginsburg’s proposal has some significant difficulties. While legal authorship is often singular, performers can number into the thousands. Thus, the inclusion of performers in the attribution requirement could create the specter of liability for the unwitting.²⁰⁷ More broadly, crediting of authors is not always practicable. Although Ginsburg addresses this concern by suggesting that the statute would be subject to a standard that incorporates a “reasonableness criterion,”²⁰⁸ such ambiguity is arguably the last thing that copyright law needs. After all, copyright users already have the remarkable illegibility of the fair use analysis with which to contend. Adding an additional crediting requirement that has no restraint other than “reasonableness” adds just another unfortunate layer to the copyright thicket of licensing and clearance requirements that already stifle creative activity.

Indeed, the very example that Ginsburg uses to tout the salutary and nimble nature of an attribution requirement grounded in the ambiguous notion of “reasonableness” demonstrates the very dangers of such a regime. As she writes,

[A] requirement to identify all authors and performers may unreasonably encumber the radio broadcast of a song, but distributed recordings of the song might more conveniently include the listing. This may be particularly true of digital media, where a mouse click can provide information even more extensive than that available on a printed page.²⁰⁹

Admittedly, with its relative dearth of spacing limitations, digital media makes it arguably reasonable (from a spacing point of view) to credit any number of authors and performers. But such a requirement can quickly become onerous. Consider a professor teaching a Russian history course who wants to screen excerpts from Alexander Sokurov’s acclaimed experimental drama Russian Ark, a ninety-six-minute film shot in just a single take one night at the Hermitage with a cast of more than two thousand actors and three orchestras.²¹⁰ Though such an action would likely constitute fair use, meaning the professor could engage in the use without the hassle of payment and permission and without fear of infringement liability, Ginsburg’s proposal would place the professor in jeopardy of a different kind of liability: failure to attribute.

Ginsburg’s proposal also lacks any fair use defense, a point emphasized when she explains that “the test of reasonableness in this context is not the same as for fair use. The question is not whether the use should be prevented or paid for, as it is when fair use is at issue, but whether the use, even if free, should acknowledge the user’s sources.”²¹¹ While such a move is welcome from an equity point of view—for all too long, copyright law has devalued the creative contributions of performers²¹²—it bodes less favorably for those making use of copyrighted works. It is hard enough to prevail on a fair use claim; but now users will have to contend with a whole other issue: liability exposure under an independent and separate cause of action depending on the reasonableness of their crediting practices.

In addition, entitling creators to an affirmative right of attribution could have a surprisingly adverse impact on the functioning of intellectual property licensing markets. For example, while they acknowledge the critical importance of crediting and recognition to authors (a position backed up by their own empirical experiments), Sprigman, Buccafusco, and Burns have suggested that the indisputable value that creators place in attribution should not automatically lead to legislative enactment of an affirmative attribution right.²¹³ As they caution, the operation of a default right of attribution, even if waivable, could result in significant inefficiencies in the licensing market. Most obviously, transaction costs would increase. But less obviously, the combined impact of the endowment and creator effects—which can cause irrational overvaluation of the intellectual property rights held by authors in their creative output—can make licensing transactions increasingly unlikely and more burdensome.

Grounded in the public interest and the efficient functioning of licensing markets, this argument warrants further examination and should give pause to any hasty enactment of attribution legislation. To understand why, an examination of the emerging literature in behavioral economics is in order. Specifically, in recent years, psychologists and economists have observed a phenomenon dubbed the “endowment effect,” wherein the subjective valuation an individual will give a particular object increases significantly when the individual possesses that object, even for a limited time.²¹⁴ As a consequence of this effect, individuals will “demand much more to give up an object than they are willing to spend to acquire it.”²¹⁵ Although not without its critics,²¹⁶ this result appears to subvert neoclassical economic theory, which assumes that an individual’s willingness to pay (“WTP”) for a good should equal the willingness to accept (“WTA”) compensation for the loss of the good. In a now-classic experiment, Kahneman, Knetsch, and Thaler found that randomly assigned buyers valued a particular mug at three dollars, on average.²¹⁷ By sharp contrast, randomly assigned owners of the very same mug required substantially more money (seven dollars, on average) to part with it.²¹⁸ In short, the owners’ loss in divesting themselves of the mug was valued at more than twice the buyers’ gain in acquiring the exact same mug. Thus, under the endowment effect, most people appear to require a much higher price to part with a product to which they hold a legal entitlement (that is, through possession or ownership) than they would pay to purchase the very same product.

As it turns out, the endowment effect can be especially pronounced and dangerous in matters dealing with intangible property such as copyright. The tendency toward overvaluing endowed goods is amplified when measurements of value are more subjective, and the lack of fungibility for creative works can exacerbate holdout problems and make completion of licensing deals more difficult. This leads to what James Surowiecki and others have billed as the “permission problem.”²¹⁹ And the impact is not merely the stifling of creative rights of scholars, critics, satirists, and others. Since the endowment effect raises the price otherwise demanded for access to a copyrighted work, “members of society do not enjoy the increased access to art that the copyright law is designed to provide.”²²⁰

It is at this point that Sprigman, Buccafusco, and Burns’s findings become particularly salient. They found that that endowment effect was particularly extreme when creators engage in transactions involving their own work. This so-called “creativity effect”—what Sprigman, Buccasufsco, and Burns refer to as the “the tendency of creators of goods to assign higher value to their works not only compared to would-be purchasers of the goods, but relative also to mere owners (that is, subjects who had not created but merely been given the works, as in previous studies)”²²¹—can badly “magnify the valuation anomalies associated with the endowment effect. The creativity effect drives creators’ WTA even further away from buyers’ WTP, and in doing so it makes deals over creative goods more difficult to reach.”²²² The data from Sprigman, Buccafusco, and Burns’s work therefore suggests that vesting an affirmative attribution right in creators could serve as a significant impediment on the licensing market and further complicate and stifle the ability of would-be licensees to reach deals for the use of creative content—a cost that impacts consumers of copyrighted works as well as the vast number of authors who draw upon preexisting content to create transformative works.²²³ As a result, they conclude that an affirmative attribution right would ultimately not serve the public weal and could have a disruptive effect on commerce.

But, perhaps most damningly, the biggest drawback against an independent claim for attribution under the Copyright Act is not whether it would make for good law but, rather, whether it would be feasible to pass such legislation in the first place. To illustrate this point, it is worth considering a few salient points about the history of copyright law in our country. It took almost 100 years for the United States to accede to the terms of the Berne Convention of 1886, which, since 1928 and per Article 6bis, requires member states to recognize a right of attribution.²²⁴ When the United States finally acceded to Berne in 1988, the House Report on its implementation concluded that a patchwork of existing laws in the United States already provided sufficient protection for attribution to meet Berne’s minimum standards.²²⁵ The availability of Lanham Act relief for reverse passing off in situations of misattribution was key to this conclusion.²²⁶ Nevertheless, Congress passed a narrow right of attribution under VARA shortly thereafter in 1990 which, as we have discussed, does not cover the vast majority of creative works and provides only scant protection. Furthermore, since Dastar, there has been no meaningful effort to undo its holding in Congress, making the path toward a legislative fix unlikely, at best. As this timeline illustrates, the odds of congressional intervention to add a broad attribution right to the Copyright Act—particularly given how constrained the attribution claim embedded in VARA ultimately became when it was finally passed in 1990—do not seem particularly good.

D. A Modest Proposal: Locating Attributive Use in Section 107

With this analysis in mind, we turn our attention to a modest proposal that I believe would not require legislation and, in fact, already reflects the jurisprudence on fair use: the recognition by courts of attributive use as an express subfactor in the application of the fair use defense to allegations of copyright infringement. This proposal advances the cause of attribution rights in an incremental, but significant, manner; provides flexibility for courts to adapt the concept to contexts and emerging technologies; and bolsters norms of crediting in a way that can lay the framework for future (and bolder) changes in the law.

Moreover, the proposal builds on the important work done by Pierre Leval with his article Toward a Fair Use Standard some three decades ago. Just as Leval argued that transformative use was already, and had good reason to be, playing an important role in fair use determinations, I argue the same with attributive use. In that spirit, as the title of this Article suggests, I advocate a move toward a new fair use standard. As our exegesis of the extant jurisprudence on fair use reveals, attributive use already has an implicit place in the fair use calculus. I argue that courts should lean into this reality and make attribution an explicit consideration in their factor one analysis on the purpose and character of the use. Just like transformative use, which advances the utilitarian aim of the copyright regime to promote progress (by enabling the creation of new work), attributive use serves a key role in the copyright regime by helping advance progress in the arts (by appealing to the incentivizing function of crediting). So, under this scheme, as part of their factor one analysis, future courts would consider: (1) whether a use is commercial; (2) whether a use is transformative; and (3) whether a use is attributive. In short, attributive use would take its place with commercial and transformative use as key factors in determining the purpose and character of a defendant’s unauthorized exploitation of someone’s copyrighted work.

Admittedly, leaving attribution rights to only function as an affirmative defense to infringement still leaves crediting as a tail, wagged by the infringement dog. But this solution avoids the numerous complications posed by either an affirmative attribution right in the Copyright Act or an undoing of the Dastar holding. Under such a proposal, works used with permission can continue to have exploitation governed by licensing terms that can call for proper attribution as appropriate and meaningful, thereby leaving existing crediting regimes in place and enabling further development of new ones. But for unlicensed works, an attributive use subfactor will provide significant encouragement of crediting while not requiring it in every instance and leaving some flexibility around the issue, so that courts can consider the context of a particular use to decide whether attribution is valuable, meaningful, or practicable under the circumstances.²²⁷ As a result, crediting will not become an absolute requirement, thereby addressing the significant concerns that would come from a broad attribution right. Meanwhile, for public domain works, there will be no concern about attribution because such works would not be subject to a fair use defense since their exploitation is, per se, noninfringing. As a result, the proposal averts rightful concern about erstwhile copyright holders using crediting requirements to achieve perpetual protection for works that fall into the public domain.

Moreover, to avoid making attribution overly onerous, the crediting at issue could be limited to legal authorship. As even Ginsburg admits, attribution requirements can be burdensome, potentially causing a problem that Ginsburg characterizes as the “most practical of all”: a regime that mandates “tiny print or endless film credits that no one will look at anyway.”²²⁸ To Ginsburg, criticisms about the potential burdens of crediting requirements are exaggerated. As she opines,

[D]ifficulties in determining whether a contributor at the fringes of a creative enterprise should be denominated an “author” or “co-author” should not obscure attribution claims where authorship is apparent. Moreover, where the creators are multiple, business practice may assist in identifying those entitled to authorship credit. That the resulting credits may not attract most readers’ or viewers’ attention does not warrant forgoing them altogether.²²⁹

But there may also be a simpler refutation to these objections. Specifically, as Ginsburg herself admits, “Our caselaw has enough trouble, in the joint works context, identifying who is an author.”²³⁰ This is certainly true but it is also worth noting that, as a result of this difficulty, courts have shown themselves extraordinarily loathe to recognize joint authorship. Indeed, numerous doctrines, such as the strict reading of the mutual intent requirement, have emerged from courts to avert recognition of joint authorship.²³¹ So, on a practical level, the problem of endless attribution seems quite solvable by considering crediting not of all creative contributors, but of the legal authors—a designation that courts have gone out of their way to make singular and, consequently, quite knowable (despite the many flaws in the way courts define legal authorship). In other words, given that courts already carefully circumscribe the notion of legal authorship in order to avoid the messiness of joint authorship and the accompanying headache it may cause in the fracturing of rights, attribution rights that are limited to recognition of legalauthorship are not quite as complex as objectors may suggest.

All told, this solution draws and expands upon, with some important alterations, a proposal once presented briefly by the late Greg Lastowka at the end of his article considering the (morbid) state of attribution rights post-Dastar.²³² After bemoaning the extant law’s lack of protection for crediting, Lastowka proposed a corrective step: congressional amendment of section 107 to incorporate attribution as an explicit fifth factor in the fair use analysis.²³³ I tweak Lastowka’s proposal for two reasons. First, the addition of an express fifth factor would require legislative amendment, making change less likely (as I have documented with the difficulty in passing any affirmative attribution right in the Copyright Act). Indeed, as the influence of Leval’s 1990 article has suggested, change through the common law is both swifter and more likely. Leval, of course, achieved a dramatic change in the way courts have approached the fair use analysis in the past three decades by emphasizing the importance of a factor that had received scant explicit consideration before: transformation. Secondly, analytically speaking, I argue that attribution already resides in the existing four factors without the need to add a fifth. Most significantly, as I shall detail, courts have both explicitly and implicitly considered attribution in the fair use calculus in the past, often as part of assessing the purpose and character of the use (factor one). Building on the occasional, but unpredictable, judicial solicitude to attribution as a part of the fair use balancing test, I argue that, normatively, such a move makes a great deal of sense.

1. Attributive Use and the Existing Fair Use Calculus

As Lastowka argued, courts have occasionally drawn on attribution as a factor in the fair use calculus. But, as he cautioned,

[w]hat these cases demonstrate is not that attribution is regularly considered by courts as a factor in the fair use analysis. This is most certainly not the case. The cases merely illustrate that in certain cases, plaintiffs and defendants have been successful in persuading courts to incorporate evidence about attribution into a fair use analysis.²³⁴

Lastowka may have understated matters, however. Indeed, a careful exegesis of the relevant jurisprudence—including noted decisions from the two circuits (the Second and the Ninth) that most prominently opine on copyright law, as well as consideration of the broad attributive practices in clearance norms—strongly suggests that attribution is already a guiding factor in the fair use calculus and, either explicitly or implicitly, is playing a (rightful) role in fair use determinations. As such, the proposal advanced here calls for overt recognition of attribution as a key subfactor in how courts weigh the purpose and character of a use.

The fair use doctrine finds its origins in Justice Joseph Story’s influential 1841 opinion in Folsom v. Marsh.²³⁵ Eventually codified in section 107 of the 1976 Copyright Act, fair use typically involves the weighing of a four-part balancing test to determine whether an unauthorized use of a copyrighted work is excused from infringement liability. These factors include:

(1) the purpose and character of the use, including whether such use is of a commercial nature . . . ; (2) the nature of the copyrighted work; (3) the amount and substantiality of the portion used in relation to the copyrighted work as a whole; and (4) the effect of the use upon the potential market for or value of the copyrighted work.²³⁶

However, with its use of open-ended language, the text of section 107 suggests that the four listed factors are not exhaustive of the considerations a court may undertake.²³⁷ As a result, courts have always had the freedom to introduce other relevant factors to their fair use analysis. Indeed, some have accepted the invitation, including many that have made attribution and crediting practices a consideration.

2. The Role of Attribution in the First, Fourth, and “Fifth” Fair Use Factors

In Haberman v. Hustler Magazine, Inc., for example, a Massachusetts district court drew on “equitable considerations” as a fifth factor and found that the defendant’s attribution practices supported a fair use defense against infringement claims for unauthorized reproduction of two fine art photographs in a magazine.²³⁸ Specifically, the defendant’s fair use claim was substantially aided by the fact that it made “no effort . . . to palm [the photos] off as anything other than [the photographer’s] creations.”²³⁹ Thus, in some cases, attribution can and has become a part of the fair use calculus through an unofficial “fifth factor.”

That said, courts do not necessarily have to resort to the introduction of a fifth factor to make room for crediting. In fact, numerous decisions have integrated attributive use into their analysis of the existing four factors.²⁴⁰ This body of case law suggests that attribution already has a place (and voice) in the existing four fair use factors—particularly the first (“purchase and character of the use”) and fourth (“market harm”).

Melville and David Nimmer, for example, have argued that attribution can and should be a proper consideration in the first factor,²⁴¹ as it speaks to nature and character of the use being made by a defendant. Numerous courts have subscribed to this view, whose application is illustrated in Williamson v. Pearson Education, Inc.²⁴² In the suit, Pearson Education offered up a fair use defense to infringement claims stemming from its publication of a book featuring unauthorized quotation of a number of passages from a prior work on General Patton’s leadership principles. Drawing on Harper & Row, Publishers, Inc. v. Nation Enterprises’s instruction to consider the “propriety of the defendant’s conduct” as part of the nature and character of a defendant’s use of a copyrighted work, the Court found that the first fair use factor favored defendants because, among other things, they were “not attempting to pass [the] fact-gathering off as their own. Rather they are crediting [the plaintiff] as the source of the factual information that defendants use to construct some of the arguments in their book.”²⁴³ Similarly, in Rubin v. Brooks/Cole Publishing Co., a court identified the propriety of the defendant’s conduct as one of three subfactors under the “purpose and character of the use” consideration and found this factor favored the defendant since the defendant had “credited [the plaintiff] clearly and favorably in the text.”²⁴⁴ In another infringement case—one involving unauthorized use of a portion of a report about a hydropower facility—a federal court deemed that the defendants’ acknowledgement of the source of the original work helped the first fair use factor “weigh[] heavily in favor of a finding of fair use.”²⁴⁵

It is not just the first factor that makes room for attribution. While “good faith” is the most common doctrinal vehicle through which crediting finds a voice in the first factor, economic factors form the doctrinal vehicle through which crediting finds a voice in the fourth factor. Meanwhile, the market harm factor also leaves room for consideration of attribution. Specifically, as Rebecca Tushnet has argued in the context of fan fiction, giving attribution attenuates the possibility of market harm.²⁴⁶ As she reasons, “Correct attribution helps prevent confusion and preserves the market for the official product and bears an indirect relation to the fourth fair use factor.”²⁴⁷ Tushnet’s view is not merely aspirational; it is also already reflected in some cases. In Richard Feiner & Co. v. H.R. Industries, Inc., for instance, a New York federal district court declined to grant a fair use defense to The Hollywood Reporter/HRI for its unauthorized use of a photograph of Laurel and Hardy in a feature spread on special effects and stunts.²⁴⁸ The failure to attribute the photograph to its author played an important role in the court’s calculus. “HRI’s use of the photograph without attribution to Feiner represents to the world that the photograph is in public domain,” the court concluded, “thus potentially impairing Feiner’s future revenue both in income and in the costs of protecting its rights.”²⁴⁹ Based on this logic, the court found significant market harm in HRI’s actions and found the fourth fair use factor militated against the defendant. Indeed, the Feiner case stands in contrast to Nuñez v. Caribbean International News Corp., in which a different member of the media—a Puerto Rican newspaper named El Vocero—also published an article making unauthorized use of a photograph—an image from the modeling portfolio of the former Miss Puerto Rico Universe 1997.²⁵⁰ In Nuñez, crediting played a role in the fair use analysis under both factors one and four. On the first factor, the court considered good faith, which favored El Vocero since it had “attributed the photographs to Núñez.”²⁵¹ On the fourth factor, the court pointed to the fact that “the only discernible effect of the publication in El Vocero was to increase demand for the photograph”²⁵²—a consequence doubtlessly buttressed by the credit that El Vocero provided to Nuñez, which enabled future licensees to know whom to approach for permissions to use the image.

3. Harper & Row’s Good Faith Admonition

With all of this said, it is important to acknowledge that the consideration of “good faith,” which courts have often used to raise attributive concerns, has come under fire in recent years. On the surface, this might suggest increasing judicial resistance to the factoring of crediting in the fair use calculus. But a closer examination of this trend says otherwise.

The clearest expression of the invitation to consider good faith in fair use determinations (and the basis upon which some courts have invoked attribution) came in Harper & Row, in which the Supreme Court dictated, in seemingly absolutist terms, that “fair use presupposes ‘good faith.’ ”²⁵³ Despite the blusterous verbiage, the exhortation remained inchoate, as the case gave little guidance as to just constituted “good faith.” In that particular instance, the Court was referring to how The Nation’s knowing exploitation of a purloined manuscript that remained unpublished demonstrated bad faith because the magazine had usurped the copyright holder’s valuable commercial rights of first publication. The Court then grafted this assessment of propriety to its consideration of the first, second and fourth fair use factors in finding that The Nation’s actions had no refuge in the defense.²⁵⁴ Notably, the matter of attribution (as a form of good faith or otherwise) had nothing whatsoever to do with that case.

In the intervening years, the Supreme Court has walked back on Harper & Row’s language about good faith. In both of its two most recent fair use pronouncements—Campbell in 1994 and Oracle in 2021—the Court has expressly downplayed the consideration. In Campbell, the Supreme Court acknowledged a split in persuasive authorities as to whether fair use must necessarily presuppose good faith.²⁵⁵ In Oracle’s dicta, the Court cited to Leval’s work to express its skepticism as to whether good faith should ever be a factor in fair use determinations.²⁵⁶Ultimately, however, the Oracle Court eschewed taking a definitive position on the issue, leaving the weight (if any) given to good faith squarely to lower courts to determine. As the Court mused, “We have no occasion here to say whether good faith is as a general matter a helpful inquiry.”²⁵⁷ Nevertheless, it would not be stretch for lower courts to view the commentary in Campbell and Oracle as dampening enthusiasm for further use of a general good faith consideration in the fair use calculus.

The evolving concern about consideration of “good faith” as dictated in Harper & Row is compelling—at least in the manner in which the Harper & Row, Campbell, and Oracle Courts used the term, where they considered if an alleged infringer had engaged in related wrongdoing, such as exploiting a purloined manuscript or proceeding with making use of a work despite being denied permission.²⁵⁸ After all, if someone asks for, and is denied, permission but proceeds with a use anyway, it is worth considering whether that demonstrates good faith or bad faith, or does not necessarily suggest anything at all. As Elina Lae points out, courts have gone both ways on this issue²⁵⁹—a fact that speaks to the unworkability of the factor.²⁶⁰ Or if a defendant genuinely believes that their actions constitute fair use and does not ask for permission, it is hard to understand why that factor should be held against them in the very determination of whether something is fair use.²⁶¹ Indeed, while conceding the potential the utility of a good faith consideration in the fair use calculus,²⁶² the Ninth Circuit long ago pointed out that “to consider [a defendant] blameworthy because he asked permission [and went ahead with the use after not receiving it] would penalize him for this modest show of consideration. Even though such gestures are predictably futile, we refuse to discourage them.”²⁶³

More pointedly, reading the Copyright Act holistically, the introduction of considerations of propriety into the fair use calculus would appear unbalanced, particularly when one acknowledges that courts do not typically weigh such factors in determining putative rightsholders’ entitlement to copyright protection. As Leval has forcefully argued,

Copyright seeks to maximize the creation and publication of socially useful material. Copyright is not a privilege reserved for the well-behaved. Copyright protection is not withheld from authors who lie, cheat, or steal to obtain their information. If they have stolen information, they may be prosecuted or sued civilly, but this has no bearing on the applicability of the copyright. Copyright is not a reward for goodness but a protection for the profits of activity that is useful to the public education. The same considerations govern fair use.²⁶⁴

In the end, therefore, when viewing fair use as an integral part of the utilitarian goal of our copyright regime, the ultimate focus on progress in the arts would appear to preclude consideration of the subjective mental state of a defendant or a defendant’s general intentions.

But, quite critically for our purposes, the discussion about good faith in Harper & Row, Campbell, and Oracle has never once touched on the issue of attribution. And although both Campbell and Oracle cited to Leval’s admonition to decouple fair use from good behavior,²⁶⁵ his entreaty also had nothing to with attributive practices. Indeed, as Leval reasons, fair use should be focused on progress in the arts. As we have detailed, that is precisely the reason why attribution should be considered, while other good faith factors, such as a defendant’s state of mind or general intentions, should not. Thus, despite the concern about good faith expressed in Campbell and Oracle and by Leval, these contemplations do not speak to the issue of crediting. And, if anything, the case law’s continued emphasis on promoting a vision of fair use dominated by the utilitarian goals of the copyright regime favors the consideration of attribution (though perhaps not general good faith) in the fair use calculus.

4. The Existing Role of Attribution in Second and Ninth Circuit Jurisprudence

All the while, attribution has and continues to play a role in the existing body of fair use jurisprudence in the influential Second and Ninth Circuits—from which the plurality, if not majority, of copyright caselaw stems.²⁶⁶ While these decisions that explicitly invoke attribution do not take the next step of formally mandating a role for crediting in the enumerated factors, when read together they suggest that such a role—much like the transformative use element identified by Leval in Toward a Fair Use Standard—would be consistent with the existing fair use rubric. Indeed, if district courts in these circuits hewed more closely to this extant jurisprudence, attribution would enjoy more overt recognition as a fair use factor.

The precedent of the Second Circuit is replete with cases where attribution has played a key, if not decisive, role in the fair use calculus. Weissmann v. Freeman²⁶⁷ is a particularly instructive case because it stemmed from an academic dispute where credits and notoriety, rather than significant profits, were at stake.²⁶⁸ In the suit, a professor at the Albert Einstein School of Medicine, Heidi S. Weissmann, took legal action against her colleague and erstwhile collaborator, Leonard Freeman, for copyright infringement, claiming that Freeman had unlawfully reproduced fifty copies of one of her articles for use in a special review course he was giving at the Mount Sinai School of Medicine. Ordinarily, such a relatively de minimis and educational use of a work would not give rise to infringement litigation. But one key fact made this case atypical: Freeman had deleted Weissmann’s name from the paper and, after adding three additional words to the title, put his own name on it.²⁶⁹ Although Weissmann might have pursued a claim for reverse passing off (albeit with some potential hurdles²⁷⁰), she focused on a copyright infringement claim.²⁷¹ Though the lower court had held Freeman’s use was a fair one, the Second Circuit reversed.

Despite claims that Freeman’s use was noncommercial and for academic purposes, his failure to credit Weissmann (and his substitution of his own name as the author) proved outcome determinative in the Second Circuit, and the court cited the crediting issue in its analysis on the first, second, and fourth fair use factors. On the first factor, the court determined that, in wresting credit from Weissmann, Freeman’s motivations were commercial in nature as he profited from the use, which enhanced his professional reputation. As the court noted, “Particularly in an academic setting, profit is ill-measured in dollars. Instead, what is valuable is recognition because it so often influences professional advancement and academic tenure.”²⁷² On the second factor, the court noted the importance of weighing the incentives for continued creation of scholarly work and thereby linked attribution and progress in the arts. As it reasoned, crediting provided the likes of Weissmann “with an incentive to continue research—an endeavor that, if successful, would and has led to professional and monetary benefits [and that courts] need to uphold those incentives necessary to the creation of works such as [the article].”²⁷³ Finally, on the fourth factor, the court saw clear market harm because “[i]n scholarly circles such as, for example, the small community of nuclear medicine specialists involved in this suit, recognition of one’s scientific achievements is a vital part of one’s professional life.”²⁷⁴

The fact that Dr. Freeman’s planned use of [the article] was for the same intrinsic purpose as that intended by Dr. Weissmann not only undermines Dr. Weissmann’s ability to enjoy the fruits of her labor, but also creates a distinct disincentive for her to continue to research and publish in the field of nuclear medicine.

Id. In short, the issue of crediting permeated the entire fair use analysis in Weissmann. In the end, the court also emphasized that “[n]o case was cited—and [they] found none—that sustained [a fair use] defense under circumstances where copying involved total deletion of the original author’s name and substitution of the copier’s.”²⁷⁵ This observation alone suggests the implicit role of attribution (and the disfavor with which misattribution is viewed) in the extant fair use jurisprudence.

Rogers v. Koons,²⁷⁶ one of the most well-known copyright decisions from the Second Circuit, also emphasizes the role of attribution in fair use determinations. The Rogers controversy started when prominent modern artist Jeff Koons found inspiration in a cheap tourist postcard featuring Art Rogers’s Puppies, a photograph of a couple and their dogs posing in Rockwellian tranquility.²⁷⁷ Koons appropriated the depiction without Rogers’s permission and accentuated various elements to create a sculptural work, String of Puppies, that satirized suburban American aesthetic sensibilities.²⁷⁸

saw sentimentality, inanity and kitsch. When he blew up the image to larger than life size, stuck daisies in the hair of the sickly sweet smiling couple (the flowers were not in the photograph) and painted the finished ceramic, the sculpture acquired a horrific quality quite distinct from the original.

Martin Garbus, Book Review, Lolita and the Lawyers, N.Y. Times, Sept. 26, 1999 (§ 7), at 35. Rogers, unamused by Koons actions, sued for copyright infringement and Koons claimed fair use. The district court rejected Koons’s defense, holding that, among other things, his activities were not sufficiently transformative because they did not criticize or comment upon Rogers’s original photograph.²⁷⁹ On appeal, the Second Circuit affirmed, noting that, “though the satire need not be only of the copied work and may . . . also be a parody of modern society, the copied work must be, at least in part, an object of the parody, otherwise there would be no need to conjure up the original work.”²⁸⁰ Because Koons purportedly did not “need” the original work to make his expressive point, there could be no fair use.²⁸¹ The result of the case was a debacle for Koons. In addition to damages, he was ordered to pay the plaintiff’s attorneys’ fees.²⁸²

To critics of the decision—which came down before Campbell, the trumpeting of Leval’s Toward a Fair Use Standard, and its advocacy for the primacy of an expansive notion of transformative use²⁸³—the court gave short shrift to the transformative use that Koons had arguably made of Rogers’s original work.²⁸⁴ Indeed, one could argue that the Second Circuit implicitly reversed itself in its next case involving Jeff Koons and appropriationist art, in which it reached an exact opposite conclusion.²⁸⁵

That said, the absence of proper crediting served as a critical factor in justifying the court’s rejection of Koons’s fair use defense. Specifically, in considering the first factor—the purpose and character of the use—the court drew on notions of attribution in two separate ways to weigh against a finding of fair use. First, employing the “good faith” criteria, the court pointed to Koons’s decision to remove a copyright notice from one of Rogers’s notecards that he sent to Italian artisans when he commissioned fabrication of his statue based on the Rogers photograph. As the court concluded, this action “suggests bad faith in defendant’s use of plaintiff’s work, and militates against a finding of fair use.”²⁸⁶ Second, the court concluded that the failure to credit supported its finding that Koons’s work was not commenting sufficiently on the original work to qualify for the heightened protection usually given to parody. As the court argued, Koons’s use failed to make the public aware of the original work and the resulting lack of proper crediting undermined a key reason why certain types of commentary—attributive ones—get fair use protection. “If an infringement of copyrightable expression could be justified as fair use solely on the basis of the infringer’s claim to a higher or different artistic use—without insuring public awareness of the original work—there would be no practicable boundary to the fair use defense,” noted the court.²⁸⁷

Koons’ claim that his infringement of Rogers’ work is fair use solely because he is acting within an artistic tradition of commenting upon the commonplace thus cannot be accepted. The rule’s function is to insure that credit is given where credit is due. By requiring that the copied work be an object of the parody, we merely insist that the audience be aware that underlying the parody there is an original and separate expression, attributable to a different artist. This awareness may come from the fact that the copied work is publicly known or because its existence is in some manner acknowledged by the parodist in connection with the parody.²⁸⁸

Courts within the Second Circuit continue to pick up this language from Rogers to emphasize the importance of crediting in the fair use calculus. For example, in a recent published decision, the Southern District of New York drew upon Rogers’s bad faith analysis to find this subfactor within the “purpose and character of use” weighed in favor of the plaintiff when the defendant removed the copyright mark, thereby robbing the use of attribution.²⁸⁹ All the while, at least one Second Circuit decision, NXIVM Corp. v. Ross Institute, has mandated that good faith be considered by any court conducting a fair use analysis.²⁹⁰

The Ninth Circuit has also cited the failure to provide authorial attribution as an important factor weighing against fair use. In Marcus v. Rowley, the court hewed to Harper & Row’s good faith edict in making attribution a part of the first fair use factor.²⁹¹ Thus, the defendant’s failure to provide credit weighed against a finding of fair use.²⁹² As in Weissmann, the complete absence of any pecuniary harm and the use of the work in an academic setting did not trump the failure to credit.²⁹³ Meanwhile, in Narell v. Freeman, the author of a best-selling novel, Cynthia Freeman, was caught having lifted, word-for-word, portions of her work from a previously published book by Irena Narell about the history of Jewish migration to San Francisco.²⁹⁴ Although the Ninth Circuit ultimately found that the misappropriated sections were largely factual in nature and not protectible expression, it nevertheless addressed the issue of fair use. And while it ultimately found Freeman’s actions protected by the fair use doctrine, it held that the first factor “weigh[ed] heavily against Freeman because she did not acknowledge in her work that she had consulted Our City in writing Illusions.”²⁹⁵

The Narell court cautioned that acknowledgment would not, by itself, excuse infringement.²⁹⁶ And that has certainly proven true in other cases. For example, when a luxury fashion designer pushed back against claims that it had infringed a photograph of a model wearing its clothing, it advanced its fair use defense by pointing out that it had credited the photographer. The court demurred, noting that

Defendant has not pointed to any precedent supporting its theories that because Defendant credited the photographer in the caption of the Photograph or because Plaintiff hired the model to wear Defendant’s clothing, Defendant has a right to use the Photograph. Simply put, attribution is not a defense against copyright infringement.²⁹⁷

Even there, however, the court reaffirmed the value of attribution in the fair use calculus by citing to Narell favorably and noting that, while “acknowledgement does not itself excuse infringement,” a “finding [of] failure to properly attribute copyrighted material weighs against fair use.”²⁹⁸

5. The Implicit Role of Attribution in Transpurposive Use Cases

Beyond the case law from the Second and Ninth Circuits explicitly embracing attribution as a factor, there are instances where crediting has played an implicit role in fair use determinations. This is particularly true in a body of recent cases involving technology-related transpurposive uses excused by courts under the defense. So, for example, in Kelly v. Arriba Soft Corp., the Ninth Circuit blessed the unauthorized creation of thumbnail versions of online images to facilitate search engine functionality.²⁹⁹ This judgment was famously reaffirmed in Perfect 10, Inc. v. Amazon.com, Inc., when the Ninth Circuit found that Google’s creation of thumbnails for its search engine was also protected.³⁰⁰

The Kelly suit resulted from Arriba’s practice of crawling websites and indexing them by creating thumbnail versions of images that it found on those sites.³⁰¹ Arriba would then provide these thumbnails to its users, as relevant, in results on its search engine.³⁰² Photographer Leslie Kelly, a chronicler of the American West in her images, objected and sued for infringement. The Ninth Circuit, however, held that Arriba’s activities constituted fair use and the attributive nature of Arriba’s use helped propel both the first and fourth factors.

On the first factor, the court placed great weight on the fact that the use of the photographs was both transformative and attributive. Unlike the original images, which were intended to inform and provide visual enjoyment, Arriba’s thumbnails served no aesthetic purpose and, instead, functioned solely as “a tool to help index and improve access to images on the internet and their related web sites.”³⁰³ Here, the attributive characteristics of Arriba’s thumbnails become important, as they were instrumental to the transformative functionality they embodied. As the court noted, clicking on thumbnails would produce an “Images Attributes” page that would provide a link back to the original page, which was the photographer’s home page, where sourcing information and attribution would naturally follow.³⁰⁴ If the thumbnails did not produce sourcing and attribution, they would not serve a true indexing purpose and, consequently, they would not promote the locational function of search engines.³⁰⁵

As for the fourth fair use factor, the court found no market harm precisely because of the link-back attribution feature: “By showing the thumbnails on its results page when users entered terms related to Kelly’s images, the search engine would guide users to Kelly’s web site rather than away from it.”³⁰⁶ Steering users to the home webpage for the original source of the photographs would then provide them with crediting and attribution information and, if they were interested, the ability to license. Although the Kelly court never invoked the language of crediting, attribution played an implicit role in permitting its creation and use of thumbnails under the fair use doctrine.³⁰⁷ If the thumbnails were created without an ability to link back to the original source (where attribution would be given), they would be neither “transpurposive”³⁰⁸ nor without market harm.

Similarly, when evaluating whether Google’s creation of cached copies of websites and the images of them for its search engine would be excused from infringement liability, the court in Field v. Google Inc., drew on prior Ninth Circuit authority³⁰⁹ and the open-ended language of the Copyright Act³¹⁰ to find authority to weigh as a fifth factor in the fair use calculus “a comparison of the equities.”³¹¹ Notably, the equities and good faith that mattered to the court revolved around Google’s decision to provide a link back to cached websites—a key mechanism for digital attribution.

6. Attribution and Copyright Clearance Norms

Finally, the copyright norms we hold most dear as a society also support attribution as a mitigating factor in practices that might otherwise constitute infringement. For example, although the practice of quotation literally reproduces and distributes copyrighted material without permission, notwithstanding the antics of the James Joyce Estate,³¹² it is universally viewed as per se fair use. Indeed, the decision that gave rise to the fair use defense in the first place, Folsom v. Marsh, announced the obviousness of this proposition when Justice Story wrote that “no one can doubt that a reviewer may fairly cite largely from the original work, if his design be really and truly to use the passages for the purposes of fair and reasonable criticism.”³¹³Quotation inherently involves attribution. The use of inverted commas around a sentence directly indicates that the sentence’s origins lie with a third party and not with the author of the text that one is reading. Critically, those inverted commas are then accompanied by some form of attribution. Admittedly, the attribution is part of the transformative function of quoting. After all, criticism or commentary about someone else’s work cannot occur unless that other work is identified. But the equitable nature of a quotation—giving attribution—distinguishes it from the act of taking someone else’s words without permission in the exact same way but without attribution—an act that can amount to infringement. Or, to put it in the form of a Zen-like koan:

Q: What is a quotation without quotation marks?

A: An infringement.

When using portions of someone else’s work, a key differentiator between infringement and fair use is the act of attribution.

Common beliefs about copyright law reflect our mores (even when they do not entirely match up with the law), and these popular notions have also long viewed attribution as a factor in what actions should or should not constitute infringement. As any copyright practitioner knows, popular misconceptions about fair use abound.³¹⁴ Members of the public will often cite a particular bright-line threshold of use (no more than five percent of a work, for example) as providing insulation from infringement liability; they will claim that, so long as one does not profit from unauthorized exploitation of a copyrighted work, it constitutes fair use; or they will contend that any educational use automatically constitutes fair use. Of course, each of these absolute statements is wrong. But the beliefs are all based on a kernel of truth: the less one uses of a work, the less one profits from an unauthorized exploitation, and the more academic a use, the more likely it is to constitute fair use. The same dynamic is at play with another popular myth: that giving proper credit can insulate you from infringement liability for the unauthorized exploitation of someone else’s work.³¹⁵ While this notion is obviously incorrect, it is not entirely without basis. Although it may be infringing just the same, a credited use is indubitably more fair than an uncredited use, as the former better complies with societal norms against plagiarism and advances authorial interests in attribution. As such, the former is more likely (even if marginally so) than the latter to enjoy a viable fair use defense.

For example, consider a recent controversy at the intersection of intellectual property and cultural appropriation. In 2021, Addison Rae Easterling, a popular social media influencer, appeared on The Tonight Show to “teach” Jimmy Fallon eight of the most popular dances trending on TikTok and other social media sites. Although the dances can enjoy copyright protection as choreographic works,³¹⁶ the routines were not licensed from their creators, and, to make matters worse, Easterling and Fallon originally failed to give any credit at all to the originators of the dances. As Kamilah Moore, an entertainment attorney and the chairperson of the California Assembly’s Reparations Task Force, has noted, this misstep was particularly pernicious because of its racial valence: it “received major backlash” because white TikTok superstars “have often gained notoriety and received millions of views by parroting dance routines primarily created by Black creators and other creators of color” while failing to provide “proper attribution in the marketplace.”³¹⁷ Easterling and Fallon’s unauthorized recreation of the dances could well constitute fair use, but there is a vast difference between an unlicensed use that is also uncredited versus one that is at least attributive. Recognizing this and responding to public pressure, Fallon subsequently attempted to make amends by hosting the dance creators in a later broadcast of his show.³¹⁸ Given the importance of correcting for persistent gaps in crediting that troublingly fall along racial, gender, and socioeconomic lines as well acknowledging the economic value of proper attribution,³¹⁹ a fair use regime that incorporates attribution as a factor would incentivize users of intellectual property to at least recognize the cultural influencers from whom they draw.

As this survey of the fair use landscape and extant jurisprudence has shown, acknowledged or not, crediting is already an implicit fair use factor. The only remaining question is to what degree it is or should be. That query provides rife fodder for future scholarship, as the weight accorded to attributive use, just like the other fair use factors in section 107, would be something left firmly to the discretion of judges who can consider the particular context in an adaptive and holistic manner.

CONCLUSION

The Supreme Court decided Dastar almost twenty years ago. Since that time, the sky has admittedly not fallen.³²⁰ As such, it would be hyperbole to suggest there is some kind of pressing emergency to address the absence of attribution rights. Nevertheless, as we have detailed, the lack of substantive legal protection for crediting in the post-Dastar era has impoverished the responsiveness of our intellectual property regime to the interests and motivations of authors. With this in mind, change through legislation would face difficult odds. After all, it took almost a century for the United States to accede to the Berne Convention and, when it did, that only sparked the most minimal of efforts to comply with Berne’s attribution requirements through the relatively limited scope of VARA. The most significant legislative efforts on attribution in the past generation came through the falsification, removal, and alteration of CMI provisions passed as part of the DMCA. But even there, facilitation of infringement, rather than vindication of crediting interests, drove the statute—a fact made clearly by the claims’ onerous double-scienter requirements. As such, it would be disingenuous to fail to recognize that the prospects for legislative action for broader crediting protection are dim, at best. Moreover, there are good reasons to doubt whether the benefits of providing affirmative attribution rights either by overruling Dastar or by providing an independent cause of action for crediting under the Copyright Act would outweigh the risks. Instead, change can come in another, more incremental fashion—one that averts key dangers of an affirmative cause of action for crediting, has grounding in the existing jurisprudence, and could occur without legislation action.

A generation ago, Pierre Leval’s Toward a Fair Use Standard forced a first major change in fair use jurisprudence. I argue for a second. Like the implementation of transformative use, attributive use fits comfortably into the existing jurisprudence on fair use, speaks to the purpose and character of the use and even market harm factors, and ultimately supports the underlying purpose of the overall copyright regime: promotion of progress in the arts. Since fair use only represents a defense to infringement, such an admittedly restrained approach still tethers attribution to infringement and, as such, it does not represent a complete victory for the independent value of crediting. Nevertheless, as the success of Leval’s appeal to transformative use shows, such an entreaty for change can have an impact. Moreover, if implemented, recognition of attributive use would promote further development of a culture of crediting, a particularly important move at the dawn of the digital age, when all individuals make daily (infringing) use of copyrighted works thousands of times per day.³²¹ Ultimately, therefore, the attributive use doctrine could not only enable fair use to operate in a way that is more consistent with the utilitarian design of our copyright regime, but it can also help build support for further recognition of attribution rights in the future.

96 S. Cal. L. Rev. 1

Download

John Tehranian*

* A.B. Harvard, J.D. Yale Law School. Paul W. Wildman Chair and Professor of Law, Southwestern Law School; Visiting Professor of Law, University of California, Los Angeles (“UCLA”) School of Law.

The Invention of Antitrust

April 2023April 2023Herbert Hovenkamp*

The long Progressive Era, from 1900 to 1930, was the Golden Age of antitrust theory, if not of enforcement. During that period courts and Progressive scholars developed nearly all of the tools that we use to this day to assess anticompetitive practices under the federal antitrust laws. In a very real sense, we can say that this group of people invented antitrust law.

The principal contributions the Progressives made to antitrust policy were (1) partial equilibrium analysis, which became the basis for concerns about economic concentration, the distinction between short- and long-run analysis, and later provided the foundation for the development of the antitrust “relevant market”; (2) the classification of costs into fixed and variable, with the emergent belief that industries with high fixed costs were more problematic; (3) the development of the concept of entry barriers, contrary to a long classical tradition of assuming that entry is easy and quick; (4) the distinction between horizontal and vertical relationships and the emergence of vertical integration as a competition problem; and (5) price discrimination as a practice that could sometimes have competitive consequences. Finally, at the end of this period came (6) theories of imperfect competition, including the rediscovery of oligopoly theory and the rise of product differentiation as relevant to antitrust policy making.

Subsequent to 1930, antitrust policy veered sharply to the left. Then, two decades later it turned just as sharply to the right. Eventually it moderated, reaching a point that is not all that far away from the Progressives’ original vision.

INTRODUCTION

The long American Progressive Era to the New Deal, roughly 1900 into the early 1930s, was the formative age of antitrust policy.¹ During this period a diverse group of policy makers developed nearly all of the analytic tools that antitrust law uses today to evaluate business practices or market structures thought to be anticompetitive. For all intents and purposes, they invented antitrust law. In fact, after decades of experimentation we are reclaiming much of it. The extraordinary Progressive influence on antitrust policy was at least partly a historical coincidence. The passage of the Sherman and Clayton Acts and the development of techniques for evaluating practices tracked extraordinary developments in technology as well as social and economic thought. Antitrust policy would have looked very different had it developed a half century earlier.

The Progressive Era antitrust movement was both political and economic. It reflected the emergence of new interest groups as well as new sources of economic concern and theoretical developments. The emergent interest groups were large multistate business, the trade association movement dominated by small business,² consumers, and labor. The new sources of concern were industrialization, the rise of modern distribution, the labor movement, and the increasing importance of consumers as market participants. The new theoretical developments were the rise of marginalist economics and industrial organization theory, which provided competition analysts with a set of tools like none they had before.

The legislative debate leading up to the Sherman Act can hardly be characterized as a dispute about economic theory. That came later as litigants and courts looked for tools that would enable them to assess practices in a coherent way. Consistent with the economic-focused language of the Sherman Act itself, the tools that emerged were mainly economic, although they were applied by non-economist lawyers and judges. The record of their engagement with the law is impressive; judges routinely used them even if they were not aware of their economic origins or technical meaning. Nearly all of these developments placed antitrust theory on an expansion course that prevailed until the reaction against the New Deal found a voice in the neoliberalism of the 1940s, particularly as expressed by the Chicago School. Even so, the neoliberal revolution adopted most of these tools, although it modified some of them and rejected a few.

The Progressives are occasionally caricatured as people who really did not care about costs and productivity but were concerned exclusively about bigness as such. That could not be further from the truth. By and large the Progressives appreciated the fact that the trusts had lower costs than smaller firms and did not want to punish them for that. In fact, they were fairly obsessed with efficiency and cutting of costs.³ That obsession extended even to Louis Brandeis, a strong proponent of business efficiency even as he railed at large firms.⁴ He campaigned for “Taylorism,” or scientific management, as a way of limiting price increases.⁵ One antinomy in Brandeis’s work was his persistent failure to acknowledge the relationship between greater efficiency and larger size, even though contemporary economists clearly did.⁶

The numerous and varied participants in the Chicago Conference on Trusts, discussed below, favored lower costs and were also concerned about higher prices.⁷ They worried that exclusionary practices might be a vehicle for achieving them and making market dominance permanent.

I. THE CHICAGO CONFERENCE ON TRUSTS

The Progressive Era was heavily preoccupied with the rise of larger firms, or the “trust” problem. The initial reaction was an eclectic range of views about what to do about them, or whether to do anything at all. The gigantic 1899 Chicago Conference on Trusts, hosted by the Civic Federation of Chicago, is an exceptional window into the contemporary mindset because it reflected this diversity of views. Its personnel and proceedings, which were published in 1900, represented every interest group that had a stake in policy about the trusts. Some participants were invited by the conference managers, while others were invited by the governors of individual states.⁸ The speakers included politicians, economists, lawyers, social scientists and statisticians, industrialists, labor union leaders, insurance company representatives, and even clergy.⁹

This diverse group identified a number of phenomena that explained the rise of the trusts and that either justified or damned them. Some argued that the trusts were entirely the consequence of economies of scale or scope and as such were an engine of economic progress that should be left alone.¹⁰ Others argued that potential competition and new entry would always be present to discipline monopoly pricing, thus mitigating any concerns.¹¹ Many others saw the trusts as harmful and blamed their rise on deficiencies in state corporate law. They debated about a national corporation act as a potential solution.¹² Others both blamed and defended tariffs¹³ or unethical business actors.¹⁴

Within this amalgamation of concerns the Sherman Act itself was hardly dominant. In fact, it played a surprisingly small part, and the speeches tended to emphasize its deficiencies more than its strengths. Henry Rand Hatfield’s well-known contemporary account of the Chicago Trust Conference is very likely responsible for the view that the economists who spoke were nearly all opposed to the Sherman Act.¹⁵ A fair reading of the proceedings suggests two quite different splits. First was the division of those who thought that the trusts were efficient and harmless from those who regarded them as threatening. Contrary to Hatfield’s view, a clear majority believed that the trusts presented a serious problem. Second was the question of the best legal tools for confronting them. Here, Hatfield’s point has more traction. As correctives, corporate law and tariff reform were at least as prominent as the Sherman Act, and many of the speakers professed strong disappointment in Sherman Act litigation to that time. Although the speakers were hardly unanimous, the strongest consensus around a single view was that the trusts should be controlled by changes in corporate law.

Prior to the Chicago Conference, the Civic Federation had sent a questionnaire to participants.¹⁶ The summaries contained in the Proceedings say nothing about the methodology, but there were 554 respondents to 69 questions. The respondents were described as “trusts, wholesale dealers, commercial travelers’ organizations, railroads, labor associations, contractors, manufacturers, economists, financiers, and public men.”¹⁷ A separate list, or circular, was sent to a smaller but overlapping group of lawyers, economists, and “public men.” The description of the survey also fails to indicate whether respondents were limited to one answer or could select multiple answers. Nor does it specify how recipients of the questionnaire were selected and what was their distribution over various interest groups. These omissions largely reflected the state of public opinion research at the time.¹⁸ In any event, nothing suggests that this was anything more than an informal questionnaire distributed broadly to invitees.

David Kinley, a professor from the University of Illinois, reported on the results.¹⁹ Three quarters of the participants overall believed that the trusts injured consumers.²⁰ Two-thirds of the respondents regarded the trusts with “apprehension.”²¹ Most on the main questionnaire believed that the trusts resulted in higher prices.²²

The items of information about prices aggregate 506; 452 were to the effect that prices rose after combinations were made; 24 that they fell, 15 that there was no change, and 15 that they were fluctuating; 210 do not specifically assign a cause, 189 assign trusts as the cause of the change (increase, in most of these cases); and 40 assign other causes, usually “increased demand,” “rise of raw materials,” or the tariff.

Id. at 531. However, 90% of the respondents on the second Circular, which was more focused on academics, lawyers, and government officials, believed that the effect of the trusts was to reduce costs.²³ Two-thirds of the respondents on this second list also believed that consumers would benefit. Interestingly, roughly two-thirds of the respondents overall believed that labor organizations should be treated as all other trusts, while one-third took an unspecified “opposite view.”²⁴ This suggests that the idea of a labor “exemption” from antitrust law did not have popular support in 1900.²⁵ Tellingly, this occurred after the federal courts had begun using the Sherman Act as a powerful striking-breaking device.²⁶ Evidently, most of the participants did not object.

The survey concluded with a very general question: “What shall be done with combinations?” The answers were all over the place, with pluralities going to unspecified “legislation” (61 respondents), “let alone” (60) and the third highest specific proposal going to “Tariff revision” (45). “Antitrust” did not appear on the list, except to the extent it may have been included in unspecified legislation. Twenty-six respondents preferred government ownership or control of natural monopolies, and even fewer (10) supported “Stricter Limitation on Corporate Powers.”²⁷ No specific proposal other than “let alone” received 10% of the votes, and it received only 10.8%. The list of options did not include any that were obviously related to morals or ethics, although 123 responses were classified as “miscellaneous,” with no specification of their content.

At the time of the conference, the Sherman Act was nearly ten years old and had produced two important Supreme Court decisions condemning railroad cartels.²⁸ Even here, the very small number of comments on the railroad cartel decisions were more negative than positive. One complaint was that the railroad cartel cases did not authorize the courts to set reasonable rates, but only to condemn bad agreements.²⁹ Another was that the Trans-Missouri railroad cartel case, which had adopted a per se rule against price fixing, had largely “expunged” the rule of reason from the law.³⁰

By 1900 the Sherman Act had also been used aggressively several times against labor unions, a development that was both praised and condemned by participants. In nearly all of the labor cases the plaintiff had been the United States, thus inviting debate about what should be government policy toward labor union activities.³¹ P.E. Dowe, statistician of the Anti-Trust League, declared that while the cost of living within the last two years had increased some 12–16%, wages had risen by less than 3%.³² Nevertheless, as noted above, there was little support for labor antitrust immunity. Overall, while attitudes toward labor changed significantly between 1890 and 1914 when the Clayton Act was passed, most of this was not yet reflected in the conference proceedings.

Other conference participants criticized the Supreme Court’s very first Sherman Act decision, United States v. E.C. Knight Co.,³³ which had concluded that Congress lacked the constitutional authority to control intrastate manufacturing simply because the goods were destined for interstate shipment. That provoked the view that the country “must have a constitutional change if the general government is to deal with the trust problem.”³⁴ Another speaker praised the railroad cartel decisions as well as E.C. Knight for developing the distinction between intrastate and interstate trusts.³⁵ Many commentators expressed concerns about federalism, but most were of the nature that while the states had a primary role in combatting trusts they could not control interstate companies without federal assistance.³⁶

Several conference participants spoke about the role of costs. Many recognized that the trusts tended to reduce costs.³⁷ Even the “Great Commoner” William Jennings Bryan acknowledged the cost reductions but protested that nothing ensured that these savings would be passed on in the form of lower prices.³⁸ “A trust, a monopoly, can lessen the cost of distribution. But when it does so society has no assurance that it will get any of the benefits . . . .”³⁹ Similarly, others indicated a concern for higher prices. For example, John M. Stahl of the Farmers’ National Congress acknowledged that the trusts had lowered costs but accused them of setting anticompetitively high prices.⁴⁰ Some participants defined competition in terms of cost reduction.⁴¹

Critics later faulted the Chicago Conference for failure to make specific recommendations, and state governors called a second conference for that purpose which met in St. Louis later in 1899. It issued a number of recommendations, but its proceedings were apparently never published and it received little attention from the press. It was dominated by state attorneys general who focused largely on corporate law remedies.⁴² Its recommendations either duplicated those already contained in the Sherman Act or else called for corporate law modifications limiting the power of corporations to do business in more than a single state.⁴³

the enactment and enforcement, both by the several States and the nation, of legislation that shall define as crimes any attempted monopolization or restraint of trade in any line of industrial activity,..; punishment to the corporation to the extent of dissolution; an efficacious system of reports to State authority by corporations and the strict examination of all such as are organized under its laws; the prevention of entrance within a State of any foreign corporation for any other purpose than interstate commerce, except on terms that will put it on a basis of equality with domestic corporations, making it mandatory upon foreign corporations to procure State license as a condition precedent to their entry ; the enactment of State legislation preventing corporations created in one State from doing business exclusively in other States; providing that no corporation shall be formed in whole or in part from another corporation, or hold stock in another corporation engaged in similar or competitive business; recommending that each State pass laws providing that no corporation which is a member of any pool or trust in that State, or elsewhere, can do business in that State ; that the capital stock of private corporations should be fully paid up, and that shareholders shall be liable to twice the face value of the stock held by each.

The path of antitrust development that took place in subsequent years leading up to the Clayton Act in 1914 was much more focused than the conference debates, mainly because many alternatives dropped away. The move for a national incorporation statute or expanded state corporate law remedies ran out of gas.⁴⁴ Debates over the tariff remained, but no legislation ever linked them to trusts as such.

The role of labor became more controversial after 1900, with distinctive positions emerging by the 1912 presidential election. The 1912 Democratic Party platform called for protection of labor organizing so that “members should not be regarded as illegal combinations in restraint of trade.”⁴⁵ The Republican platform was silent on that issue, although it did advocate for preservation of high tariffs as a means of protecting workers’ wages.⁴⁶ High tariffs, it should be noted, protected producers directly, and labor only if producers passed on some of their gains in the form of higher wages. The Progressive Party, with Theodore Roosevelt as its head, called for an end to labor injunctions but did not mention a substantive antitrust immunity.⁴⁷ The Democrat’s 1912 election victory very likely accounts for insertion of a labor immunity into the Clayton Act, now as section 6.⁴⁸

Debates over good morals in business behavior are of course never ending, but the concerns were never reflected in the text of an antitrust statute. Rather, when it passed the Clayton Act in 1914, the Progressive-dominated Congress doubled down on the use of exclusively economic language. The Act condemned conduct when it threatened to “substantially lessen competition” or “tend to create a monopoly.”⁴⁹

One thing that emerges powerfully in the proceedings of the conference is that, even though the participants represented a wide variety of political beliefs as well as professions, for a clear majority of them the dominant concern was with the power of the trusts to set high prices or drive rivals out of business. But there were some exceptions. Of the roughly seventy participants whose statements were published, a half dozen emphasized political or social concerns either in addition or as an alternative to the economic ones. The most prominent in Progressive circles was economist Henry Carter Adams, at this time a statistician for the Interstate Commerce Commission. Adams spoke at some length about rising concentration and economic power, as well as the deficiencies of state corporate law. However, he also complained about the “general social and political results of trust organizations” that must be considered. “For the preservation of democracy there must be maintained a fair degree of equality in the social standing of citizens,” he observed, and wondered whether the rise of the trusts was consistent with that.⁵⁰ He concluded:

I would not claim, without discussion, that the trust organization of society destroys reasonable equality, closes the door of industrial opportunity, or tends to disarrange that fine balance essential to the successful workings of an automatic society; but I do assert that the questions here presented are debatable questions, and that the burden of proof lies with the advocates of this new form of business organization.⁵¹

He also suggested that the trusts might have outsize political influence.⁵²

Dudley Wooten, then a member of the Texas legislature, agreed, arguing that the trusts were antidemocratic perversions brought about by selfishness.⁵³ Aaron Jones, a leader of the national Grange, a populist political organization of farmers,⁵⁴ observed that the sugar trust made political contributions to the Republican Party in Republican-controlled states and to the Democrats in Democrat-controlled states.⁵⁵ John W. Hayes, General Secretary of the Knights of Labor, saw a political war between the power of the state and the power of the trusts,⁵⁶ as did Edward W. Bemis from the Bureau of Economic Research.⁵⁷ However, Bemis also praised chain stores for offering low prices and distinguished them from the trusts. While the trusts cut prices selectively in order to drive out rivals, the department store “furnishes alike to all the advantage of lower prices, which are rendered possible by the economies of a big business.”⁵⁸ William Dudley Foulke, a prominent journalist and political activist for Progressive causes, argued that “the political and social effects of monopoly are far more menacing to society than its economic results.”⁵⁹

For more conservative political activist George Gunton, by contrast, politics were present but pulling the other way: politicians were being urged to abandon sound economic principles of “industrial freedom” in order to vote the “arbitrary paternalism” of harsh regulation of the trusts.⁶⁰

Following the Chicago Conference, Progressives began to focus more narrowly on the antitrust laws and the discipline of economics as the preferred tool for dealing with the trusts. While political and moral rhetoric about the trusts has always been present, there is little evidence that it provided substantial guides to policy making. The dominant tool became marginalist economics, then in its infancy, and the darling of the younger generation of political economists in the United States. Most of these were Progressives with a much stronger bias in favor of government intervention than their predecessors had supported.⁶¹

The principal tools that emerged were (1) partial equilibrium analysis, which became the basis for concerns about economic concentration, the distinction between short- and long-run analysis, and later came to justify and provide support for the concept of antitrust’s “relevant market”; (2) classification of costs into fixed and variable, with the emergent belief that industries with high fixed costs were more problematic; (3) development of the concept of entry barriers, contrary to a long classical tradition of assuming that entry by new firms is easy and quick; (4) the distinction between horizontal and vertical relationships and the emergence of vertical integration as a competition problem; and (5) price discrimination as a practice that could have competitive consequences. Finally, toward the end of this period came (6) theories of imperfect competition, including the rediscovery of oligopoly theory and the rise of product differentiation as relevant to antitrust policy making.

II. MARGINALIST ECONOMICS AND MARKET REVISIONISM

The antitrust movement in the United States coincided with a far-reaching revolution in economics. The marginalist revolution has unfortunately been seriously undervalued in history writing about antitrust, mainly because so many historians did not understand it and failed to appreciate its implications.⁶² Nevertheless, the fact remains that one cannot understand the set of tools that Progressive antitrust policy makers deployed without understanding their underlying economics. By the 1930s nearly all economists were marginalists.⁶³

The classical political economists had seen value as inhering in goods or the labor that went into making them.⁶⁴ They tended to assess costs and benefits by looking at averages, which were necessarily taken from the past. They also tended to believe that capital would flow naturally toward profit and that the only practical impediment was government licenses or other restrictions.⁶⁵ In sharp contrast, marginalists saw value as willingness to pay or accept for the next, or “marginal,” unit of something. As a result, its perspective on value was forward looking. Further, market entry was a dynamic concept and its ease and likelihood varied greatly from one market to another.

Three features of marginalism account for both its influence and the resistance to it. One was that marginalist analysis enabled various values governing demand, supply, or economic movement to be “metered,” or quantified, in ways that classical political economy could not do. This feature also made marginalist economics much more technical, with increasing informational demands, but also promised to give Progressive Era economists capabilities far beyond those of their predecessors. Second, and relatedly, marginalism expanded the use of mathematics in economics, to a degree unknown by the classical political economists. This became a particularly attractive feature to younger economists and social scientists looking to add rigor and expertise to their disciplines. It also accounts for some of the resistance from older economists.⁶⁶A third feature was that marginalism undermined the classical view that markets are competitive unless the state creates monopoly. Under marginalism competitiveness was a matter of degree, and only a small percentage of markets satisfied the conditions for perfect competition. As a result, marginalism began to make a broad and unprecedented case for selective state intervention in the economy.⁶⁷

A. Markets as Human Institutions: Coercion

The classical political economists saw the world of commercial relationships in binary terms. For private arrangements people were either free or bound. Aside from government constraint, the boundaries of obligation were defined by contract, property, and tort law. Value inhered in things or the labor used to produce them, and people either purchased or not. Setting aside public obligations, within that world people were free to make their own economic decisions unless a contract, property right, familial hierarchy, or sovereign command bound them.⁶⁸ That bond was particularly strong because the common law principle of liberty of contract refused to set very many contracts aside.⁶⁹ Further, the classical tradition regarded the market itself as a part of nature. Francis Wayland’s popular textbook on political economy defined the discipline in 1886 as “a branch of true science,” and by science “[he] mean[t] a Systematic arrangement of the laws which God has established.”⁷⁰

By contrast, one prominent feature of the late nineteenth century was its fascination with change—in everything from biological evolution to physics to mechanics. The historian Howard Mumford Jones described the period as the “Age of Energy.”⁷¹ It was only natural that economists would develop marginalism, with its forward-looking concept of value that focused on change and the next thing rather than on averages from the past.⁷² “Equilibrium” became the steady state to which all change aspired but seldom reached. Motion rather than stasis was the natural order of things.

Marginalism began with the premise that value is a measurable expression of human choice. Value depended on willingness to pay or willingness to forego. Further, marginalism distinguished among goods depending on costs, availability, and preference. One corollary was the increasing belief that markets were not all the same and did not all function equally well. This opened the way for more substantial if selective intervention to correct market deficiencies.⁷³

This change in the conception of markets from a pure product of nature to a created human institution was perhaps Progressive economics’ most important contribution. Markets became imagined as human creations and not merely a reflection of permanent natural laws. Their design was a product not only of preference but also of state policy, which could be for good or for ill. As the institutionalist progressive economist John R. Commons put it in his important book on law and capitalism, the evolution of economic phenomena was artificial, more like “that of a steam engine or a breed of cattle, rather than like that of a continent, monkey or tiger.” ⁷⁴ Further, the “phenomena of political economy” are in fact “the present outcome of rights of property and powers of government which have been fashioned and refashioned in the past by courts, legislatures and executives through control of human behavior by means of working rules, directed towards purposes deemed useful or just by the law-givers and law interpreters.”⁷⁵

An outpouring of literature stretching from the 1890s through the early decades of the twentieth century developed aspects of this view that markets are “created” rather than simply present in the natural world. One manifestation was unprecedented economic concern with the distribution of wealth as a legitimate target of state policy because, after all, the state was responsible for it in the first place.⁷⁶ Progressive economist Richard T. Ely argued in his two-volume book on the common law and the distribution of wealth that the legal system itself was strongly biased against the poor. The coercive rules of property and contract relinquished power to those who already had it.⁷⁷ In a review, Cambridge economist Charles Percy Sanger concluded that “the most salient fact is the mass of evidence which shows how hostile the constitution of the United States, as interpreted by judges, is to the poor or the public.”⁷⁸

A related consequence that had more salience for antitrust policy was the idea that markets themselves could be coercive instruments that limited human freedom. Columbia Professor Robert Hale, another Progressive who was one of the earliest economists to be hired onto a law school faculty, expressed this idea for an entire generation. In an article entitled “Coercion and Distribution in a Supposedly Non-Coercive State,” he observed that the economic systems that had been developed by classical economists gave lip service to freedom. In reality, however, their systems are “permeated with coercive restrictions of individual freedom, and with restrictions, moreover, out of conformity with any formula of ‘equal opportunity’ or of ‘preserving the equal rights of others.’ ”⁷⁹

Many of these newly discovered concerns about market coercion showed up in public law—things such as greater protection for labor from onerous wage agreements, prohibitions of child labor, women’s suffrage, the progressive income tax, and eventually the expansive safety net programs of the New Deal. But they also affected competition policy. For example, the law of vertical restraints became increasingly aggressive, particularly in its protection of small retailers. It abandoned very benign common law rules for virtual per se illegality for most distribution agreements that limited dealer behavior, as well as aggressive rules for vertical mergers.⁸⁰ The classical conception that new entry would always be around to discipline monopoly unless the government prevented it gave way to one that saw markets themselves as forestalling new competition.⁸¹ The idea of competition itself came increasingly under attack, and not from socialists who did not believe in it. Rather it was from neoclassically-trained economists who realized that the viability of competitive markets depended on several assumptions that did not invariably obtain.⁸²

B. Partial Equilibrium Analysis

Marginal utility theory permitted the creation of tools for determining the relationship between costs and either competitive or monopoly prices within a firm. By itself, however, it was not able to assess how competition works among multiple firms or what the conditions are for achieving it. That required additional theory about interactions among firms.

Partial equilibrium analysis permitted people to group firms producing similar products into “markets” on the assumption that the interactions of firms within the same market were much more important for evaluating competition than the interactions (or lack of them) among firms in different markets. Cambridge University Professor Alfred Marshall, the first great marginalist industrial economist, borrowed this approach from the science of fluid mechanics: for goods within the same market, prices and demand would flow toward equality, but not across the market’s boundaries.

In 1890 Marshall brought the ideas of marginal utility and equilibrium together in a way that made the analysis of market behavior both tractable and useful. First, he developed what came to be known as the Marshallian demand curve, illustrating the inverse relationship between price and output of a single commodity.⁸³ The downward slope of the demand curve is driven entirely by the next, or “marginal,” buyer’s willingness to pay for one unit of that commodity. The model ignored choices people might make about different commodities, even though in a world of limited budgets such choices were relevant.

Marshall was not the first marginalist,⁸⁴ but he did turn marginalism into a practical tool of competition analysis. He explained that he had come to attach great importance to the fact that our observations of nature, in the moral as in the physical world, relate not so much to aggregate quantities, as to increments of quantities, and that in particular the demand for a thing is a continuous function, of which the “marginal” increment is, in stable equilibrium, balanced against the corresponding increment of its cost of production.⁸⁵

For example, a firm would calculate a selling price by comparing the amount of additional cost that production and sale would encounter and the amount of additional revenue that it would produce.⁸⁶

Marginalism provided a partial theory of individual firm behavior, but not so obviously a theory of firm interaction and competition. In order to do that, Marshall needed a mechanism for identifying who in the economy competes with whom. This was in contrast to earlier contemporaries such as Leon Walras and Marshall’s own successor as professor of political economy at Cambridge, Arthur Cecil Pigou, who were more concerned with the economy as whole. Today this division roughly segregates macroeconomics and microeconomics.⁸⁷

Marshall’s concern was to make economic analysis more manageable by focusing on those firms that competed with one another in an obvious way. He realized that everything in an economy affects everything else, but the most important influences can be identified and tracked. In the influential eighth edition of Principles, published in 1920, Marshall observed that informational demands made it necessary for people, with their “limited powers” to “go step by step.”⁸⁸ They would have to break up “a complex question, studying one bit at a time and at last combining [their] partial solutions into a more or less complete solution of the whole riddle.”⁸⁹

He described his solution, which came to be known as partial equilibrium analysis, this way:

The forces to be dealt with [in the economy are] so numerous, that it is best to take a few at a time; and to work out a number of partial solutions as auxiliaries to our main study. Thus we begin by isolating the primary relations of supply, demand and price in regard to a particular commodity. We reduce to inaction all other forces by the phrase “other things being equal”: we do not suppose that they are inert, but for the time we ignore their activity. This scientific device is a great deal older than science: it is the method by which, consciously or unconsciously, sensible men have dealt from time immemorial with every difficult problem of ordinary life.⁹⁰

This focus on individual industries quickly took over the entire field of business economics, or “industrial organization,” as a distinct area of economic inquiry. Industrial organization theory seeks to determine the conditions under which a particular industry attains equilibrium. Today, antitrust has become a substantially microeconomic discipline, certainly in litigation if not always in theory.

Marshall set industrial organization economics on the path of studying industries individually by identifying goods, which he termed “commodities,” that were sufficiently similar that they could be said to compete with each other. He borrowed from Augustin Cournot the definition that a “market” is the “whole of any region in which buyers and sellers are in such free intercourse with one another that the prices of the same goods tend to equality easily and quickly.”⁹¹

This assumption had numerous implications that were relevant to antitrust. One was to invite questions about exactly how to identify who was in such a market and who was not. A second was to consider whether the identity of the firms in this grouping changed over time. The concept of “entry barriers” explained the likelihood that firms would cross this line, coming in when profits were high. A third was to make the analysis of relationships among competitors, or “horizontal” relationships very different from the analysis of vertical or other relationships.⁹² A fourth was a search for the conditions that either furthered or undermined competition once such a group of firms or their commodities had been defined.

Marshall clearly realized that in reality there is no such thing as a single market that is completely isolated from the rest of the economy. Partial equilibrium analysis, as it came to be called, was no more than a working assumption—although a very important one for making economic analysis manageable. The idea that groupings of similar (competing) commodities should be industrial economics’ principal subject of study had a profound influence on antitrust policy. One of the most important antitrust tools to come out of this focus was the idea of the “relevant market,” or the grouping of sales whose products and prices are strongly influenced by one another.⁹³

The late nineteenth century was the golden age of engineering and science, including social science and economics. Marshall borrowed his ideas about markets, movement and equilibrium straight from Newtonian physics: “When two tanks containing fluid are joined by a pipe, the fluid, even though it be rather viscous, which is near the pipe in the tank with the higher level, will flow into the other,” he wrote in 1890.⁹⁴ Further, “if several tanks are connected by pipes, the fluid in all will tend to the same level . . . .”⁹⁵

While he appeared to be discussing fluid mechanics, Marshall was actually speaking of the principle of economic substitution at the margin, which he defined as the tendency for prices within a single market “to seek the same level everywhere,”⁹⁶ just as the fluid in a tank. Further, “unless some of the markets are in an abnormal condition, the tendency soon becomes irresistible.”⁹⁷ Within this model a “market” was a closed system in which fluids moved naturally toward equality. A different market would be a different enclosed system, and without any flow from one system to the other. Further, as soon as one relaxed the assumption that resources would move freely and quickly from any place of low utility to any place of higher utility, it became prudent to investigate where such movements could be expected to occur, when they would be less likely, and what were the obstacles that stood in the way.

Irving Fisher, who was to become one of America’s most important early marginalists, used his Ph.D. program at Yale in the 1890s to construct a “utility machine.” The machine illustrated with fluids controlled by pumps and valves how prices within the same market flowed to an equilibrium, but did not flow across market boundaries.⁹⁸

The utility machine was thought to be so innovative that it was scheduled for display at the 1893 Columbian Exhibition in Chicago but was destroyed in route.⁹⁹ Other American economists also used illustrations derived from fluid mechanics to illustrate the equilibrium of prices in a market.¹⁰⁰

Figure 1. Irving Fisher’s Utility Machine (1893)

Sources: Timothy Taylor, Photos of Fisher’s Physical Macroeconomic Model, Conversable Economist (Oct. 25, 2016, 8:06 AM), https://conversableeconomist.blogspot.com/2016/10/photos-of-fishers-physical.html [https://perma.cc/N64M-VLR7].

Marshall’s conclusion that the fluids in a tank would flow to a level equilibrium, even though they were “rather viscous,” presaged another development in marginalist economics: the idea of friction, or “costs of movement,” in the words of Marshall’s successor Pigou.¹⁰¹ This idea was later narrowed and refined to become “transaction costs.”¹⁰² The idea was simply that the costs of moving resources to an equilibrium varied from one market situation to another, and in some cases these costs prevented the movement altogether. As a result, one feature of some markets was “chronic disequilibria,” as Joseph Schumpeter later observed.¹⁰³ Another result was increasing awareness that these costs could interfere with a market’s movement toward competition. These concerns were reflected in the increasing attention toward barriers to entry, in contrast to the historical classical assumption of free entry.¹⁰⁴

Marginalist industrial economics also broke the bond that had always existed between classical political economy and laissez faire policy—at least until significant neoliberal pushback occurred in the 1940s. The classicists had been strenuous opponents of government intervention in the economy, but the new Progressives were not. Indeed, Marshall himself moved significantly to the left as he grew older.¹⁰⁵ As the technical study of market competition under marginalist principles developed, economists became increasingly concerned about defining the conditions for “perfect” competition. Accompanying this came the realization that the conditions are in fact quite strict. Nearly all markets deviated from them, although some more than others.¹⁰⁶ One thing that marginalism provided was a set of tools for measuring these deviations, provided that the data were available. Antitrust policy in turn became a tool for examining certain industry structures and practices in order to determine whether they were anticompetitive and, if so, whether they could be corrected by the legal system.

C. Industrial Concentration

The idea of a correlation between the number of firms in a market and its degree of competitiveness dates back to Cournot, a French mathematician who wrote in the mid-nineteenth century.¹⁰⁷ In Cournot’s model, as the number of effective competitive players in a market becomes smaller, the margin between price and marginal cost increases until it reaches the monopoly level with a single firm.¹⁰⁸ For more than a century, the relationship between industrial concentration and competitive performance has been an important component in competition policy, both at the legislative level¹⁰⁹ and more specifically in merger policy. Nevertheless, its role has been controversial.¹¹⁰

“Concentration” refers to the number of firms in a market and, under most measures, their size distribution. A market is said to be more concentrated as the number of firms goes down or as the size distribution is more lopsided. In order to have a measure of industrial concentration someone needs to have a concept of a market, or “industry,” and that is why partial equilibrium analysis was an essential premise.

Around the turn of the century, marginalist economists began to examine the relationship between market structure and industry performance. As early as 1888 Gunton used data from the U.S. Census of Manufactures to conclude that over the previous half century, industrial concentration in some markets had grown significantly.¹¹¹ For example, the cotton industry census data from 1830 and 1880 showed that during that interval the amount of capital invested in the industry grew fivefold, the amount of production more than tenfold, but the number of firms had actually shrunk from 801 to 756.¹¹² The data also showed that the amount of capital invested per worker had roughly doubled, indicating that the firms were becoming more capital intensive.¹¹³ Gunton also identified railroading, telegraphing, petroleum production, and sugar as showing greatly increased concentration.¹¹⁴

Gunton’s conclusions were not addressed to competitiveness. He never discussed the relationship between the number of firms in a market and the threat of oligopoly or collusion. He observed that some had complained that the “concentration of capital tends to increase prices”¹¹⁵ but found no evidence of it. Rather, he found that most of the facts “point the other way.”¹¹⁶ Prices in most of the industries that had experienced higher concentration had actually gone down rather than up.¹¹⁷ He also rejected the argument that “although these trusts have constantly resulted in reducing prices,” still greater saving would result “should the government run the business.”¹¹⁸ He then concluded that the large firms were fundamentally a good thing.¹¹⁹

Manifestly, therefore, the charge that the concentration of capital in the form of trusts and syndicates, necessarily tends to produce monopoly (in the obnoxious sense), destroy competition, increase prices, oppress labor, or to put the government into the hands of an industrial oligarchy, is without any real foundation in fact, or justification in reason. On the contrary, these institutions, instead of being the evidence of industrial abnormity and economic disease, are the natural consequence of modern industrial differentiation, and in their nature are economically wholesome, and politically and socially harmless.

Id. at 406. Gunton did not attribute these charges to any particular person. See also Charles H. Cooley, The Theory of Transportation, 9 Pub. Am. Econ. Ass’n 13, 75–76, 109–20 (1894) (finding increasing concentration troublesome but acknowledging that it led to lower costs).

Increasingly, however, economists and competition lawyers became less sanguine. Boston attorney Lionel Norman lamented that industrial concentration was increasing at an alarming rate.¹²⁰ Cornell economist Jeremiah Jenks and Walter Clark, a professor of mathematics and economics, were also much more pessimistic,¹²¹ as were Progressive economists Ely¹²² and Edwin R.A. Seligman.¹²³ Looking at the business landscape just after the turn of the century, Seligman concluded that the “study of modern business enterprise thus becomes virtually a study of concentration.”¹²⁴ He also relied heavily on data from the U.S. Census of Manufactures, which showed rapidly increasing concentration around 1900 and a significantly greater number of “combinations,” or firms that had attained their large size by merger. All but one of the top twenty-five combinations had been formed between 1890 and 1904.¹²⁵ On effects, he noted both the possibility of lower costs and higher profits.¹²⁶ He also noted that higher profits did not necessarily mean higher prices, because higher output and lower prices could also be profitable.¹²⁷ He seemed particularly troubled by the fact that the trusts earned higher margins, even if they sold at lower prices.¹²⁸

Progressive railroad economist and Harvard Professor William Z. Ripley also undertook a comprehensive examination of industrial concentration data derived from the Census of Manufactures.¹²⁹ The two census figures he found to be most informative were those of the number of firms in each consecutive five-year census period and the value of their gross product.¹³⁰ He concluded that in 142 of the 322 industries grouped in the census, the number of firms had declined, and there had been significant increases in per firm output. He was able to group industries by their tendency toward monopoly, simply by examining the trend toward increased concentration.¹³¹ “Concentration varies more or less directly with the degree of monopolization,” he concluded.¹³²

These writers generally assumed a correlation between the data contained in the Census reports and the “markets” that Marshall referred to for partial equilibrium analysis. In fact, the census data correlated very poorly. For example, one classification in the 1909 Census of Manufactures was “[f]urniture and refrigerators,” which included both metal and wood furniture of all kinds, as well as wooden iceboxes and metal refrigerators, which were first coming into commercial use.¹³³ A metal refrigerator did not compete very much with an upholstered chair, which did not compete very much with a wooden bed. This very poor fit between industry census data and antitrust markets has served to weaken conclusions about industry competitiveness from census classifications—something that a few Progressive economists realized already at the turn of the century.¹³⁴ This poor correlation has remained to this day as a problem with the measurement of industrial concentration through the use of census data. The classifications are better today than they were a century ago, but they still are not well designed to address this problem.¹³⁵ Nevertheless, data of this type have been in continuous use to produce measures of industry competitiveness ever since the late nineteenth century.¹³⁶

The Chicago School largely rejected the significance of concentration data, opting for a position more like Gunton’s that the aggregation of large firms resulted mainly in greater efficiency and lower prices.¹³⁷ Numerous other scholars from the mainstream and further left have disagreed.¹³⁸ In the mid-1970s, the debate produced an influential conference collecting representatives from both sides.¹³⁹ The resulting book hardly put the debate to rest, however, and census-driven concentration data continue to find a controversial but important place in debates about American competitiveness. For example, the Biden Administration’s 2021 executive order on American competitiveness lamented declining competition and relied on concentration data to make the point.¹⁴⁰

D. Fixed Costs and Equilibrium

Both marginalism as a theory of value and Marshall’s theory of equilibrium made cost classification essential. In fact, for Marshall, the cost problem produced significant frustration. Competition drives prices to marginal cost which, by definition, are costs encountered for each incremental change in output. But if hard competition drives prices to marginal costs, then how could a firm pay off its other costs?

Marshall used the term “marginal cost” to describe the immediate additional cost that a firm faced when it increased output by a single unit. In a chapter on the “Equilibrium of Normal Demand and Supply,”¹⁴¹ he observed that under what he called “free competition” prices would be driven to a level very close to marginal cost,¹⁴² and this would become a stable equilibrium.¹⁴³

Marshall’s theory of marginal cost was an effort to determine how firms decide on prices. He observed that prices are related to costs but not all costs are the same. Some costs seem to be quite unrelated to a firm’s decision about what price to charge, at least over the short run. This included administrative costs as well as depreciation on plant and durable equipment.¹⁴⁴ In calculating whether a particular price is immediately profitable, the firm largely ignores these costs. Marshall identified “total cost” as the sum of these supplemental costs plus marginal costs.¹⁴⁵ In the short run each additional sale would add to a firm’s profit so long as it was at a price that exceeded the firm’s marginal costs.

Marshall never used the terms “fixed costs” or “variable costs.”¹⁴⁶ He devoted an entire chapter to “cost of production,” which spoke of “prime costs,” “total costs,” and “marketing costs.” The words “prime” and “direct” were almost always used as references to what we would call variable costs.¹⁴⁷ Within prime costs he included “the (money) cost of raw material used in making the commodity and the wages of that part of the labour spent on it which is paid by the day or the week.”¹⁴⁸ He excluded salaries such as are paid to management because these did not vary with output over the short run.¹⁴⁹

Marshall observed that for goods that require a “very expensive plant” the “[s]upplementary” cost is a “large part of their [t]otal cost.”¹⁵⁰ As a result, a “normal price” “may leave a large surplus above their [p]rime cost.”¹⁵¹ In today’s terminology, in order to be profitable a business with high fixed costs would have to charge a premium above its variable costs. He also observed what would become a significant problem for establishing equilibrium in markets with high fixed costs. “[I]n their anxiety to prevent their plant from being idle” producers may “glut the market.”¹⁵² If they “pursue this policy constantly and without moderation,” price may be so low “as to drive capital out of the trade, ruining many of those employed in it, themselves perhaps among the number.”¹⁵³ When firms are under “keen competition” this urge becomes inevitable, and firms “whose business is of this kind . . . are under a great temptation” to sell “at much less than normal cost.”¹⁵⁴

Marshall’s problem was getting an equilibrium that would sustain a market that was both competitive and had high fixed costs—an increasingly prominent feature of industrial production. By his eighth edition in 1920, Marshall had come up with a largely unsatisfactory biological model to explain how firms with significant fixed costs might attain equilibrium. Firms were like trees in a forest, he explained. They have individual lifecycles, and thus come and go, and some never survive infancy.¹⁵⁵ This organic metaphor never fit very well into the emergent neoclassical model of equilibrium that looked strictly at the mathematics of profit-maximization.¹⁵⁶

During the formative years of antitrust policy in the United States, a “fixed cost controversy” drawn from Marshall’s model of competition dominated important debates about the appropriate roles of competition, antitrust policy, and regulation.¹⁵⁷ In industries such as the railroads or heavy steel manufacturing, the argument went, “ruinous” competition would occur because firms would be forced to cut their prices toward marginal cost, leaving insufficient revenue to pay off their fixed costs. One equilibrium solution was the emergence of monopoly, perhaps by merger. Others were collusion or price regulation. These concerns were very likely a major contributing factor to the great merger wave that occurred around the turn of the twentieth century.¹⁵⁸ Antitrust lawyers representing cartel defendants in markets with high fixed costs repeatedly asserted a “ruinous competition” defense to price fixing, but the federal courts consistently rejected it,¹⁵⁹ as they do today.¹⁶⁰

On the other side, several of the more left-leaning Progressives denied that there was any such thing as chronic overproduction.¹⁶¹ By rejecting the defendants’ arguments, the Supreme Court was effectively taking their position. That was ironic, because the principal architect of the view was Justice Peckham, also the author of Lochner v. New York.¹⁶² He could hardly be classified as a left-leaning Progressive. Peckham’s opinion in the Joint Traffic case expressed strong doubts about the ruinous competition argument, concluding that the principal consequence of very low rates was increased demand, which would in turn produce a larger supply.¹⁶³ One possibility, of course, was that Justice Peckham did not fully understand the implications of high fixed costs.

Justice Peckham’s clever response to the defense in the Addyston Pipe case was that, whether or not competition was ruinous, the defendants themselves could not be trusted to set a price no higher than necessary to prevent it. In fact, they had set prices so high as to deprive the public of the advantages of any competition at all.¹⁶⁴ The Court cited cost evidence developed in the lower court that the reasonable cost of the defendants’ pipe, including a fair profit, did not exceed $15 per ton and could have been delivered profitably to Atlanta for $17 to $18 per ton. The bid price was actually $24.25 per ton.¹⁶⁵ That statement at least suggested that one judicial response to a ruinous competition defense could be a judicial inquiry into costs, but the Court never went down that rabbit hole. It simply rejected the defense outright, as it has done ever since.

Theorizing about the behavior of firms with high fixed costs became a central focus of early antitrust literature, as well as the early American economic literature on the theory of industrial organization.¹⁶⁶ It also proved to be a general attack on the model of perfect competition.

Prior to the development of imperfect and monopolistic competition models in the early 1930s¹⁶⁷ the principal Progressive theorist of fixed costs was the institutionalist economist John Maurice Clark. Clark found the existence of significant fixed costs, which he termed “overhead” costs, to be a disruptor of the standard notion of the equilibrium of supply and demand under competition.¹⁶⁸ The problem, as he noted, was that in the short run of immediate demand price and output are determined by demand and marginal cost, but in the presence of fixed costs this could be attained only over some longer run.¹⁶⁹ High fixed costs continuously produced “irregularities” that threw the relationship between demand and supply out of balance, with some periods of excess capacity and others of excessive demand.¹⁷⁰ Echoing Marshall, he observed that “where overhead costs are a substantial item, the perfect theoretical equilibrium is not found.”¹⁷¹

The implications, as Clark worked them out, were chronic overproduction, because any price above short run marginal cost would serve to reduce the deficit in payment of fixed costs.¹⁷² Another result was that price discrimination became a profitable strategy to the extent that a firm was able to maintain higher prices on established demand while bidding a lower price for new sales.¹⁷³ One characteristic of price discrimination as a solution to the problem of high fixed costs is that when it occurs it results in increased output. Clark concluded that there was nothing inherently anticompetitive or even suspicious about most instances of price discrimination.¹⁷⁴ They were simply a mechanism that firms used to sell individual batches or product at a profit-maximizing (or loss-minimizing) price. That view has very largely persisted within antitrust policy.

The Marshall equilibrium problem ultimately went away when economic models began to incorporate product differentiation, particularly in the theory of monopolistic competition.¹⁷⁵ The principal problem had been Marshall’s assumption that all sellers in competition sold identical “commodities.” As a result, firms competed only on price. When differences in the product or even the terms of sale were incorporated, it became possible to have equilibrium without relying on any non-economic theorizing about the nature of the firm. The significance of this debate, which occurred almost entirely during the Progressive and New Deal eras, is difficult to exaggerate. It gave us much of our theory about equilibrium in industrial markets, analysis of costs, and theories about the limits of competition and the appropriate scope of regulation.¹⁷⁶ It also fueled the Harvard School view that markets differ from one another, and antitrust policy thus requires intense factual queries into particular industries and practices.

E. Market Failure and Regulation

The fixed cost controversy strongly supported Progressives’ suspicions that markets were not as inherently benign as the classical political economists had believed. However, some worked better than others. Antitrust for its part is dedicated to the proposition that markets can be made to work tolerably well on their own with only selective intervention. In other cases, however, the roots of failure are so deep that ordinary market forces are ineffective.

Increased appreciation of market diversity led to a more general theory of market “failure,” championed by Pigou.¹⁷⁷ Pigou developed the idea of a “divergence” between private and social costs, or “externalities” that private bargaining could not correct. For example, a negative externality might occur when a polluting refiner was not required to compensate downwind neighbors for its air pollution. By contrast, a positive externality occurred when the inventor of a new product could not effectively prohibit people from copying it. In the first case the result would be too much pollution; in the second case it would be too little invention.

The idea, which became more technically expressed in the 1950s,¹⁷⁸ was that in a few markets sustainable competition is impossible without state intervention. The goal of regulation became to emulate competitive outcomes in these markets. Adams had anticipated a version of that argument already in the 1880s, arguing that competition was not sustainable in industries with declining costs because the emergence of monopoly was inevitable.¹⁷⁹

The Progressive Era then saw an outpouring of literature on regulation as a corrective for market failure, much of it focused on transportation and public utilities.¹⁸⁰ Among the most important contributions was Joseph Beale and Bruce Wyman’s 1906 book on railroad regulation.¹⁸¹ They made two important observations. The first was that monopoly provisions in corporate charters for railroads and bridges were common at least since the early nineteenth century. The argument that Justice Story articulated for them already in 1837 was that monopoly privileges were essential to attract investment into public utility markets, which were distinctive because of the amount of investment they acquired.¹⁸² However, Beale and Wyman observed a second rationale, which was “virtual monopoly”—namely, that the cost structure of these industries required a monopoly. Further, they argued, this was the “true ground” for regulation of monopolies.¹⁸³ “[W]here competition prevails it regulates the conduct of business by its own processes, but monopoly requires the intervention of the law of the land . . . .”¹⁸⁴

This neoclassical theory of regulation has since formed the basis of core regulatory theory in the United States, as well as one of its most controversial features: cost-of-service rate making.¹⁸⁵ The idea of market failure expanded significantly in the 1930s and after, bolstered in significant part by the Depression. Regulation moved far beyond the relatively narrow neoclassical conception of market failure even to the idea that markets themselves cannot be trusted to distribute goods or services in an efficient, egalitarian manner.¹⁸⁶

Both Progressive and New Deal regulatory theory were aggressively assaulted in the 1960s and 1970s by Chicago School critics such as George J. Stigler.¹⁸⁷ His critique completely ignored natural monopoly or other structural characteristics thought to justify regulation. Rather, he substituted a theory based entirely on political capture—namely, that regulation is nothing more than interest group purchase of regulatory favors from legislatures or government agencies. Stigler never even mentioned declining average costs or natural monopoly. In fact, the only costs he discussed were the cost of operating the political process, including the costs to lobbyists or political operatives of obtaining favorable legislation.¹⁸⁸ He argued, for example, that the costs of successfully lobbying for an exclusionary occupational license are small when distributed over each member of society, but they can produce enormous gains to activists seeking such licensing protection.¹⁸⁹ In sum, Stigler’s model completely divorced the theory of regulation from firm costs or market structure; it was purely political.

That Chicago School effort substantially failed. It never generated a theory with significant explanatory power outside the realm of badly designed regulation that could be explained only by political influence. For example, it could not explain why public utilities are subject to price regulation at the retail level while groceries in every state are sold competitively, except perhaps by offering that the utilities had better lobbyists. To be sure, the Chicago School did make some important contributions at the margins—mainly by hammering home the proposition that regulation can lead to harmful capture and there are good reasons to be on guard about overreach. In addition, regulatory fervor led to excessive controls that did more harm than good. For that, however, the usable critiques came from centrists such as then-Professor Stephen Breyer¹⁹⁰ or Cornell economist and Chair of the Civil Aeronautics Board Alfred E. Khan.¹⁹¹

F. Price Discrimination

Price discrimination, which technically refers to selling to two or more customers at different ratios of price to cost, has always produced divisions in antitrust policy, most typically between economists and non-economists.¹⁹² Lawyers often view it with suspicion, something like race or gender discrimination. By contrast, economists have always tended to be more circumspect, and more inclined to divide it up into different varieties. Even a Progressive institutionalist economist such as John Maurice Clark discussed it in relatively benign terms. Minnesota economist and eventual Director of the United States Census Edward Dana Durand probably stated the consensus view among Progressive economists. In a critique of the Clayton Act, he observed that price discrimination “is an all but universal practice and is not necessarily injurious or calculated to bring about a monopoly.”¹⁹³ However, he also observed that price discrimination could be a strategy of selective predatory pricing used to drive competitors out of the market.¹⁹⁴

Most of the economic foundations for our understanding of price discrimination developed during the Progressive Era as an outgrowth of marginal analysis. The principal originator of the modern theory was Pigou.¹⁹⁵ Pigou divided price discrimination into three types, which he named first-, second-, and third-degree price discrimination. First-degree, or “perfect” price discrimination, is an analogue of perfect competition: it never exists in the real world but is an important tool for analysis. Under it, a seller sells every unit at that customer’s reservation price, or the highest price that customer is willing to pay. The result is that output is at the competitive level, but all of the industry profits go to producers rather than consumers.¹⁹⁶

Second-degree price discrimination occurs when the seller adopts a discriminatory pricing formula and the buyer “chooses” its price by selecting how to purchase. A quantity discount schedule is one prominent example. The purchaser can obtain a lower price by buying more. A discount for early booking is another.

In third-degree price discrimination the seller preselects categories of customers based on certain observed characteristics and charges them different prices—for example, one price for commercial users and another for residential users.

United States antitrust law has never developed general antitrust rules governing price discrimination. Section 2 of the Clayton Act, subsequently amended by the Robinson-Patman Act, addressed a practice that it called “price discrimination.”¹⁹⁷ But the set of practices that statute reached often had little to do with economic price discrimination. Rather, the statute simply condemned price differences.¹⁹⁸ The Progressives did often identify predatory price discrimination as one of the evils brought about by the trusts, particularly Standard Oil.¹⁹⁹ The result was the original section 2 of the of the Clayton Act,²⁰⁰

That it shall be unlawful for any person engaged in commerce, in the course of such commerce, either directly or indirectly to discriminate in price between different purchasers of commodities . . . where the effect of such discrimination may be to substantially lessen competition or tend to create a monopoly in any line of commerce: Provided, That nothing herein contained shall prevent discrimination in price between purchasers of commodities on account of differences in the grade, quality, or quantity of the commodity sold, or that makes only due allowance for difference in the cost of selling or transportation, or discrimination in price in the same or different communities made in good faith to meet competition: And provided further, That nothing herein contained shall prevent persons engaged in selling goods, wares, or merchandise in commerce from selecting their own customers in bona fide transactions and not in restraint of trade.

Id. which the Robinson-Patman Act later amended. The original statute was intended to reach a particular form of predatory pricing widely attributed to the Standard Oil Company as well as others.²⁰¹ The House Judiciary Committee report on the provision indicated that its purpose was to target the practice of large corporations using local price cutting intended to destroy a competitor.²⁰² In a 1923 decision, the Second Circuit described the condemned practice this way:

[P]rior to the enactment of the Clayton Act a practice had prevailed among large corporations of lowering the prices asked for their products in a particular locality in which their competitors were operating for the purpose of driving a rival out of business. Such lowering of prices was maintained within the particular locality while the normal or higher prices were maintained in the rest of the country; and this practice was continued until the smaller rival was driven out of business, whereupon the prices in that locality would be put back to the normal level maintained in the rest of the country. The Clayton Act was aimed at that evil.²⁰³

The statute did not explicitly require that the lower price be below cost, but that was largely the way it came to be interpreted.²⁰⁴ The Supreme Court initially construed the statute broadly without discussing any requirement of below-cost pricing.²⁰⁵ Further, the statute’s express limitation to “commodities” meant that it could not apply to things such as railroad rates, which were one of the biggest targets of price discrimination concern.²⁰⁶

John Maurice Clark’s important 1923 book on fixed costs made a convincing argument that, setting aside differences in bargaining relationships or customer sophistication, price discrimination is largely a consequence of fixed costs.²⁰⁷ A firm with a heavy fixed cost investment needs to keep its output up, and any sale at a price greater than incremental costs will improve its bottom line. As a result, it tries to retain legacy customers at higher prices while bidding lower prices for new or spot market sales. When a firm has excess capacity, these pressures are great.

This explanation of price discrimination was already known in the railroad industry by Clark’s time. Forty years earlier Yale economist and eventual President Arthur Twining Hadley had made a similar observation in justifying railroads’ policies of charging different freight rates for different commodities depending on shippers’ willingness to pay.²⁰⁸ By doing this the railroads were able to maximize output. Given their high fixed costs, this meant that the average cost of transportation went down.²⁰⁹

The Robinson-Patman Act was passed in 1936, subsequent to the period under discussion here. It was not a way of approaching the problem of fixed costs. The statute condemned many of the things that Clark’s analysis had explained as causing no competitive harm.²¹⁰ In any event, the Robinson-Patman Act was a complete misfire. The concern motivating the statute was the emergence of large chain stores such as A&P, which had become the nation’s largest grocer. A&P drove many smaller grocers out of business, mainly because it was vertically integrated and also because it was able to purchase in large quantities, enabling it to undersell small grocery stores. The Robinson-Patman Act ignored vertical integration and scale economies and identified the problem entirely in terms of a firm’s insistence on charging some buyers lower prices than others.²¹¹

The statute completely failed to limit vertical integration because of its requirement that both the higher priced and lower priced transactions be “sales.”²¹² The courts consistently held that a “sale” refers to a transfer of goods from one firm to a different firm. The vertical passage of a good from a firm to its wholly owned store or other subsidiary was not a “sale.”²¹³ The Act did condemn a few large suppliers, such as Borden, for selling milk to large grocers at a lower price than to small grocers.²¹⁴ Further, because the statute targeted “sales,” it did not effectively reach powerful buyers such as A&P itself. The statute did contain a buyers’ liability provision, almost as an afterthought, which was never very effective.²¹⁵

During the Progressive Era through the New Deal, the antitrust analysis of price discrimination was spotty and indeterminate. In fact, however, it remains indeterminate to this day. We have never developed good theory for generalizing about the competitive effects of price discrimination. The consensus of economists today is probably not much different from what it was in the 1920s and 1930s—namely, most instances are competitively harmless, particularly if the discrimination tends to increase output.²¹⁶

G. Monopoly Power and Structure: Potential Competition, Barriers to Entry, and the Relevant Market

In 1890, when the Sherman Act was passed, legal doctrine did not have a coherent conception of market power as a measurable phenomenon. Economics was not much further along. Judicial decisions contained plenty of discussions of “monopoly,” virtually always in relation to patents or other grants of exclusive rights. In most cases “monopoly” was simply assumed from the existence of the exclusive grant itself. For example, many nineteenth-century decisions spoke of the “patent monopoly,” as if the relationship between the two terms was automatic.²¹⁷ All of the references to patents in the Chicago Conference used the term this way.²¹⁸ The law dealing with various aspects of monopoly came essentially from three sources: patent and copyright law, the common law of unfair competition and contracts in restraint of trade, and state corporation law. None contained a market power requirement, and power was generally either assumed or irrelevant.

Estimation of market power by reference to the share of a relevant market, as it is used today in antitrust cases, was a relatively late arrival. Today it has become so conventional that we regard it as routine, and in 2018 a divided Supreme Court mistakenly concluded as a matter of law that it is the only way to assess power in a vertical case.²¹⁹ Since the existence and measurement of market power present questions of fact, the Court’s conclusion was not only technically incorrect, it was also a dictatorial intrusion of policy into fact finding. Econometric tools for assessing market power, such as the Lerner Index, were actually developed prior to judicial usage of the “relevant market” in antitrust analysis.²²⁰ Today econometric methods often produce better results than traditional measurement.²²¹ Further, the use of econometric devices is fundamentally inconsistent with the model of perfect competition. The firms within a perfectly competitive market have no power to price above marginal cost unless they collude. Implicit in the Lerner Index, and later in the development of more sophisticated econometric tools for assessing the power of individual firms, is that the firms are not operating in perfectly competitive markets.²²²

1. Potential Competition and Barriers to Entry

The belief that trusts both promised lower costs and threatened higher prices at least partly explains the heavy focus in the early antitrust literature on “potential competition” as a disciplinary tool. In 1895, Gunton optimistically described potential competition as a force “that is ever waiting to step in where large profits warrant the risk.”²²³ Even a dominant trust would not charge monopoly prices if the looming threat of competition was sufficient to keep its prices down. Classical political economists had always assumed that any attempt to charge monopoly prices would invite new competitive entry that would force prices back to the competitive level. About the only things that would prevent this were government restrictions on entry, including patents.

In his 1884 critique of traditional political economy, Ely, who was to become one of the most prominent Progressive economists, caricatured the classical assumptions of easy market entry, which he described as “the absolute lack of friction in economic movements. Not only do capital and labor move with perfect ease from place to place and from employment to employment, but this . . . is accomplished without the slightest loss.”²²⁴ Under this image of the economy, Ely continued:

The silk manufacturer diverts his capital into another employment like the construction of locomotives with precisely the same facility with which he turns his family carriage horse from an avenue into a cross street, while the Manchester laborer on a moment’s warning finds a suitable purchaser for his immovable effects and without expense or loss of time transfers himself to London where employment is at once offered him at the rate of wages there current. Equality of profits and equality of wages flowed naturally from these assumptions.²²⁵

By contrast, the emerging discipline of industrial economics began to consider how long this might realistically take, what were the market factors that determined the speed and scope of new entry, and the power of incumbent firms to throw obstacles in the way. As Adams admonished in his book on trusts, “[t]he point at issue is whether the public is justified in placing sole reliance upon potential competition, active competition having disappeared.”²²⁶

Privately created barriers emerged as a concern of antitrust law early in the Progressive Era. They were undoubtedly heightened by the Progressives’ increased sensitivity to the natural coercive power of markets.²²⁷ The Supreme Court recognized one such barrier already in 1904. In an early private action under the Sherman Act, the Supreme Court condemned a guild rule that limited membership and effectively prohibited market participation by tile layers who were not members of the defendant organization.²²⁸ Members of the association were prohibited from dealing with non-members. As Justice Peckham noted in his opinion for a unanimous Court, the association’s rules prohibited dealers from acquiring tile “upon any terms” from members of the guild, and all of the manufacturers in the area were members.²²⁹

A few years later, in the American Tobacco case, the Court referred to a dominant firm’s vertical integration and market foreclosure as creating “perpetual barriers to the entry of others into the tobacco trade.”²³⁰ Some lower courts were less concerned. For example, in United States v. Quaker Oats Co.,²³¹ the court rejected the government’s claim of attempt to monopolize, noting that the product at issue, packaged rolled oats, was a commodity produced by many firms, and that the defendant had no reasonable means of excluding them.²³²

Most of the participants in the multi-disciplinary proceedings of the Chicago Conference on Trusts saw potential competition as crucial to any assessment of the likelihood of monopoly. They disagreed about its effectiveness. The debates reveal that the classical assumption of free entry had become controversial. For example, Jenks was a skeptic. He acknowledged the existence of potential competition as a disciplinary force but doubted that the power of the large trusts to charge high prices would be effectively controlled.²³³

Attorney A. Leo Weil was less concerned. He observed that the trusts generally reduced costs and prices, but if there were any tendency toward price increases, potential competition from new firms would tamp them down. Further, this new entry could be expected to occur “unless the laws of trade are to be reversed.”²³⁴ Statistician Joseph Nimmo observed that as a consequence of the revolution in railroad transportation, the range of potential competition was much wider than it had been previously.²³⁵ Economist James R. Weaver from De Pauw University was even less concerned. He suggested that potential competition “rarely fails” to aid the consumer.²³⁶ Accumulations of capital were easily assembled, and those who controlled it stood “ready to enter any specific field of production, whenever the profits of that industry offer sufficient inducement.”²³⁷ Further, it was well known that at the present time entrepreneurs were sitting on “a great mass of idle capital.”²³⁸ As a result, “to avoid this new competition, prices must be lowered or profits shared with the consumer.”²³⁹ Francis B. Thurber, the President of the United States Export Association, believed that the trusts merely moved competition to a higher and more beneficial level:

If a combination of capital in any line temporarily exacts a liberal profit, immediately capital flows into that channel, another combination is formed, and competition ensues on a scale and operates with an intensity far beyond anything that is possible on a smaller scale, resulting in breaking down of the combination and the decline of profits to a minimum.²⁴⁰

John Bates Clark, the most prominent economist among the Conference participants, was much more skeptical.²⁴¹ In theory, he observed “potential competition . . . is the power that holds trusts in check,” but “[a]t present it is not an adequate regulator.”²⁴² The “potential competitor encounters unnecessary obstacles when he tries to become an active competitor.”²⁴³ He mentioned patents as one obstacle, but refused to endorse abolition of the patent system.²⁴⁴ He was also more cynical about the railroads, which he regarded as using manipulation of shipping rates as a device for deterring potential competition.²⁴⁵ Clark also blamed selective price discrimination—or the power of the trusts to exclude entrants by charging unreasonably low prices in that particular portion of the market where new entry was threatened.²⁴⁶ A particularly pernicious form of price discrimination was selective predatory pricing:

The ability to make discriminating prices puts a terrible power into the hands of a trust. If . . . it can sell goods at prices that are below the cost of making them, while it sustains itself by charging high prices in a score of other fields, it can crush me without itself sustaining any injury. If, on the other hand, it were obliged, in order to attack me, to lower the prices of all its goods, wherever they might be sold, it would be in danger of ruining itself in the pursuit of its hostile object. Its losses would be proportionate to the magnitude of its operations.²⁴⁷

This observation became the theory under which original section 2 of the Clayton Act was passed in 1914—namely to prevent firms from using selective, geographically limited discounts to drive rivals out of business.²⁴⁸ Finally, Clark opposed tariffs because their higher costs deterred the potential competitor “from becoming an actual one.”²⁴⁹

Several years later Clark was even more pessimistic.²⁵⁰

From a theoretical point of view competition, actual or potential, will not permit the existence of monopoly control. What would happen in theory can, I believe, be made to occur in fact. At present it does not represent the usual course of events. Effective in theory, potential competition under actually existing circumstances is impotent.

Robert L. Raymond, Industrial Combinations—Existing Law and Suggested Legislation, 20 J. Pol. Econ. 309, 312–13 (1912). At one time potential competition may have been more effective at keeping prices down, he acknowledged, but today that power had largely been eliminated by incumbent firms’ use of selective preferential rates, local discrimination, and exclusionary agreements.²⁵¹ Clark then gave a strong endorsement to the Sherman Act, although he believed that more was necessary, including a federal law chartering corporations and an “industrial commission” designed to examine the competitiveness of individual large firms. Further, he would impose on them “a burden of proof,” first to show that they do not dominate the entire market and, secondly, to show “that the way is so open for the entrance of more that prices cannot become extortionate.”²⁵²

Adams agreed in a 1903 essay on the trusts,²⁵³ as did Boston lawyer Robert L. Raymond.²⁵⁴ Raymond argued what came to be a common position held by Progressives—that potential competition was natural and ordinarily to be expected, but that dominant firms could devise practices that would prevent or limit its operation. He also observed that potential competition did not “instantaneously” become actual competition. Rather, “even with abundant capital one cannot erect a steel manufacturing plant or a sugar refinery until considerable time has elapsed.”²⁵⁵ This delay, he observed, gave dominant firms an opportunity to behave strategically.²⁵⁶ He also warned, however, that competition policy should not go further; it had to preserve the “true economic value” that they promised while also preserving the power of potential competition to limit their prices.²⁵⁷ Progressive economist Ely, who published his book on monopolies and trusts simultaneously with the Chicago Conference, doubted potential competition as a device for disciplining monopoly. He concluded that “[n]o evidence has been adduced of the sufficient action of potential competition in the case of monopoly.”²⁵⁸

Clark returned to this problem in The Control of Trusts, a book he had had originally published in 1901.²⁵⁹ For subsequent editions he was joined by his son, John Maurice Clark.²⁶⁰ The revised edition was even more pessimistic than John Bates’ original, very likely reflecting John Maurice’s more institutionalist leanings. “When the first edition of this work was issued, so called potential competition had shown its power to control prices,”²⁶¹ the Clarks lamented, but

[t]he potentiality of unfair attacks by the trust tended to destroy the potentiality of competition. Under these conditions it was and is clearly necessary to disarm the trusts—to deprive them of the special weapons with which they deal their unfair blows. It is necessary to repress the specific practices referred to and so to enable every competitor who, by reason of productive efficiency, has a right to stay in the field, to retain his place and render his service to the public.²⁶²

As a result, they concluded, while experience has shown that “potential competition is a real force, it has also shown that it is a force which can be easily obstructed.”²⁶³ A few years later, John Maurice Clark argued that potential competition was an unlikely discipline for monopoly in markets with “heavy permanent investment”—that is, with high fixed costs.²⁶⁴ In such cases, he noted, incumbent firms will be holding excess capacity and be able to expand their own output in response to new entry. Knowing this, potential competitors will not wish to make a significant investment in entry.²⁶⁵ Further, he observed, prospective entrants into such a market would realize that total output would be higher when their own production was added in, and thus prices lower. So what appeared to be profitable entry before might not be so later.²⁶⁶

The Clarks’ work developed the basic model that emerged by mid-century for monopolization cases and that prevails today. That judge-made formulation required a showing of both monopoly power and anticompetitive practices. This model retained faith that in a market that is not restrained by either the government or private action, new entry could be expected to maintain competition. The problem for the antitrust laws was anticompetitive practices that forestalled competitive entry before it could occur or become effective. “A merely possible mill which as yet does not exist may forestall and prevent monopolistic acts,” the Clarks conceded, but only provided that the way is “quite open for it to appear.”²⁶⁷

Writing in 1911 about the ongoing government cases against Standard Oil and American Tobacco, Raymond observed that the firms’ growth had depended on the suppression of potential competition.²⁶⁸ In American Tobacco, the district court condemned a trust agreement that involved a group of the same shareholders’ acquiring interests in multiple companies. The court acknowledged the defense that potential competition would discipline any monopoly because the combination itself did not involve any sort of market exclusion.²⁶⁹ But entry would take some time, the court observed, and the “objection is to present and not future conditions.”²⁷⁰ The court believed that argument to be worthy of “serious consideration.”²⁷¹

By contrast, in the 1918 United Shoe Machinery (“USM”) merger case the Supreme Court refused to condemn the union of several shoe machinery makers into what became the USM Company.²⁷² The government’s argument was that the merged companies were potential competitors who could have turned into actual competitors but for the merger. The case thus invited a tradeoff question that remains to this day: some mergers increase productive efficiency by enabling a firm to do things at lower cost, but in the process may harm competition by preventing competition that might have developed had the merger not occurred.

The USM union was a merger of complements, and the district court had concluded that the individual companies were not in competition with one another at the time of the merger.²⁷³ Justice Holmes had actually elaborated on that conclusion several years earlier in a decision that approved the original merger.²⁷⁴ He also observed that the participating firms had not been competitors but rather were makers of complements. One firm produced lasting machines, another welt-sewing machines, and others outsole-stitching machines and heeling machines.²⁷⁵ It was not the purpose of the Sherman Act to “reduc[e] all manufacture to isolated units of the lowest degree.”²⁷⁶ In this case “the combination was simply an effort after greater efficiency.”²⁷⁷ He compared the merger to a situation in which a single firm was created to make “every part of a steam engine,” rather than using the antitrust laws to force “one to make the boilers and another to make the wheels.”²⁷⁸

In the American Can case, which condemned the can-making trust but declined to break it up, the court also cited potential competition as the reason for being cautious about the remedy.²⁷⁹ The court observed that the American Can Company, given its large size and multiple plants, was highly efficient and made good cans.²⁸⁰ Further, the record revealed “that there are many ways in which a large and strong can maker can serve the trade, and a small one cannot.”²⁸¹ In any event, the defendant’s power to restrain competition was limited by “a large volume of actual competition and to a still greater extent by the potential competition” from which it cannot escape.²⁸² For example, when the defendant raised its price—perhaps prematurely believing that it had destroyed enough rivals—new competitors quickly re-emerged. It became “apparently profitable for outsiders to start making cans with any antiquated or crude machinery they could find in old lumber rooms.”²⁸³ At that point the defendant became so desperate that it actually started buying cans from its rivals, even though these were “very badly made.”²⁸⁴ Many of these were later destroyed.²⁸⁵

The language of potential competition evolved into the modern doctrine of “barriers to entry,” a term that came into common use at mid-century. An entry barrier could be either natural or fabricated obstacles that made it more difficult for competition to enter the market. The Supreme Court first used the term in the American Tobacco case, when it referred to the defendant’s acquiring control of numerous “seemingly independent corporations, serving as perpetual barriers to the entry of others into the tobacco trade.”²⁸⁶ More specifically, the Court referred to the defendant’s acquisition of plants “not for the purpose of utilizing them, but in order to close them up and render them useless,” and also to noncompetition clauses placed on sellers that kept them from re-entering the market.²⁸⁷ A few years later a district court quoted this language in condemning Eastman Kodak of monopolization by acquiring around twenty companies and assembling all of the components of the photography industry.²⁸⁸ The phrase did not find much use in the economic literature until the 1940s, followed by significant expansion in the 1950s.²⁸⁹ It entered the mainstream antitrust literature after Joe S. Bain’s pioneering work on barriers to entry in the 1950s.²⁹⁰

2. From Potential Competition to the Relevant Market

As long as confidence was high that potential competition could be trusted to control prices, the precise definition of the market in which firms operated was relatively unimportant. Even monopolists could be kept in check if potential competition was robust. The assumption of robust potential competition explains both why early antitrust decisions involving dominant firms were not particularly fussy about market definition and also why they tended to emphasize detailed litanies of exclusionary practices. Monopolization was all about harmful conduct intended to exclude rivals.

As confidence in the efficacy of potential competition waned, however, it became more important to know the number and robustness of a firm’s actual competitors. Any discipline of monopoly would come primarily from them. As John Maurice Clark observed in 1923, for most markets “it is inherently impossible to have industry effectively governed by potential competition alone.”²⁹¹

Concerns about potential competition are inherently dynamic. They ask about where a market is going, rather than how it may appear at this moment. In fact, accounting for movement and the ability to make useful predictions about it is one of the most challenging questions of antitrust policy.²⁹² Classical economists assumed markets were competitive unless the government intervened because they focused so completely on the long run. The fact that monopoly might be dissipated by new market entry is certainly reassuring. Eventually such a market may reach an acceptably competitive equilibrium, but how long will that take, and who will be affected along the way? Focusing on macroeconomics in the 1920s, John Maynard Keynes ridiculed the optimistic faith of many economists that eventually the economy would move to a healthier equilibrium. In contrast stood the policy maker’s more immediate concerns about time. He famously concluded that the “long run is a misleading guide to current affairs. In the long run we are all dead.”²⁹³ Further, focusing on the long run makes economics worthless as a policy tool: “Economists set themselves too easy, too useless a task if in tempestuous seasons they can only tell us that when the storm is long past the ocean is flat again.”²⁹⁴

The “relevant market” in antitrust analysis emerged as a device for trading off these static and dynamic concerns. First of all, it revealed who was competing with whom in the present instant. If the market was well defined and included consideration of entry barriers, it also estimated what was likely to change over time. The evolving concern was with how rivals and customers would respond to a future price increase above competitive levels.

The idea of a “relevant market” is entirely a creature of partial equilibrium analysis. While that proposition is uncontroversial, it was not commonly acknowledged in the antitrust literature until Oliver Williamson began talking about antitrust policy and welfare tradeoffs in those terms in the 1960s.²⁹⁵ As Marshall had observed, in selecting a market economists should group sales of close substitutes and then make a working assumption that those within the grouping affect one another’s behavior, but that firms outside of the group do not.²⁹⁶ Marshall also realized that this was a simplifying assumption and not a hard picture of a situation in which the elasticity of substitution between goods in the same market is infinitely high, while the substitution between goods inside and goods outside is zero.²⁹⁷

Our partial equilibrium analysis suffers from a defect common to all partial equilibrium constructions. By isolating one sector from the rest of the economy it fails to examine interactions between sectors. Certain economic effects may therefore go undetected, and occasionally behavior which appears to yield net economic benefits in a partial equilibrium analysis will result in net losses when investigated in a general equilibrium context.

Id. Today we commonly say that to the extent a market is “well defined” these two conditions come closer to applying.

Assessing antitrust practices by reference to the “market” in which they occur naturally produced several questions about delineation and measurement. The most obvious one was how to identify the particular grouping of firms to which the analysis should be applied. Marshall himself paid scant attention to the issue. He identified the grouping of sales in a particular market as a “commodity.” His favorite example was tea.²⁹⁸ In that case, sales of tea constituted the relevant market. He gave only a little thought to questions about whether tea competed with coffee or water, or even the extent to which a coffee producer might switch to tea in response to a higher price. He did conjecture at one point that a failure in the coffee harvest might lead to an increase in demand for tea.²⁹⁹ He made a similar conjecture about beef and mutton.³⁰⁰ He also noted that questions about “where the lines of division between different commodities can be drawn must be settled by the convenience of the particular question under discussion.”³⁰¹ For some purposes, he acknowledged, we might even acknowledge Chinese and Indian teas as different.³⁰²

Early Sherman Act cases took roughly the same approach, never putting a fine point on market definition. For example, neither the 1911 Standard Oil³⁰³ nor American Tobacco³⁰⁴ decisions discussed the boundaries of the “market” under consideration. In StandardOil, the Court referred repeatedly to “petroleum and its products,”³⁰⁵ without saying anything about what that might include. In American Tobacco, the Court did observe that the defendant produced a number of products, including “cheroots, smoking tobacco, fine cut tobacco, snuff and plug tobacco.”³⁰⁶ The Court did discuss some vertical practices that involved specific products. For example, the defendant also tried to control sales of licorice paste, an essential ingredient in plug tobacco, in order to exclude rivals.³⁰⁷ The Court never spoke of any of these products as relevant markets, or considered whether they were in the same or different markets.

The American Can decision a few years later described a large litany of bad practices but said virtually nothing about the scope of the market, other than to refer to it as “cans.”³⁰⁸

Id. at 989; see also United States v. Corn Prod. Refin. Co., 234 F. 964, 974, 976 (S.D.N.Y. 1916), appeal dismissed, 249 U.S. 621 (1919) (condemning a trust, but in the process noting that the relevant process included both wet milling and dry milling of corn; the court observed that cost distinctions among them were relevant). The Court wrote:

If the wet process is cheaper than the dry, then, although a monopoly of the wet will be limited by the dry, it is improper to consider the production of the dry millers, when ascertaining the proportion of production controlled by a supposed monopolist of wet milling. If, on the other hand, the dry process is cheaper than the wet, and if, which would be hardly possible, a sustained competition between them existed, then one could not disregard the dry production for all purposes.

Id. at 976; accord O’Halloran v. Am. Sea Green Slate Co., 207 F. 187, 193 (N.D.N.Y. 1913), rev’d on other grounds, 229 F. 77 (2d Cir. 1915) (noting that where black and green slate competed for some buyers but the green slate manufacturers had both production and cost disadvantages, their power was limited by price of black slate); cf. Standard Oil Co. v. United States, 283 U.S. 163 (1931) (noting that although gasoline made by traditional refining methods and the defendant’s large scale “cracking” method was fungible, the latter had an advantage in production costs). The court gave no thought to such questions as whether glass bottles, which were also widely used for preserving food,³⁰⁹ were in the same market. Such questions arose regularly after mid-century.³¹⁰

The International Shoe case, decided in 1930, included a brief discussion of the proper delineation of a product market. It also reflected the emergence of product differentiation as a factor in market analysis. The FTC challenged a merger of two manufacturers of dress shoes. McElwain made more expensive, attractive, and “modern” shoes entirely of leather. International made cheaper shoes that included some non-leather components.³¹¹ Without discussing the scope of the market, the Court did credit the defendants’ testimony that there was “no real competition” between the two firms.³¹²

Estimating market power today by reference to a share of a “relevant market” is not a pure exercise in static partial equilibrium analysis. In Marshall’s model, one examined equilibrium in the market under study on the assumption that the price and output of everything else remained constant.³¹³ However, he also acknowledged that this assumption often fails to obtain in the real world:

[T]he demand schedule represents the changes in the price at which a commodity can be sold . . . other things being equal. But in fact other things seldom are equal over periods of time sufficiently long for the collection of full and trustworthy statistics . . . . This difficulty is aggravated by the fact that in economics the full effects of a cause seldom come out at once but often spread themselves out . . . .”³¹⁴

A price increase naturally invites other sellers to move into the price increaser’s market territory and customers to defect away. These substitutions upset the equilibrium, and within Marshall’s model, continue to occur until the equilibrium is restored. To the extent the market is more rigorously defined and the market share of the price increaser is higher, the movements would take longer or be less likely to occur.³¹⁵

The 1940s and 1950s saw a significant expansion in antitrust usage of relevant markets to estimate market power. Judge Hand’s discussion in the Second Circuit’s 1945 decision in United States v. Aluminum Co. of America has become well known.³¹⁶ The first Supreme Court decision to contain a significant discussion about the scope of a relevant market was United States v. Columbia Steel Co. in 1948.³¹⁷ It concluded that the market that the government alleged was too narrow.³¹⁸ First, the area of effective competition was larger than the government claimed. Second, the two firms actually made different although somewhat overlapping types of steel. On a 5–4 vote, it dismissed the complaint. Justice Douglas’s dissent (joined by Justices Black, Murphy, and Rutledge) contained almost no discussion of the relevant market except to dispute the fact that the acquired firm’s three percent share of the purchasing market under consideration was insubstantial.³¹⁹

The chronology of these concerns is revealing because of what it says about the declining faith in potential competition to solve monopoly problems. As noted previously, as of 1899 even monopoly was not a matter of concern for some participants in the Chicago Trust Conference because potential competition could be trusted to keep prices down.³²⁰ Subsequently, greater doubts about the disciplinary effects of new entry naturally led to increased concerns about just how competitive the market was when entry is disregarded. By the 1930s most antitrust cases involving large firms were harboring significant doubts about the ameliorating effects of potential competition. That explains the rising importance of market definition in antitrust cases.

3. The Rise of Structuralism and the Diminishing Importance of Conduct

As Chief Justice White observed in the 1911 Standard Oil decision, the monopolization offense required bad conduct and not mere monopoly status. Chief Justice White’s reasoning was that the practices condemned by section 1 of the statute actually forbade “all means of monopolizing trade, that is, unduly restraining it by means of every contract, combination, and so forth.”³²¹ To this, section 2 of the Sherman Act sought,

if possible, to make the prohibitions of the act all the more complete and perfect by embracing all attempts to reach the end prohibited by the first section, that is, restraints of trade, by any attempt to monopolize, or monopolization thereof, even although the acts by which such results are attempted to be brought about or are brought about be not embraced within the general enumeration of the first section.³²²

The lower court had spoken much more clearly: section 2 should require a restraint of trade as embraced by section 1, but the difference was that “[o]ne person or corporation may offend against the second section by monopolizing, but the first section contemplates conduct of two or more.”³²³ That is in fact the distinction that modern courts have adopted.

What the statute did not do, Chief Justice White continued, was condemn “monopoly in the concrete,” or the mere status of being a monopolist.”³²⁴

At that point, however, the Chief Justice cast his entire reasoning and perhaps even his mental acuity into doubt with his infamous argument that because “reason was resorted to” in deciding earlier cases the law reached only unreasonable actions.³²⁵ That dubious rationale presaged the more formal recognition of a rule of reason seven years later.³²⁶

All of this was in pursuit of a larger point, as the Chief Justice elaborated, which was to shunt aside the argument that a court could not constitutionally divest an innocent firm of its property without compensation simply because it was a monopolist. Property owners had no right to engage in restraints on trade.³²⁷ Rather, the statute was directed to “particular acts,” even though these were inferred only “generically” from the statutory language.³²⁸ That is, requiring wrongful acts—even though the statute did not explicitly list them—was essential to the statute’s constitutionality.³²⁹

No doubt it would be desirable to define what constitutes a monopoly or an attempt to monopolize a part of interstate trade or commerce; but it is very questionable whether a comprehensive and clear statutory definition could be framed. A statutory definition probably would give rise to as much uncertainty and litigation as the word “monopolize,” and judicial decisions would be necessary to define the definition itself. The safer and better course is to let the courts, guided by common understanding of the word “monopolize” and by the principles of the common law, settle the meaning of the statute by determining its application to individual cases as they arise.

Victor Morawetz, Should the Anti-Trust Act Be Amended?, 22 Harv. L. Rev. 492, 497–98 (1909). At that point the opinion turned to a detailed summary of Standard Oil’s conduct.

The Clayton Act developed this theme further by its enumeration of specific acts that threatened to create monopoly—namely selective and discriminatory predatory pricing,³³⁰ tying and exclusive dealing contracts,³³¹ and anticompetitive mergers.³³² Nothing in the Clayton Act even hints of possible condemnation of monopoly without fault; indeed, its added specificity points in the other direction.

It is thus not surprising that Progressive Era monopolization cases often read like tort cases—with an extensive discussion of conduct, accompanied by relatively thin treatment of market structure and power. This period preceded the structuralist revolution that would occur in the late 1930s and 1940s. Indeed, some commentators from the period wrote of the monopolization offense as if it did not contain a market power requirement at all, but only guilty conduct.³³³ After World War II antitrust policy as led by industrial economists completely flipped that script.³³⁴

Already in the 1930s some industrial economists began to study the monopoly problem by looking at the types of structures most likely to produce it. In 1937 Harvard industrial economist Edward S. Mason observed that in “recent years economic thinking on the subject of monopoly has taken a radically different trend.”³³⁵ It began with the observation that “monopoly elements” of conduct were apparent in the “practices of almost every firm.”³³⁶ As a result, policy makers were increasingly required to make “distinctions between market situations all of which have monopoly elements.”³³⁷ For that, conduct alone provided little basis for differentiation. The important differences were not the conduct but rather the markets in which the conduct occurred. He noted an emerging distinction between “restriction of trade” and “control of the market.”³³⁸ If economics was to make a contribution to the problem of monopoly, Mason observed, it must move beyond practices and descriptive accounts of anticompetitive behavior and look for structural features that made markets more or less conducive to monopolization.³³⁹

The development of imperfect competition theories in the early 1930s forced a shift in focus toward the particular market structures that made noncompetitive outcomes more likely. Some of the foundational work was done earlier. For example, in the 1920s economist John Maurice Clark looked at the manifold sources of economies of large plant size.³⁴⁰ The economies, which resulted from technology and engineering, were inherent in certain industries. In addition, the presence of high fixed (“overhead”) costs provided an explanation for price discrimination, showing it to be typically but not invariably procompetitive.³⁴¹ Clark also discussed “economies of combination,” showing how the effect of high fixed costs and large plant size made markets more conducive to both horizontal and vertical control arrangements.³⁴² In such industries “large-scale production, combination, and monopoly or restricted competition are all more or less bound together, and all occur in the same class of industries.”³⁴³ Everything in Clark’s book pointed in the direction of assessing competition problems by assessing the particular structural characteristics of each firm, emphasizing the extent and nature of fixed costs.

Clark’s book was too technical to have widespread public appeal, but it did both reflect and lead an important set of developments in the field of industrial economics. Antitrust policy became more interested in the types of market structures that made noncompetitive outcomes more likely. Enforcement policy followed these developments, culminating in massive monopolization cases brought against capital intensive firms in the 1930s and after, including Alcoa and USM. Both decisions emphasized market structure and market definition and de-emphasized conduct. Indeed, both toyed with but did not ultimately embrace the idea of monopoly “without fault”—or that certain dominant firms should be broken up simply because they are too big. In Alcoa, Judge Learned Hand discussed the possibilities of a presumption that a firm that had acquired a ninety percent market share was behaving unlawfully. It could defeat that presumption, however, by showing that monopoly had been “thrust upon it,” or that it was merely the “passive beneficiary” of monopoly.³⁴⁴ A few years later in the USM case, Judge Wyzanski characterized Alcoa as suggesting that a firm with an overwhelming market share monopolizes whenever it “does business.”³⁴⁵ That was as close as American antitrust law ever came to a rule of no fault monopolization.

With those decisions the courts entered the era of antitrust structuralism, which in its strongest form made evidence of bad conduct almost but not quite irrelevant.³⁴⁶ That largely ended the Progressive Era’s tort theory of monopolization.

III. THE EMERGENCE OF VERTICAL COMPETITION POLICY

A. “Competition,” Horizontal and Vertical

Progressives were the first to examine vertical practices and vertical integration systematically as competition problems. While some law of vertical contracting practices existed prior to that, almost none of it was concerned with competition. The Progressive accomplishment was noteworthy, because vertical business practices have historically been the most poorly understood in antitrust and have provoked the most controversy. Articulate writers have argued that they should be governed by both the extreme rules of per se illegality and per se legality.³⁴⁷ The Progressives in fact opted for a highly defensible middle ground that has proven to be very durable.

Progressive Era contributions to the law of vertical integration and restraints were formative but also modest.³⁴⁸ Mainly, they focused on the relationship between vertical integration or vertical contracting and realistic threats of monopoly. Subsequently the antitrust law of vertical business relationships veered to the left and became very aggressive, condemning many practices where harm to competition was never seriously threatened.³⁴⁹ Later it changed course again, veering very far to the right and developing rules of virtual nonliability in Chicago School academic writing. The case law never went quite that far.³⁵⁰ Since 2000 or so it has been moderating once again. The rule of reason that is currently the law for nearly all vertical practices is in between, although somewhat closer to a rule of nonliability.³⁵¹ This is at least partly because the courts have made it so difficult for plaintiffs to win rule of reason antitrust cases.

The thoroughly conventional distinction that antitrust and economics makes today between “horizontal” and “vertical” practices is actually a fairly recent development. Today most of our antitrust rules of illegality are driven by it: horizontal restraints are more suspicious than vertical ones. Unlike horizontal agreements, vertical agreements do not increase the effective market share of the participants. The means by which horizontal price fixing agreements reduce market output are more obvious and better understood than for vertical price agreements. Vertical arrangements have a greater potential to produce cost savings.

Classical political economists and most lawyers prior to the 1910s or so did not see these distinctions. They tended to see competition as “rivalry,” and the vertical rivalry that might occur between a buyer and a seller, or employer and employee, counted as “competition” just as much as the rivalry between two competitors. For example, in the 1888 edition of his popular text on political economy, MIT economist Francis Walker defined competition as “the operation of individual self-interest among buyers and sellers.”³⁵²

Marshall did only a little better in Principles of Economics. On horizontal competition, he focused almost entirely on the theory of monopoly, or single firms that accounted for all sales in a market. In a footnote he spoke briefly about “partial monopoly,” which he described as a firm whose wares were better known than those of other firms.³⁵³ Marshall’s chapter on “The Theory of Monopolies” largely assumed exclusivity and focused on how the monopolist determines its output and price when there is no threat of entry.³⁵⁴ He did mention that a vulnerable monopolist, such as a railroad threatened by new competition, would very likely charge a lower price in order to protect its trade.³⁵⁵ Never once in the 750 pages of the first edition did Marshall mention cartels or price fixing. While he drew his theory of marginalism from Cournot,³⁵⁶ he never discussed Cournot’s very influential theory of oligopoly. He did mention the rise of the American trusts in his Eighth Edition in 1920, seeing them largely as an alternative to German cartels³⁵⁷

The economies of highly organized buying and selling are among the chief causes of the present tendency towards the fusion of many businesses in the same industry or trade into single huge aggregates; and also of trading federations of various kinds, including German cartels and centralized co-operative associations. They have also always promoted the concentration of business risks in the hands of large capitalists who put out the work to be done by smaller men.

Marshall, supra note 89, at 282. On the United States trust as an alternative, see id. at 304. and ultimately describing them as “treacherous.”³⁵⁸ He also saw the evil of the trusts as “narrowing . . . the field of industry which is open to the vigorous initiative of smaller businesses.”³⁵⁹ None of these discussions mentioned vertical integration or restraints.

Marshall’s relatively infrequent expressions about “competition” seem almost amateurish today—for example: “The strict meaning of competition seems to be the racing of one person against another . . . .”³⁶⁰ He complained that the term “competition” has “gathered about it evil savour, and has come to imply a certain selfishness and indifference to the well being of others,”³⁶¹ and that “unrestrained competition” produced suffering.³⁶² He spoke of competition as “glorified individualism.”³⁶³ He also lamented that machine production had led to undesirable competition that, “like a huge untrained monster,” led to weakness and disease.³⁶⁴ He blamed this on excessive British protection for liberty of contract.³⁶⁵ Marshall made the same complaint about labor, where he saw unfettered competition as driving wages to subsistence levels.³⁶⁶

Marshall also had little to say about vertical integration and vertical relationships, and nothing about their impact on competition. His few mentions focused on labor. For example, he distinguished horizontal movement of workers from one firm to another from vertical movement, or promotion within a firm.³⁶⁷ Speaking again of labor, he also discussed the “vertical” competition that existed between skilled and unskilled workers who performed the same task.³⁶⁸ He concluded that for workers competition was both vertical and horizontal. First, they competed vertically for advancement within the firm. Second, they competed horizontally by movement from one employer to another.³⁶⁹ In Chapter 8, entitled “Industrial Organization,” he used the term “integration” a single time, using a biological metaphor. He defined it as “a growing intimacy and firmness of the connections between the separate parts of the industrial organism.”³⁷⁰ Late in his life, in his much less prominent and overly long book on Industry and Trade(1919), Marshall began exploring some of the differences between horizontal and vertical expansion.³⁷¹

Prior to 1910 or so, courts also viewed “competition” in terms that did not distinguish the horizontal from the vertical. Often the reference was to the “competition” that exists between the two parties to a bargain, with the seller wishing to receive as much as possible while the buyer wished to pay as little as possible. For example, John D. Park & Sons Co. v. Hartman,³⁷² one of the earliest Sherman Act challenges to resale price maintenance, spoke of the practice as “protecting the seller of property against the competition of the buyer.”³⁷³ The Supreme Court of Oklahoma treated resale price maintenance agreements as a form of noncompetition covenant, used to protect “the seller of the property against the competition of the buyer.”³⁷⁴ Today, of course, we would characterize the relationship between a buyer and a seller as vertical, at least in most cases.

Even Justice Holmes, whose grasp of economics was better than that of most contemporary judges, spoke of competition interchangeably as horizontal or vertical. While a Justice on the Supreme Judicial Court of Massachusetts, he had defined competition in a tort case as “not limited to struggles between persons of the same class” but rather as applying “to all conflicts of temporal interests.”³⁷⁵ He continued, offering a purely vertical illustration:

One of the eternal conflicts out of which life is made up is that between the effort of every man to get the most he can for his services, and that of society, disguised under the name of capital, to get his services for the least possible return.³⁷⁶

In keeping with more modern views, in 1908 the Supreme Court of Illinois rejected that characterization, describing it as “fanciful and far-fetched.”³⁷⁷ It then concluded that an employer and its unionized employees could not be said to be in “competition” with one another, even though their interests clearly diverged.³⁷⁸

Holmes also dissented from the U.S. Supreme Court’s decision condemning resale price maintenance. The Court had reasoned that resale price maintenance was a restraint on alienation that served to eliminate competition among dealers in the sale of Dr. Miles’s brand of medicines.³⁷⁹ Holmes responded that the competition of “conflicting desires” should be sufficient to do that for most goods that were not essential, and Dr. Miles medicines were not.³⁸⁰ If a good was not essential (Holmes’s example was “short rations in a shipwreck”), the price would be set by the “competition” between the seller’s wish to charge more and the buyer’s wish to pay less.³⁸¹ In the Northern Securities merger case he dissented from the majority’s condemnation of a merger to monopoly under section 1 of the Sherman Act.³⁸² The “act says nothing about competition,” he observed.³⁸³ He then described the litany of common law situations characterized as contracts in restraint of trade and concluded that the facts of the present case did not fit into any of them.³⁸⁴ The idea that elimination of competition between firms that had previously been rivals might result in higher prices did not obviously trouble him.

With one implicit exception, the Sherman Act itself never distinguishes vertical from horizontal practices. The exception is the reference to “contracts . . . in restraint of trade” in section 1 of the Act.³⁸⁵ As Justice Holmes pointed out in his Northern Securities dissent, at common law that phrase referred to “contracts with a stranger to the contractor’s business, . . . which wholly or partially restrict the freedom of the contractor in carrying on that business as otherwise he would.”³⁸⁶ Justice Holmes gave as an example the British decision in Mitchel v. Reynolds.³⁸⁷ The lessor of a building to be used by the plaintiff as a bakery promised not to open a competing bakery in the vicinity. Noncompetition agreements such as these are vertical because they are formally between the seller (lessor) and buyer (lessee) of property or in other situations between an employer and an employee. Nevertheless, the agreement also has a horizontal effect to the extent that its purpose is to limit the competitive choices of the promisor. In Mitchel, the lessor had promised the lessee that he would not enter into business in competition with the lessee.

Even the Clayton Act, passed in 1914, ignored vertical competition issues with one limited exception. That was section 3, which prohibited the sale of commodities on the “condition or understanding” that the buyer not deal in a competitor’s goods.³⁸⁸ This of course became the basis for the modern law of tying and exclusive dealing. Even here, however, while the law condemned a vertical agreement, the impact was horizontal. The concern was agreements that limited competition from rivals. Further, its historical focus was on patent license agreements in which it was thought that patentees used ties to extend their patent beyond its lawful scope.³⁸⁹ The Clayton Act did not seek to expand the law of purely vertical restraints that limited only the sales of a manufacturer’s own product.

Section 2 of the original Clayton Act prohibited price discrimination directed at rivals, a form of predatory pricing.³⁹⁰ That was a purely horizontal practice. The provision was amended in 1936 as the Robinson-Patman Act so as to reach so-called “secondary line” price discrimination, or the charging of two different prices to two different customers, favoring the customer who paid the lower price.³⁹¹ These 1936 amendments effectively turned it into a predominantly vertical statute. Ever since, the Act has distinguished “primary line” (horizontal) and “secondary line” (vertical) violations. However, the first was entirely a creature of the original 1914 Act, while the second was developed by the 1936 amendments.³⁹²

Likewise, the original Clayton Act provision condemning mergers reached only those that limited competition “between” the merging firms—that is, mergers of competitors.³⁹³ It was amended and extended to vertical mergers in 1950, as it appears today.³⁹⁴

In sum, while the Clayton Act greatly expanded upon the Sherman Act and the Supreme Court largely interpreted it that way, it was concerned almost exclusively with horizontal practices. It became “vertical” only through amendments passed in the mid-1930s and after. Outside the law of resale price maintenance, which did not have a well-developed economic rationale other than the concern for restraints on alienation, competitive concerns about vertical integration had not yet emerged. While the Sherman Act’s concern with contracts in restraint of trade and the Clayton Act’s concern with tying were both vertical, today we would characterize both as “interbrand” restraints. That is, they were vertical contracts aimed at limiting horizontal competition.

At the same time, however, section 3 of the Clayton Act and the 1917 Motion Picture Patents case became important vehicles for developing a theory of anticompetitive vertical practices that expanded greatly in the 1930s.³⁹⁵ Notably, however, the practice in that case was substantially horizontal, directed at insulating the patentee’s films from the films offered by rivals.³⁹⁶

B. Progressive Economics and Vertical Integration

One important chapter in Adam Smith’s Wealth of Nations was entitled “That the Division of Labour is Limited by the Extent of the Market.”³⁹⁷ Smith’s point was that larger markets permit greater specialization because businesses are able to depend more on exchange rather than internal supply.³⁹⁸ In isolated villages in Scotland, “every farmer must be butcher, baker and brewer, for his own family,” and in such towns it is hard even to find a professional carpenter or mason.³⁹⁹ From Smith’s insight that larger markets lead firms to rely more on others for certain inputs, Stigler fashioned a theory that small markets provide an impetus for internal vertical integration.⁴⁰⁰ As markets grow larger firms have more opportunities to buy rather than make. But to turn that argument into one that saw Adam Smith as developing a general economic theory of vertical integration was a stretch.

During antitrust’s early years the idea of competitively harmful vertical practices was almost entirely absent in economics as well as law. Very little of that occurred prior to the twentieth century. In the 700 pages of proceedings of the Chicago Conference on Trusts⁴⁰¹ neither lawyers nor economists ever once discussed vertical integration or vertical practices as a competition problem. A few years later federal courts first began addressing resale price maintenance under the Sherman Act.⁴⁰²

Exploration of vertical business relationships and competition policy began to enter economics literature in the early twentieth century, although somewhat haphazardly. Notwithstanding the heightened Progressive concern about the trusts, they did not see vertical integration or vertical control as a threat. The early discussions spoke of it in very benign terms. In 1901 William F. Willoughby, a political scientist and lawyer who taught at both Harvard and Princeton, concluded that the competitive effects of vertical integration were overwhelmingly positive.⁴⁰³ Speaking of Andrew Carnegie’s steel company, he concluded that the “policy of the company” in integrating vertically was “not in attempting to lessen outside competition, but in seeking to bring about a more perfect organization and integration of its own properties.”⁴⁰⁴ Overall, he believed, the principal reason that firms integrated vertically was to ensure themselves of adequate and timely supply in the event of shortages.⁴⁰⁵

Progressive economist and President of the University of Wisconsin Charles Van Hise’s 1912 book on the Trust Problem spoke a single time of “vertical combination.”⁴⁰⁶ He was referring to vertical integration in the steel industry, but drew no conclusions about vertical integration generally.⁴⁰⁷ John Bates and John Maurice Clark’s important 1914 book on The Control of Trusts never discussed vertical practices or vertical integration at all.⁴⁰⁸ That omission is significant because at the time the father, John Bates, was one of the most prominent economists in the country and a leading marginalist. More generally, their book was a fierce indictment of the trusts.

In his 1919 book on Industry and Trade, written near the end of his life, Marshall did speak several times about the “vertical expansion” of firms into markets for supply or distribution.⁴⁰⁹ He noted, for example, that firms sometimes integrated vertically in order to avoid the effects of upstream cartels.⁴¹⁰ His only sustained discussion of vertical integration was in relation to firms that did so in order to assure sources of supply or distribution, and he spoke of it entirely in benign terms.⁴¹¹ A few other economists did talk about vertical integration, mainly to emphasize the efficiencies that vertical control made possible.⁴¹²

John Maurice Clark’s 1923 book on fixed (“overhead”) costs did contain a more detailed discussion of vertical integration.⁴¹³ He spoke briefly of “vertical combination” in the steel industry⁴¹⁴ and more generally in a chapter entitled “Economies of Combination.” He described it as “the combination under one management of successive stages in a chain of productive operations.”⁴¹⁵

Clark cast the vertical integration problem as one of managing information and fixed costs: “The employer’s knowledge of his own needs and of the conditions of his own business is an expensive industrial asset . . . .”⁴¹⁶ Further,

[A]nother gain from integration arises, in the shape of great reliability in the supplying of materials. The two concerns adapt their processes to each other, and the supply of materials, both in quality and regularity, can be more carefully suited to the needs of the user than they would be if the two were independent concerns . . . .⁴¹⁷

As a result, “[a]nother thing that is saved is all the work of negotiation, bargaining, higgling, stimulating demand (on the part of the seller) . . . and much of the other work of buying and selling, which could be reduced to a matter of routine.”⁴¹⁸ He described this as “an overhead outlay which is capable of being enormously reduced by vertical combination.”⁴¹⁹ Clearly Coase was not the first to observe that internal integration is a way of avoiding the costs of using the market.⁴²⁰ Clark’s own contribution was mainly to observe that high fixed costs and product differentiation exacerbated problems of market coordination of upstream and downstream levels.

During the Depression the economic treatment of vertical practices did an about face, becoming much more critical, minimizing the role of cost savings or even finding them harmful, and focusing on problems of monopoly.⁴²¹ One of the most pessimistic was economist Arthur R. Burns’s 1936 book The Decline of Competition, which was heavily influenced by the theory of monopolistic competition. He presented vertical integration as inherently monopolistic and as strong evidence that competition was in decline.⁴²²

C. Progressives and the Emerging Law of Vertical Integration

In distinguishing vertical from horizontal practices, the difficult part was to determine how a firm’s control of a vertically related market affected competition. As previously noted, economists of the day were keenly aware that vertical integration could reduce costs.⁴²³ So were many courts. Already in 1866, a British decision observed that one effect of a railroad’s acquisition of a colliery was to reduce the cost of coal necessary for its operations.⁴²⁴

The courts were also aware of foreclosure threats but did not generally find them decisive. In 1886, the Supreme Court held that a railroad that had integrated into express freight delivery services had no obligation to provide equivalent services for an independent delivery company.⁴²⁵ Justices Miller and Field dissented. Given that the delivery service was a complement to the railroad, they observed, the effect of the refusal would be to exclude competing express companies from the markets served by that railroad. There was no relevant antitrust law or even an Interstate Commerce Act, which was passed a year later.⁴²⁶ Rather, they would have found a duty under the common law of common carriers.⁴²⁷ A few years later the first Justice Harlan wrote the opinion for a unanimous Court declaring that an exclusive dealing contract between a railroad and a provider of sleeping cars was not contrary to public policy or common law.⁴²⁸ The action did not rely on any federal statute.

Speaking of noncompetition covenants, which are a form of vertical exclusive contracting, Judge Taft’s 1898 antitrust opinion in United States v. Addyston Pipe & Steel Co.⁴²⁹ noted that they could sometimes be harmful. They might injure the parties by depriving them of opportunities; or they might deprive the public of services that would be valuable and thus discourage enterprise. In addition, he gave two reasons more directly related to competition policy: they might “prevent competition and enhance prices,” and they “expose the public to all the evils of monopoly.”⁴³⁰ For its part, the common law approved the great majority of vertical agreements with the exception of some noncompete agreements.⁴³¹ In any event, Judge Taft’s statements in Addyston Pipe were dicta, because the case involved only naked horizontal price fixing.

That analysis still left many questions open. For example, how does one account for the fact that vertical arrangements may simultaneously reduce costs and exclude rivals? One of these things seems beneficial and the other harmful. Further, how much weight should be given to the common law’s traditional strong protection for liberty of contract and the freedom to trade? Those concerns loomed large in cases involving resale price maintenance and other vertical restraints, where the freedom to trade came to be the freedom to be free from restrictions on distribution.⁴³² As the Supreme Court reiterated in a 1919 decision declining to find an agreement to engage in resale price maintenance, the purpose of the Sherman Act is to “preserve the right of freedom to trade.”⁴³³

Historically the common law did recognize limitations on business firms’ vertical integration by contract, but the concerns did not relate to competition policy. First was the common law policy against restraints on alienation, which courts had used regularly to decline enforcement of certain types of contracts.⁴³⁴ Later on, antitrust decisions cited this policy as a rationale for using the Sherman Act to condemn vertical contractual limitations on resale, including resale price maintenance.⁴³⁵ The Supreme Court cited concerns about restraints on alienation in an antitrust case as recently as 1967, when it declared territorial restraints on dealers to be per se antitrust violations.⁴³⁶

Second, many exclusive dealing and similar contracts were incomplete because they did not specify price or quantity. The common law itself exhibited a strong preference for “one off” contracts that contemplated sales with precise terms covering all important elements. Here, the most frequently challenged practice was requirements contracts, which later came to be called exclusive dealing. Under them, a purchaser promised to purchase its needs for a product from the seller but did not state the quantity. Through the early twentieth century such contracts were routinely struck down, not because of concerns for competition, but because the contracts lacked specificity. As the New York Court of Appeals declared in 1921, while a contract did not necessarily need to specify a precise amount, the quantity must be able to be “determined by an approximately accurate forecast.”⁴³⁷ This rule threatened the early development of business franchising, because franchise agreements were by nature open ended as to price, quantities and even other terms of dealing.⁴³⁸ Within a few years such open-ended contracts were to become a routine and essential part of franchised dealership networks.⁴³⁹ This occurred largely as a result of contract law’s developing doctrine of the good faith purchaser.⁴⁴⁰

In his influential 1920 treatise on contracts, Harvard’s Samuel Williston approved of the common law’s restrictive interpretation. He also suggested a workaround, however, that revealed that competition policy was not the driving concern. As a general rule, he concluded, a promise to sell a purchaser’s needs without precise specification of the number is “not sufficient consideration” to make an enforceable contract.⁴⁴¹

A promise to buy such a quantity of goods as the buyer may thereafter order, or to take good in such quantities “as may be desired” . . . is not sufficient consideration since the buyer may refrain from buying at his option and without incurring legal detriment himself or benefiting the other party.

Id. at 216–17 (citing numerous decisions); see, e.g., Oscar Schlegel Mfg., 132 N.E. at 149 (contract that did not specify quantity void). Other developments are analyzed in Hovenkamp, supra note 438. He then added however, that the contract could be made enforceable if the buyer promised to purchase all of its needs from the seller. Thus “the promise of a seller not to manufacture except for the buyer, or the promises of a buyer not to buy except from a particular seller” was adequately supported.⁴⁴² Williston’s statements, amply supported by case law,⁴⁴³ reflected that the common law around 1920 ran in just the opposite direction as the subsequently emerging antitrust rule: contracts of this kind were enforceable at common law only if they were exclusive. By contrast, under antitrust law exclusive contracts were looked at with ever increasing suspicion.

Another concern that the case law reflected and that did breach the boundary into antitrust policy was when contractual restraints were included in patent or copyright licenses. Initially the courts refused to enforce many such agreements under patent law, using a variety of doctrines intended to limit the power of patentees to impose restrictions on patented articles once they had been sold.⁴⁴⁴ For example, in its influential decision in Wilson v. Simpson, forty years prior to the Sherman Act, the Supreme Court held that a patentee could not require purchasers of its wood planing machine to purchase its own unpatented disposable blades.⁴⁴⁵ In Adams v. Burke, the Supreme Court refused to enforce a condition imposed by the manufacturer/patentee of coffin lids limiting the geographic area where the lids could be used for a burial.⁴⁴⁶ That restriction, the Court held, was not “within the monopoly of the patent.”⁴⁴⁷ In Bobbs-Merrill Co. v. Straus,⁴⁴⁸ it refused to enforce a resale price maintenance agreement contained in a book copyright license, three years before the Supreme Court applied the antitrust laws in the Dr. Miles decision. The decision did not cite the antitrust laws. Long prior to the passage of the antitrust laws, the Supreme Court was routinely denying enforcement to vertical restrictions contained in patent or copyright licenses. Much of this doctrine eventually found its way into antitrust law.⁴⁴⁹

These decisions did not consider anything about competition in distribution, but only whether the restrictive license provision fell outside the scope of the intellectual property grant. Eventually, however, the patent decisions did generate some pushback on competition grounds. One example was Judge (later Justice) Horace Lurton’s 1896 opinion in Heaton-Peninsular Button-Fastener Co. v. Eureka Specialty Co.⁴⁵⁰The seller of a patented button-fastening machine prohibited purchasers of the machine from using it with any except its own unpatented fasteners, one of which connected each button to a garment. In modern terms we would characterize this arrangement as a variable proportion tying arrangement.⁴⁵¹ In addition to a dispute over the reasonable scope of the patent license in which the restriction was placed, the purchaser made an argument “based upon principles of public policy in respect of monopolies and contracts in restraint of trade.”⁴⁵² The gist was that “public policy forbids a patentee from so contracting with reference to his monopoly as to create another monopoly in an unpatented article.”⁴⁵³ Judge Lurton responded by noting that the tying clause served the useful purpose of measuring usage of the machine in order to determine the royalty.⁴⁵⁴

In 1912 a divided Supreme Court relied heavily on the Button-Fastener case to hold in Henry v. A.B. Dick Co. that the maker of a patented office copying machine could tie its own unpatented paper, stencils, and ink to the machine.⁴⁵⁵ By this time Judge Lurton had been elevated to the Supreme Court and wrote the opinion. The Sherman Act had now been passed, but the Court rejected the contention that it prohibited this kind of agreement. Rather, the Court noted the general rule of “absolute freedom in the use or sale of rights under the patent laws.”⁴⁵⁶

The Henry decision proved to be too much. Congress responded two years later with section 3 of the Clayton Act, which prohibited ties of goods “whether patented or unpatented,” provided that harm to competition was shown.⁴⁵⁷ That is, competition law rather than the appropriate scope of the patent became the driver. With that statement, the law of tying migrated from patent law into antitrust law. Section 3 became the first antitrust statute specifically targeting a vertical restraint. The statute actually went further, prohibiting not only absolute ties but also discounts or rebates conditioned on tying.⁴⁵⁸ However, it did not condemn all ties or even all patent ties, but only those that threatened to “substantially lessen competition or tend to create a monopoly.” Indeed, it is hardly clear that the Clayton Act would have condemned the button and office copier ties that had provoked Congress to act. Both were of common commodities and very likely caused no harm to competition.

In 1917 the Supreme Court overruled Henry in condemning a tying arrangement involving the Edison motion picture projector. It was sold subject to a patent license agreement that prohibited users from showing any films other than the seller’s own.⁴⁵⁹ By the time of the litigation, separate patents on the film had expired. The Court read the license restriction as effectively attempting to continue the film patent’s exclusivity by tying the film to the patented projector.⁴⁶⁰ While the decision generally relied on patent law, the Court quoted the new Clayton Act provision as confirming its conclusion.⁴⁶¹ Unlike Henry, the Motion Picture Patents case did involve a serious threat of monopoly in the infant motion picture industry.⁴⁶² After 1930 the tying decisions were not so circumspect and began condemning competitively harmless ties.⁴⁶³

Decisions such as Motion Picture Patents never spoke of vertical practices, but the decision did indicate judicial recognition of downstream control of films as a monopoly problem, at least in the area of patents. The concern in this case was that a patented film projector and control of film could become the lever for control of the motion picture industry. In the 1930s this concern about vertical practices as a tool of monopoly became prominent in the literature of industrial economics.⁴⁶⁴ In the motion picture industry itself, it eventually led to a near obsession with vertical integration reflected in the 1948 Paramount decree.⁴⁶⁵

Resale price maintenance—a so-called intrabrand restraint because it does not limit competition with rival products—received the harshest treatment of all. Today we are inclined to think that tying arrangements present greater potential for competitive harm than do resale price maintenance agreements. In 1907 Judge Lurton, still on the Sixth Circuit, held that an agreement between a proprietary medicine manufacturer and its various distributors and resellers stipulating their resale price was not enforceable because it was a contract in restraint of trade.⁴⁶⁶ There were no antitrust issues.⁴⁶⁷ The difference between this case and his own previous decision in the Button-Fastener case was that the medicines in question may have been protected by a trade secret, but they were not patented.⁴⁶⁸ Four years later the Supreme Court agreed in Dr. Miles Medical Co. v. John D. Park & Sons Co.⁴⁶⁹ It referenced the Sherman Act only to conclude that earlier decisions refusing to apply it had all involved patented products.⁴⁷⁰

Federal antitrust case law did not refer to a practice as “vertical” until the 1930s. In 1934 a district court opinion in the SugarInstitute case spoke about the possibility that “vertical organization of distribution agencies” might result in “a lower price to the ultimate consumers.”⁴⁷¹

More explicit judicial recognition of a distinction between horizontal and vertical practices emerged a little later, and from an unlikely source. After the Dr. Miles decision holding resale price maintenance unlawful, small business interest groups began a “fair trade” movement to permit individual states to opt out of federal law and permit resale price maintenance within their borders.⁴⁷² After some state attempts to do so contrary to federal law, Congress yielded to an intensive campaign of small business groups led by the National Association of Retail Druggists, which had drafted a “model act” for Congress to adopt.⁴⁷³ Congress responded with the Miller-Tydings Act in 1937.⁴⁷⁴ President Roosevelt opposed the bill and threatened to veto it, but he caved to political pressure at the last moment.⁴⁷⁵

Miller-Tydings authorized states to approve resale price maintenance within their borders, but it invited considerable dispute about its scope. While it never used the terms “vertical” or “horizontal,” it did contain a proviso that it did not immunize agreements among manufacturers, producers, and wholesalers.⁴⁷⁶

render illegal, contracts or agreements prescribing minimum prices for the re-sale of a commodity which bears, or the label or container of which bears, the trade mark, brand, or name of the producer or distributor of such commodity and which is in free and open competition with commodities of the same general class produced or distributed by others, when contracts or agreements of that description are lawful as applied to intrastate transactions, under any statute, law, or public policy now or hereafter in effect in any State . . . in which such resale is to be made, or to which the commodity is to be transported for such resale . . . .

Id. However, then the statute provided further

[t]hat the preceding proviso shall not make lawful any contract or agreement, providing for the establishment or maintenance of minimum resale prices on any commodity herein involved, between manufacturers, or between producers, or between wholesalers, or between brokers, or between factors, or between retailers, or between persons, firms, or corporations in competition with each other.

Id. The scope of this immunity had to be determined judicially. Because the proviso was triggered by state legislation, it was interpreted mainly by state courts, which very largely concluded that the statute exempted “vertical” agreements but not “horizontal” ones. For example, the North Carolina Supreme Court explained in 1939:

The agreements authorized by the law are vertical, between manufacturers or producers of the particular branded commodity and those handling the product in a straight line down to and including the retailer; not horizontal, as between producers and wholesalers or persons and concerns in competition with each other . . . .⁴⁷⁷

The Supreme Court eventually confirmed this view as a matter of federal antitrust law in Schwegmann Bros. v. Calvert Distillers Corp.,⁴⁷⁸ concluding that the statute did “not authorize horizontal contracts, that is to say, contracts or agreements between manufacturers, between producers.”⁴⁷⁹

By the early 1930s the law of vertical practices had developed to a place not all that different from where it is today, save for the treatment of resale price maintenance. Tying arrangements were addressable under antitrust, but liability was very largely limited to firms that had dominant market shares or where foreclosure percentages were high. In addition to Motion Picture Patents, the IBM tying case of 1936 found a tie of IBM’s computation machine and its data cards to be unlawful on a market share that exceeded eighty percent.⁴⁸⁰ By contrast, General Motor’s (“GM”) tie of car repairs to its original equipment parts was approved when the court concluded that the tie was essential for quality control and that there was plenty of competition in any event.⁴⁸¹ Other decisions also approved ties when the markets in question were competitive.⁴⁸²

The same thing was true of exclusive dealing, which condemned the practice when it realistically threatened to perpetuate market dominance. In a decision applying the Clayton Act to exclusive dealing, the Court noted that the supplier controlled roughly forty percent of the dress pattern outlets in the country and that the exclusive agreement in question threatened to create several local monopolies.⁴⁸³ There was no antitrust law of vertical territorial restraints until the Supreme Court addressed the issue in the 1960s in White Motor Co. v. United States.⁴⁸⁴ Justice Douglas held for the Court that it was too early to say. Resale price maintenance, which remained unlawful per se, was the outlier.

The law of vertical mergers and ownership vertical integration cut a similar path. The courts condemned it when it appeared to create or preserve monopoly, but generally required evidence of market dominance or foreclosure. For example, judicial condemnation of vertical integration in the American Tobacco,⁴⁸⁵ Corn Products,⁴⁸⁶ Kodak,⁴⁸⁷ and Keystone Watch⁴⁸⁸ decisions were all predicated on at least an assumption of dominant market shares. On the other hand, the court refused to condemn United States Steel’s integration into distribution facilities,⁴⁸⁹ finding that the integration improved efficiency and reduced costs and uncertainty.⁴⁹⁰ In affirming, the Supreme Court cited evidence that it was cheaper for the defendant to combine several operations in a single facility and that this combination would enable it to compete more effectively in the world market.⁴⁹¹

In its unanimous antitrust decision in Eastern States Retail Lumber Dealers Ass’n v. United States, the Court even intervened to protect ownership vertical integration in the lumber industry.⁴⁹² The defendants were classic examples of Progressive Era small businesses who relied on the mantle of “fair trade” to protect themselves from larger vertically integrated firms. In this case they organized a boycott, which the Court condemned, agreeing among themselves that they would not purchase lumber at wholesale from anyone who had vertically integrated into retailing. The decision never used the words “vertical” or “integration.” Rather the boycott was cast in terms of wholesalers who sold directly to customers rather than exclusively to the defendant retailers.

D. Growing Fears of Vertical Control After World War II

The law of vertical relationships began to go off the rails in the 1940s, and for a confluence of reasons. One of course was the Great Depression and the dramatic rise of small business as an interest group following World War I.⁴⁹³ Another was President Franklin D. Roosevelt’s appointment of Thurman Arnold to be head of the Department of Justice Antitrust Division, turning it into a potent antitrust and anti-patent tool. The development of influential models of imperfect competition also had considerable influence.⁴⁹⁴

In its International Salt tying decision in 1947, the Supreme Court applied both the Sherman and Clayton Acts to condemn a non-foreclosing tie involving a common staple—salt—that was not realistically capable of being monopolized.⁴⁹⁵ The case effectively migrated patent act tying policy into antitrust law by holding that the defendant’s patents on its salt injecting machine created a presumption of market power sufficient to condemn that tie. It also watered down the Clayton Act requirement that an unlawful tie must “substantially lessen competition”⁴⁹⁶ by holding that proof of competitive harm did not require foreclosure—something that would have been impossible to show, given that the tied product was ordinary salt.⁴⁹⁷ Rather it was enough to show that the tying contracts covered a significant amount of salt. In this case that was approximately $500,000 per year.⁴⁹⁸

From that point tying law was used aggressively to condemn competitively harmless practices that the Court did not understand. Nor did it need to, because the per se rule for tying that the Court adopted created a strong presumption of illegality without competitive analysis.⁴⁹⁹ The Court relied on Justice Frankfurter’s dicta in the 1949 Standard Stations exclusive dealing case that “[t]ying agreements serve hardly any purpose beyond the suppression of competition.”⁵⁰⁰ That dicta served to make the Court more hostile toward tying arrangements than it was toward exclusive dealing.

In its 1949 Standard Stations decision, the Supreme Court expanded the rules against exclusive dealing to prohibit Standard Oil of California from engaging in “single-branding,” or insisting that its franchised gasoline stations pump only its own gasoline.⁵⁰¹ Standard Oil’s contracts covered 6.7% of the gasoline sold in California.⁵⁰² The Court’s condemnation of the practice was too much for Justice Douglas, otherwise an aggressive antitrust enforcer, who predicted in his dissent that requiring franchised gasoline stations to sell multiple brands of gasoline would force the refiners to build their own stations, thus eliminating the smaller dealers altogether.⁵⁰³

One effect of these decisions was a long-standing hostility toward tying arrangements, although it never extended quite as far to exclusive dealing.⁵⁰⁴ That distinction does not make a great deal of sense. While a tie requires a dealer to carry a specific second product as a condition of obtaining the first, exclusive dealing excludes a particular product from the dealer’s entire business. For example, under tying a dealer that sells GM cars might be required to repair them using GM parts.⁵⁰⁵ By contrast, under exclusive dealing the dealer would be prohibited from selling non-GM cars altogether. While outcomes vary with facts, often the amount of market exclusion produced by exclusive dealing exceeds the amount produced by tying. In any event, the per se rule for tying was not a creature of the Progressive Era, but rather of the late 1940s.

The courts also became more aggressive about vertical integration by merger and even by new entry.⁵⁰⁶ In fact, vertical integration almost became a suspect category. After the merger law was amended in 1950 so as to reach vertical as well as horizontal mergers, the Court applied it liberally to situations where foreclosures were not in the 40% and above range that Progressive courts had condemned, but as low as 3% or 4% on the Supreme Court,⁵⁰⁷ or barely over 1% in the lower courts.⁵⁰⁸ Internal vertical expansion earned similar treatment. For example, some decisions condemned automobile makers’ distribution of cars through wholly owned dealerships rather than contracting with independents.⁵⁰⁹ While the Mt. Lebanon Motors decision observed that the law of exclusive dealing required market power, the requirement was met when the court defined the market as “Dodge automobiles [sold] at the retail level in Allegheny County,”⁵¹⁰ thus guaranteeing that Chrysler’s market share would be 100%.

Numerous decisions in the 1960s and 1970s prohibited nondominant firms from doing any more than switching to self-distribution rather than relying on independent dealers.⁵¹¹ None of these decisions has survived today.

CONCLUSION

In 1933, two disruptive books appeared that presented the theories of imperfect and monopolistic competition. One was written by Cambridge University’s Joan Robinson,⁵¹² and the other by Edward Chamberlin from Harvard.⁵¹³ Both books reflected the Progressives’ increased skepticism about the benign qualities of markets. In the process they also paved the way for significantly more aggressive enforcement.

The theories of imperfect and monopolistic competition immediately became influential in academic circles. They gradually evolved into a single set of theories that today go by the name of imperfect competition.⁵¹⁴ Whether incidentally or as a result, antitrust policy began to veer left, often past all reasonable boundaries, condemning efficient practices where the creation of monopoly was virtually impossible.

This increased level of antitrust enforcement subsequently provoked a fierce neoliberal reaction, mainly from the Chicago School. It was prominently represented in the writing of George J. Stigler and, a little later, Robert Bork.⁵¹⁵ The Chicago School fought an ultimately losing battle to present imperfect competition models as untestable or incoherent. An empirical renaissance in economics, mainly in the 1970s and after, refuted that critique.⁵¹⁶ Today imperfect competition models clearly dominate the microeconomic literature as well as antitrust law, and their empirical robustness is well established.⁵¹⁷

The most general result has been a shift back toward the center. Today antitrust policy sits between the aggressiveness of the Roosevelt Court on one side, which often condemned competitively harmless practices, and the decaying remnants of the Chicago School on the other. Against this the Progressive response—aggressive in its own time but quite moderate today—has proven to be surprisingly durable.

96 S. Cal. L. Rev. 129

Download

Herbert Hovenkamp*

* James G. Dinan University Professor, University of Pennsylvania Carey Law School and the Wharton School. Thanks to Erik Hovenkamp and Matthew Panhans for valuable comments.

Fifty Ways to Leave Your Lover: Doing Away with Separation Requirements for Divorce

April 2023April 2024Claire P. Donohue*

Despite the evolution of no-fault divorces, which were intended to remove certain barriers to divorce and essentially make any divorce filed inevitable, many jurisdictions prescribe a waiting period before eligibility for divorce, during which there must be a demonstrable period of separation. In support of findings of facts and conclusions of law about whether the divorcing couple has established a separation, some jurisdictions will ask whether the couple has lived in the same abode and, if so, will inquire about the divorcing couple’s roles and choices vis-à-vis one another—for example, preparing meals for one another or engaging socially with one another. Other jurisdictions will make explicit inquiries into whether a couple has had sex with one another. Probing into families’ living arrangements and adults’ sexual choices does real and particular harm to marginalized social groups, and doing so defies the liberty and privacy interests of families and couples. In explicating this litany of critiques, this project attempts to avoid the trap that family law scholarship can too easily fall into; namely, criticizing doctrine “on a low level of abstraction” and rushing to a proposed reform. This piece, therefore, offers a taxonomy of the harm that separate and apart requirements cause—paying particular attention to the ways in which these laws are classist, heteronormative, gendered, and racially charged—and illuminates how constitutionally precarious such laws are. The project is ambitious as it attempts to situate and expose the deep-seated problems of separate and apart requirements as reflective of the deep-seated flaws in family law jurisprudence generally. The piece offers a comprehensive analysis and investigation of separate and apart requirements, and it serves as an invitation to further conversation and exploration of the themes raised herein.

Based on the author’s practice experience as much as her scholarship, the proposal insists that where couples are struggling deep in the heart of the matter about their choices—the good ones and the mistakes—they do not need or desire a judicial officer to ask them to wait or to organize their life a certain way before allowing them to divorce. Nothing and no one is served by insisting on some normative view about what the end of a marriage looks like and requiring some time period for performance of that view. The proposal in this piece joins a growing chorus of practitioners, judges, and scholars talking about administrative divorces. The distinct voice in this piece advocates for administrative divorce as a procedural decoupling of divorce from any underlying or attendant economic and custody issues. The piece motivates this argument based on the premise that allowing families to proceed thusly will enhance the self-determination of families in transition and promote use of the courts when, and only when, the families determine that court involvement in matters of children and economics will improve their stability.

INTRODUCTION

The problem is all inside your head, she said to me

The answer is easy if you take it logically

I’d like to help you in your struggle to be free

There must be fifty ways to leave your lover¹

Paul Simon knew full well that there are 50 Ways to Leave Your Lover, yet many jurisdictions insist on just one. That one way looks something like this: decide you are unhappy, unsafe, or unstable in your marriage. Leave the marital home or somehow excise your spouse from it. Pay for that additional rent or mortgage or count on the fact that your spouse can and will. File some paperwork with the court and wait. Wait a long time. Pay a lawyer. Pay a lawyer a lot of money. While you are waiting and paying you are still married, but you are also not really married. So do not resume living with your spouse, even if there is room in that property for you. If you do find yourself back in the house (but goodness, please don’t) do not socialize unduly with your spouse. You may not be sure what that looks like, but just please refrain from it. Do not share meals with your spouse. Certainly do not sleep with your spouse. Never. Not if you are living together or if you have moved out. Eventually, go to court. See a judge. Let the judge know that you followed these rules.

This Article takes up those rules, namely jurisdictions’ requirements that couples live separate and apart and wait out arbitrary waiting periods to be eligible for no-fault divorce. Despite the evolution of no-fault divorces, which were intended to remove certain barriers to divorce and essentially make any divorce filed inevitable, many jurisdictions prescribe a waiting period before eligibility for divorce, during which there must be a demonstrable period of separation.² In support of findings of facts and conclusions of law about whether the divorcing couple has established a period of separation, some jurisdictions will ask whether the couple has lived in the same abode and, if so, will inquire about the divorcing couple’s roles and choices vis-à-vis one another—for example, preparing meals for one another or engaging socially with one another.³ Other jurisdictions will make explicit inquiries into whether a couple has had sex with one another. These are questions about families’ living arrangements and adults’ sexual choices, questions that invade the privacy of families concerning their living arrangements and adults concerning their sexual choices. Moreover, the requirements of separateness and the inquiries they inspire do real and particular harm to certain social groups. This Article critiques these requirements as being classist, heteronormative, gendered, and racially charged and suggests that they defy constitutional protections. The Article ends by proposing a process that protects the dignity of divorcing couples and better provides predictability and stability for families in transition.

The primary argument for separate and apart requirements posits that separateness is a proxy for establishing that the decision to leave one another is mutual and voluntary or at least that one spouse has given the other a very clear indication that they want out.⁴ To the extent that one regards marriage as a contract, a meeting of the minds as to a modification of its terms or its termination makes a certain sense. But the requirements of separateness and the inquiries they inspire are superfluous and odd, given several realities of divorce: first, under no-fault divorce, no one has to prove any particular transgression; and, second, the contestations in divorce are rarely if ever about the divorce itself—rather, disagreements concern custodial, property, and support disputes.

A second argument, more tenuous than the first, to justify these requirements is anchored on the belief that marriage is a primary source of stability and security for children, families, and society. Divorce, the argument goes, is a destructive life event that couples should avoid, delay, or undertake painstakingly slowly.⁵ Yet the passage of time and greater visibility of families and couples not hiding their choices and arrangements has debunked the myth that marriage is the only available and functional means of raising children and ordering a civil society. Meanwhile, the time periods and requirements embedded in many separate and apart requirements are deeply destabilizing and burdensome.⁶

Moreover, the requirements for separateness burdens certain social groups in particular. To begin, living—and parenting—separately prior to final orders for support and division of assets is challenging if not impossible for those who are under-resourced or living in poverty; these economic realities impact women in particular. Moreover, the obsession about what is happening behind closed doors and what those intimate and interpersonal choices might tell the public about a couple’s desires or capacities is deeply rooted in heteronormative thinking and reasoning that applies rigid binaries to gender, gender performance, sexuality, and family constellations.⁷ Many expressions of self, love, and family do not match rigid constructions of how to “do” family.⁸ Moreover, inquiries into sex or home life are a particular violation to women, members of the LGBTQ+ community, and people of color, as classes of people whose sexuality and home life are too often distorted or weaponized against them.⁹ Meanwhile, the requirements appear contrary to constitutional protections.¹⁰ A fulsome accounting of harms—both shared and specific—and a survey of the constitutional concerns reveal that separate and apart requirements defy the very expectations we ought to have for family policies. They do not extend the dignity and respect to couples and families that they deserve, and they do not scaffold the predictability and stability that divorcing couples and families need.

Part I of this Article will explore the context of divorce—who is divorcing and why people leave marriages. This Part also offers a primer as to how the process and requirements for divorce are situated in the history of divorce. Part II will clarify and expand upon the harm done by separate and apart requirements generally and the intrusion they inspire. The Part will begin with an overview of the toll that pursuing divorce takes and how separate and apart requirements compound these burdens. This Part also seeks to situate these harms in the context of the disenfranchisement experienced by those who the law subordinates or fails to anticipate, as well as the particular psychological harm to subordinated communities brought on by invasions of privacy and judgment about lifestyle. To the extent that Part II describes how separate and apart requirements complicate the lived experience of families, Part III introduces the legal doctrine that should challenge the existence of the requirements themselves.

Part III outlines preliminarily the substantive due process right to be free from the burden of separate and apart requirements and inquiries. Specifically, the Part will illuminate an intersection in the Venn diagram of family law—namely, in the overlay of intimacy cases, right to marry cases, and family rights cases that suggest separate and apart requirements are on shaky constitutional ground. This Part will be in conversation with scholars calling for a right to sexual privacy and a right to unmarry, and it is meant as an invitation to further and future analysis. Preliminary analysis is offered here in this inchoate form to illuminate how clumsily and carelessly we define and defend family as a matter of law. It is not just that separate and apart clauses cause or exacerbate psychic and sociological harm—the risk of this harm exists and persists even where the law appears to be on precarious constitutional footing. In many respects, the Article agrees with Martha Minow’s assessment from almost thirty-five years ago that there is “an incoherent jurisprudence about families, [because it is] a jurisprudence tugged and pushed by other concerns.”¹¹

The final Part of the paper will turn to a consideration of what really matters to families: (1) being afforded dignity and respect; and (2) stability and predictability for ordering finances and property and raising children. Interestingly, these are the public policy concerns cited in support of, but not actually served by, divorce law. Part IV will offer prescriptions that eschew dogmatic and political views of marriage and actually serve familial interests. First, jurisdictions must do away with separate and apart requirements. Second, jurisdictions should bifurcate the adjudication of divorce in a prompt administrative proceeding, allowing for subsequent adjudication or alternative dispute resolution of custodial, property, and support disputes.

This Article attempts to avoid the trap that family law scholarship can too easily fall into criticizing doctrine “on a low level of abstraction” and rushing to a proposed reform.¹² This Article offers a taxonomy of the harm that separate and apart requirements cause, paying particular attention to the unique harm to those whose experience of the law is too often invisible. The Article also illuminates how constitutionally precarious such laws are. The project, then, is both ambitious and insufficient, as it attempts to situate and expose the deep-seated problems of separate and apart requirements as reflective of the deep-seated flaws in family law jurisprudence generally.¹³ The Article is intended, therefore, to serve as an invitation to further conversation and exploration of the themes raised herein.

I. YOU JUST SLIP OUT THE BACK, JACK: GETTING DIVORCED

People get divorced for all sorts of reasons along a spectrum: from mistaken compatibility to situations that pose health and safety risks to spouses or children. All along this spectrum, there may be elements of neglect of self or partner in the marriage, or unkindness or sorrow or even deceit and scandal, but there is also room for collaboration or planning for a next chapter and a changed future. Whatever the reason, or whatever the conduct of the spouses involved, states now universally recognize the importance of letting people out of unhappy marriages.¹⁴ And about forty to fifty percent of Americans will avail themselves of that option each year.¹⁵ Even where a party can now rely on no-fault grounds for divorce, in many jurisdictions they must establish eligibility under the jurisdiction’s separation requirements. A separation requirement refers to the amount of time two spouses must live separately to be eligible for a divorce.¹⁶ Separation requirements range from sixty days to five years.¹⁷ These requirements affect when a party can file and start the clock regarding when the matter will actually be heard or finalized.¹⁸ Moreover, in many cases, these waiting periods are not just a matter of running the clock; rather, the period of separation has to have demonstrable features of separation to satisfy the court that the matter is ripe for divorce.

Judges in Pennsylvania, for example, may seek evidence that the spouses began to lead independent lives, may inquire about whether spouses have stopped sharing a bedroom and whether they have had sex, may ask how much time a spouse spent in the marital home, and may question whether the spouses shared meals.¹⁹ In the District of Columbia, as in Pennsylvania, a couple is permitted to remain in the same marital home pending divorce, but the court will make explicit inquiry into whether the couple has had sex with one another and may additionally inquire about whether and when the spouses began to use separate bedrooms and how household finances were managed.²⁰ In Maryland, couples are not permitted to cohabitate at all, which does not forestall inquiry into the spouses’ sexual relationship; rather, Maryland courts will make a direct inquiry regarding sex. Two spouses who have had sex will be deemed to be cohabitating regardless of a reality of separate abodes.²¹The existence of separation periods and the depth of inquiry required in some jurisdictions are vestiges of confounding Victorian principles and the conspiring paternalism of the state when it comes to divorce.

A. Current Requirements in Conversation with the Confounding History of Divorce

Historically marriage was a matter of “status and cultural location” as well as economic security (of dependent women) and property rights (of men).²² As stated by the Supreme Court in 1888,

Marriage is something more than a mere contract, though founded upon the agreement of the parties. When once formed, a relation is created between the parties which they cannot change; and the rights and obligations of which depend not upon their agreement, but upon the law, statutory or common. It is an institution of society, regulated and controlled by public authority.²³

Divorce, then, was about restructuring one’s economic life and one’s public standing.²⁴ By extension, divorce was quite public and political. Perhaps the greatest indication of this was the manner of seeking a divorce, namely through a petition to the legislature.²⁵ Divorce by legislature meant a popularly elected branch of government would decide whether a marriage harmed the spouses and the community to such an extent that the harm justified ending the marriage.²⁶ The move from legislative halls to courtrooms did not really render divorces particularly more available, any more private, or any less political.²⁷ Separate and apart periods and requirements reflect the longstanding insistence that marriage has an awful lot to do with the performance of normative roles and obligations, so divorce is only an option if there has been a failure to execute these roles. Proving oneself eligible for divorce inspires the same theater of early divorce law and continues to invite or advance a regime of judicial paternalism.

1. Marriage as Performance of Obligation

Where the legislature might have asked itself if a given marriage violated public policy, the courts addressed divorce as an adversarial process concerning a breached marital contract.²⁸ The terms of the contract flowed between husband and wife reflecting normative values. The conventional story of marriage in the nineteenth and early twentieth centuries was this: man and woman meet, perhaps based on love, but more likely based on a courtship promoted and controlled by the involved families. Man marries and stays man; woman marries and becomes wife, a person no longer entitled to an independent legal identity.²⁹ Husband assumes the legally and culturally assigned role of provider and protector for this now vulnerable creature. Wife agrees to obedience and sexual submission in order to birth children and tend to a home.³⁰

Even as divorces left legislative halls, the jurisprudence of divorce still reflected a second social contract that flowed between the couple and the state.³¹ The state had an interest in reinforcing predictable, regulated (gendered) expectations of support and obligation.³² The prevailing notion was that this paradigm policed virtue and upstandingness. As sole benefactor to his family, a man would be sober and productive. As keepers of the hearth, women would be too busy or grateful to be performing their manifest destiny as mothers to notice their disenfranchisement.³³ Children would be fed and clothed and sheltered.³⁴ Conduct such as adultery, desertion, or cruelty—and eventually habitual drunkenness and use of illicit drugs—became acceptable common grounds for divorce as they amounted to an obvious breach in the promises flowing not just from husband to wife, but also between married couple and state.³⁵

2. Divorce as Theater

Forced to comply with divorce law’s requirements for specific action or omission by a spouse, “one-sided evidentiary hearings[] and feigned testimony became common.”³⁶ Litigants continued to make public performances in courts concerning the appropriateness of their divorcing, just as they had before the legislature.³⁷ Indeed, litigants would “blithely relate[] prefabricated stories of their spouses’ ‘extreme cruelty’ destroying their marriage.”³⁸ The following situation seems humorous in retrospect, but is in actuality a maddening example of what happens when there is such a gulf between law and society.³⁹ In New York, couples staged elaborate farces, complete with paid actors, for example, to play a mistress who would substantiate an adultery ground for divorce.⁴⁰ Attorneys and judges played along.⁴¹ The truly affluent skipped the show and headed to Reno for “quickie” divorces.⁴² When New York, one of the last states to allow no-fault divorces finally did so, the two “chief evils the new divorce law was designed to eliminate” were the “collusive or fraud-ridden divorce actions” and “out-of-state divorces based upon spurious residence and baseless claims.”⁴³ If a party failed to keep up the guise of the performance or a judge was not as accommodating of any novelty or stretch in the arguments litigants made, this could forestall the divorce.⁴⁴ And of course, any party not willing to concede grounds or agree to the divorce could lock a miserable couple together in perpetuity.⁴⁵

Eventually, the chasm between what relationships actually looked like and what divorce laws and jurisprudence required grew too huge and too public to ignore.⁴⁶ The nineteenth century Women’s Movement had animated questions about women’s roles and capacity that challenged the prevailing notion about family.⁴⁷ These reformers raised consciousness regarding women’s property rights and a desire to dismantle male domination, “alter[ing] the notion of the husband/father as the legal representative for the family in public and commercial realms.”⁴⁸ Decades later, in the 1960s, a powerful secondwave of feminism supplied further fodder for divorce reform.⁴⁹ These twentieth-century feminists were outspoken in naming the privilege and subordination of the varying roles and opportunities available to men and women in marriage and the public realm (for example, the privileged public role of man as employee versus the subordinated private role of woman as caretaker).⁵⁰ They mounted resistance to the subordination of private roles and to a woman’s default position in them. Emerging notions that women might have myriad ways of deciding whether and how to be wives and mothers lent themselves to reforms that facilitate movement in and out of marriage.⁵¹ This cultural revolution combined with the chorus of litigators, judges, and families who were growing tired of the system-inspired collusion and fraud piloted a transition to no-fault divorces.

The timeline for states adopting no-fault divorce statutes tracked with Supreme Court cases chipping away at laws that carried or reflected assumptions that women would marry, marry young, and remain dependent in their marriages.⁵² The first state to adopt no-fault divorce was California in 1969, and the last state was New York in 2010. To this day, only seventeen states have true no-fault statutes whereby the parties cannot raise fault.⁵³ While the concept of fault eroded or became of second order importance in the divorce law of most states, requisite periods of separation did not, and normative requirements for what constitutes appropriate levels of disconnectedness arose.⁵⁴ Parties’ abilities to satisfy the court that they have lived separately turns on their abilities to perform as expected.

The standards before a family court judge are notoriously subjective. The best interest of the child standard in custody matters, for example, asks judges to make determinations concerning a child’s welfare and happiness.⁵⁵ On the margins—where one parent is unfit or dangerous—this may be an easy call, but, in the vast majority of cases, judges are trying to parse facts about things like parental involvement, a “child’s adjustment,” and “the wishes” of the parties and the child in order to make predictive determinations about what arrangements will best serve the needs of a child.⁵⁶ These determinations inevitably turn on a judge’s opinion of the parents—opinions, which are in turn, based on the judge’s own observations and values.⁵⁷ Parties that do not play the predictable and expected part of mother as nurturer and father as provider can struggle in custody determinations.⁵⁸ Similarly, judges’ expectations and values about roles and behavior inevitably also come to bear when judges consider the conduct and choices of parties when making findings of fact and conclusions of law about whether or not couples’ marriages have in fact broken down irretrievably.⁵⁹ Trials and colloquies on these issues, after all, seek to unearth the couples’ “inner most beliefs.”⁶⁰ Perhaps because this task is so impossible and offensive, some jurisdictions pretend to have turned instead to “objective” evidence of separateness to prove the claim that the marriage is over.⁶¹ And yet even here these inquiries can include questions about sex and particularized questions about shared meals, sleeping arrangements, and social engagements when a couple continues to share a residence. Just as with other aspects of family law, couples whose performance is outside of a subjective norm can and will struggle to convince the court that they are eligible for divorce.⁶²

One appropriately wonders if any of these inquiries into intimate and familial choices are necessary given that parties can plead in plain and simple language that yes, in fact, through “no fault” of either party, there has been an irretrievable breakdown of the marriage.⁶³ Surely the inquiries are of no consequence when neither party contests the divorce, and even where one spouse contests the divorce, under a no-fault regime, this spouse cannot successfully defend against the divorce itself for long: if one spouse wants out, they will get out.⁶⁴ And yet all scenarios—from uncontested divorces to those where someone will impotently contest an inevitable divorce—many courts can and do inquire into the conditions or level of separation before entering an order of divorce.

3. Judicial Paternalism

The early days of judicial divorce and the jurisprudence of fault invited an era of judicial paternalism in which parties aired the failures of their spouse to act in accordance with norms, and the court’s orders were a mechanism to replace the failed male head of household.⁶⁵ Subsequent divorce reform allowing for no-fault divorce shifted the “regulatory” energy or emphasis away from that of locating blame on one individual and towards the “internal aspects of family life.”⁶⁶ The need for parties to demonstrate that they have lived separate and apart generally, and the companion assumption that this means something, particularly about the way a couple shares bed and board, tells us that models of judicial paternalism are alive and well.

In twenty-nine states, parties must invite the court to enter their home to determine if they and their soon-to-be-ex-spouse are behaving as if the marriage is truly over.⁶⁷ The suggestion, in turn, that evidence of sex acts or contributions to a shared residence is sufficient proof of an intact marriage reflects antiquated and gendered visions of marriage, namely that marriage is nothing more than the exchange of sexual services and housewifery for the support of bed and board.⁶⁸ It is also a reminder that marriages have always been a vehicle for the state to police interpersonal relationships and regulate society: “Marriage defines normality. It is the standard against which all other relationships are judged. Societies promote and expect marriage. And governments use marriage to police social groups.”⁶⁹

The creation of family courts themselves signal a sense that what the court and judge were doing was intervening into the family, not merely presiding over a breached contract or brokering terms for a party wishing to modify their marital contract.⁷⁰ Family courts as originally conceived were meant to “mend, and if possible cure, sick marriages,” ending them only “if cure was hopeless.”⁷¹ Judges, then, became marriage doctors or conciliators.⁷² Still today in many jurisdictions, when one files through no-fault grounds, it triggers not just a separation period and the inquiry into separateness already discussed, but it can also trigger a requirement that the couple undergo mandatory “counseling” sessions.⁷³ In Pennsylvania, for example the court does not have to order counseling but may do so on information and belief that there is a reasonable prospect of reconciliation.⁷⁴ So, if a judge determines that the couple really meant that they wanted to be divorced when they filed for divorce, the judge may decline to order them to counseling. But if the judge decides—what exactly?—that one spouse might still be invested in the marriage and should be able to use state resources to pursue their disinterested spouse, or that either spouse has not thought it through?⁷⁵ Then the judge can order the parties into counseling. And counseling to what end? To reconcile?⁷⁶ To “play nice”? This is not counseling. This is social engineering.⁷⁷ It is well studied that in order for clinical intervention to be successful, particularly in family counseling, each member must come to the counseling with a sense of autonomy, choice, and insight.⁷⁸

Of final and considerable concern, courts’ paternalistic “fact” finding into separateness seeks to ask and answer heteronormative and gendered questions about family composition and choices. The burden of the courts’ voyeurism is not something experienced or borne equally across all populations; rather, it is a practice that disproportionately subordinates people of color, women, the poor, and members of the LGBTQ+ community. Meanwhile, the intrusion into divorcing couples’ intimate choices and shared living arrangements, and its disparate impact on subordinated populations, is inapposite to family rights doctrine and the evolution of privacy rights in marital and non-marital homes.

II. SHE SAID IT GRIEVES ME TO SEE YOU IN SO MUCH PAIN: BURDEN AND HARM FROM INTRUSION INTO INTIMACY AND FAMILY

While there is some variation across jurisdictions, one can articulate a “typical” process to secure a divorce. One spouse will file a complaint and another will answer, or the two will file a joint complaint; the matter will be marked for a preliminary hearing, at which point the court and the parties chart a path for the divorce, which may include discovery deadlines and a series of court appearances; thereafter follows a final hearing, which may or may not be contested, so this hearing may be an evidentiary hearing or more of a colloquy with the parties; and finally, finally, finally the court will enter an order pronouncing the couple divorced and addressing issues of custody, support, and property. Yet, even within this similar arc of a divorce case, the distinct experience of a given family will be different depending on the predilections of their jurisdiction or the circumstances of the litigants.⁷⁹ A rudimentary Google search tells us that on average it will take a couple about one year to secure a divorce.⁸⁰ In those jurisdictions that require a waiting period before filing for divorce, however, the timeline will be one year plus that waiting period; in Maryland for example, practitioners will set clients’ expectations to contemplate a total wait of about two years before the divorce is final.⁸¹ These timelines have elongated substantially during the COVID-19 pandemic and the ensuing crisis of capacity and flexibility in state courts to administer matters remotely.⁸²

A. Social Emotional and Financial Costs of the Divorce Process

Scholarship about divorce includes studies tracking divorce rates, attempting to predict why couples divorce, and describing how they fare in the years after divorce. Dwarfing that scholarship are the multitude of articles and studies about how children fare after a divorce. In contrast, there is very little writing and research about the experience—the trajectory and social emotional states—during the years that pass when couples are waiting to file for divorce and moving through the courts to secure a divorce. We do know that leaving a marriage is a significant life stress.⁸³ “For many people, marital separation means substantial financial upheaval, the renegotiation of parenting relationships and co-parenting conflict, changes in friendships and social networks, moving locally or relocating cities, as well as a host of psychological challenges, including re-organizing one’s fundamental sense of self: Who am I without my partner?”⁸⁴

We also know that the divorce process imposes financial strain on families. Divorces are costly.⁸⁵ There are filing fees associated with the process.⁸⁶ People using an attorney pay for that assistance—sometimes as much as $400 per hour.⁸⁷ Divorces may involve consultation with accountants, therapists, or other professionals, none of whom work for free.⁸⁸ Divorcing may require refinancing homes and cars to adjust ownership of that property.⁸⁹ Couples may face moving costs or additional rents and payments to set up separate homes.⁹⁰ The particular time periods and separate and apart requirements of certain jurisdictions are deeply destabilizing and burdensome. The waiting periods and the requirements of separate and apart make the cost of divorce more immediate or pronounced.⁹¹ Protracted divorce proceedings mean lost wages or use of personal leave for multiple court appearances, as well as the risk of job loss for missing work time and the cost of childcare expenditures.⁹²

Financial strain and the protracted timeline for divorce map onto a sea of logistical and existential difficulties that are already part of divorce for families.⁹³ One difficulty surrounds the public airing of private matters. We are socialized—and in fact, the law affirms in many respects—that our marriages are confidential places.⁹⁴ Yet, the divorce process invites, even requires, an invasion of this privacy. When the issue of privacy breaches in divorce is discussed publicly or in legal discourse, the discussion usually centers on situations in which one spouse may have crossed an ethical, if not legal, line in accessing information to buttress their claims for a divorce. The invasions I refer to here, in contrast, are invasions solicited by the court and the legal process itself. Even leaving aside the particular inquiries of separate and apart, divorce itself as it is conceptualized and adjudicated asks litigants to discuss a breakdown of a (previously) private domain. It may further require discussion or examination of child rearing and finances. Now layer onto this the particularized inquiry of some separate and apart jurisdictions: last sexual encounters, sleeping arrangements, or the nature of shared meals and social engagements. In almost any other context, sex, money, and child rearing are hallowed grounds. These are issues that one may not have reason or comfort enough to discuss with anyone at all, or only with close friends; and yet now the divorce requires a public airing, all while insisting that the subject matter of the litigation is not to locate or determine any one person’s fault. As shall be discussed in more detail below, privacy is an important concept in one’s sense of self and sense of control, so the confusing breaches of it take a human toll.⁹⁵

Additionally, families arriving in divorce court are not on happy or easy footing to begin with. They have experienced interpersonal stressors or have had pressures outside the marriage spill over into the marriage, which have triggered the marital conflict.⁹⁶ When the divorce process itself introduces new sources of stress and strain—financial, logistical, psychological, and otherwise—it taxes the very families who are already struggling to maintain a sense of collaboration and problem-solving.⁹⁷ Where deterioration of the social fabric is absolute, the inability to abide each other, let alone work with one another, presents a particular problem for families with children.⁹⁸ The prevailing wisdom is that (absent issues of abuse or parental unfitness) children benefit from access to and care by both parents, and so the presumption at law is one of joint custody. Essentially, divorcing parents will need to “deal” with one another regarding the care of their shared children.⁹⁹ Childcare is not the only matter that requires cooperation or compromise during a divorce. Divorcing couples must make decisions about property distribution and support or risk the court making the decision for them. Even in situations in which couples are willing and able to communicate and contribute to the joint enterprises of raising children or structuring post-divorce households, navigating these scenarios requires heightened intentionality and care in order to avoid or minimize discord. This work is exhausting. Enter separate and apart requirements—requirements that exacerbate all of the sources of stress and tension described above.

B. Burdens Are Not Evenly Held

While any household or divorcing couple risks facing the burdens described above, the risk of exposure to the burdens or the depth of experience of each burden is not evenly borne by each family and couple. This is because not all families are resourced, respected, and accounted for in a way that provides them political and social power. It is worth starting by pointing out the ways in which differently situated families’ actual passages through the divorce process will be different; from there, we will move to consideration of the more nuanced aspects of social and political differentiation as it impacts families’ relative treatment in, and experience of, the divorce process. To begin then, it is not uncommon for different types of cases to be “tracked” differently, with separate judges for each type of case and distinct tracking orders that reflect the different realities of the pace and nature of the litigation. For families with fewer means, and particularly for those without counsel, pretrial events become the occasion for negotiation and mediation, much of which can be happening before the judge’s eyes or with the judge’s involvement. Where there are breakdowns or confusion regarding temporary orders, there are no attorneys to turn to for assistance, so the parties will seek the assistance of the court. In contrast, for parties with means, many pretrial court appearances are quite pro forma. The attorneys for the parties submit or discuss the private separation agreements that the divorcing spouses have agreed to in out-of-court negotiations or mediations. Parties produce evidence and ask and answer questions in depositions or interrogatories. Court appearances can be an occasion for announcing what is known, what has been done, and what has been decided.

The effect is to offer people of means the opportunity at least for the vision of divorce that Cady B. Stanton herself had wanted when she advocated for marriage to be considered a private agreement between the parties that could be terminated themselves with only state acknowledgment of the termination.¹⁰⁰ What she argued against is what people of lesser means arguably endure: supervised marital relations by surrogate governmental heads of household.¹⁰¹ Yet, for certain families, the entire scaffolding for divorce invites judicial involvement and threatens judicial paternalism. All this, in turn, maps on to the public discourse about the divorce “problem.”¹⁰² One hears claims that feuding parents should stay together for the sake of the children, that revaluing the idea of marital service and obligation would improve family life, and that marrying and not divorcing would lift women and children out of poverty.¹⁰³ Conservative pundits have laid blame for all manner of social problems on the thresholds of “broken homes.”¹⁰⁴ “[M]arriage, rather than a shift in public priorities, [is] the solution to poverty, violence, homelessness, illiteracy, crime, and other problems.”¹⁰⁵ It is in this context of punitive and judgmental rhetoric and under the eye of judicial paternalism that families are asked to declare their choices about whether they have lived together, how much they have communed with one another if they have lived together, and whether or not they have had sex with one another. There is a risk that subordinated and under-resourced families will have a particularly difficult or strained experience in such a divorce process.¹⁰⁶ This is not only unfair on a systemic level for a society that strives for justice, but it is painful on a personal level for those individuals whose families and needs are ignored, mischaracterized, or marginalized.

The state’s intrusion into sexual and familial choices is a story told in race, class, gender, and sexuality, yet the state will declare its laws neutral.¹⁰⁷ The critique herein is twofold: first, to notice the inadequacy or stubbornness of the law, but also to take the time to name the psychic collateral consequences of our subordinating jurisprudence. An example will help here. Let us consider the law of rape. It is well studied that when Black women report rape, their accusations are under-investigated and under-prosecuted.¹⁰⁸ Yet, in other contexts, the law is swift and careless in its intrusion into Black communities for the purpose of criminalizing the behavior of Black bodies.¹⁰⁹ Kimberlé Crenshaw explains how, therefore, a Black woman may be reluctant to call the police even when she has been raped or assaulted due to an unwillingness to subject her private life to the “scrutiny and control of a police force that is frequently hostile” to the Black community. ¹¹⁰Will her account be heard as an assault as clearly as it would if it had been reported by a white woman? Will her assault and the violation of her sanctity be credited as intolerable as it would if it had been reported by a white woman? This has led, then, to the reality of Black women underreporting violations to their bodies. It has confirmed in the hearts and minds of many in the Black community that the law, for them, is not about protection and safety. There is also lasting psychic harm to Black women, and ongoing risks to their bodily safety.¹¹¹

One can follow a similar path in analyzing separate and apart requirements and inquiries. To begin, requirements and inquiry around separate and apartness are manifestations of not believing—not believing that a family is considering or preparing itself appropriately for divorce; not believing their declarations that a marriage is over. Not being believed takes a psychic toll.¹¹² Secondly, these laws require probing into private spheres, and often, sexual choices. In this way law is primed—designed?—to alienate, ignore, or suppress classes of people, because not everyone’s sexual dignity is held in positive regard, and because the law is tethered to heteronormative arrangements for what the private family sphere is “supposed” to look like. Moreover, inquiry in search of “proof” of separation invades a person’s sense of privacy. Here, I do not refer to privacy in the constitutional sense (though I will do so in later sections of this Article) but rather in the ways in which individuals understand, hold, and value their privacy.¹¹³ Privacy is an elusive concept: Privacy is associated with liberty, but it is also associated with privilege (private roads and private sales), with confidentiality (private conversations), with nonconformity and dissent, with shame and embarrassment, with the deviant and the taboo . . . and with subterfuge and concealment.”¹¹⁴ Perhaps as a consequence, people perceive invasions of privacy differently and bear those invasions differently.¹¹⁵ We are not all situated similarly in terms of the treatment we receive, the ways we are heard, the sense people make of our lives, and our experience of normative expectations.¹¹⁶ As described above, rhetoric about divorce is already punitive and judgmental.¹¹⁷ Black, Indigenous, and people of color (“BIPOC”), LGBTQ+, and under-resourced families, meanwhile, are asking for divorces in the context of their own stigmatization, discrimination, and associated psychic pain. They are asking for divorces in the context of specific stigmatization and discrimination about their families and sexuality.¹¹⁸

1. Stigma & Discrimination

Discrimination contributes to poor health outcomes and specifically affects mental health when the experience alters “one’s perception of self and their surroundings.”¹¹⁹ One’s stigmatized social status can create “unique minority stressors” for stigmatized and disadvantaged populations.¹²⁰ People of color, specifically, are “stressed by individual, institutional, and cultural encounters with racism.”¹²¹ Specific encounters with racism may be aversion, harassment, discrimination, hostility, and violence.¹²² These encounters and experiences can be the source of affirmative trauma or the cumulative experience of them can lead to toxic stress responses.¹²³ Unsurprisingly, studies suggest that these race-based stressors have an impact on BIPOC’s psychological and physical health.¹²⁴

LGBTQ+ people also suffer from individual and institutional discrimination. LGBTQ+ people may suffer from stigmatization by individuals and institutions, which can in turn provoke self-stigma. LGBTQ+ people are specifically subjected to stigmas based on perceptions of illegitimacy: gay and lesbian individuals do not participate in legitimate relationships; transgendered persons do not express their gender in a legitimate way.¹²⁵ Researchers have identified different categories of stigma. “Felt stigma” is the knowledge of society’s perception of you.¹²⁶ Felt stigma can motivate LGBTQ+ persons to “constrict their range of behavioral options (e.g., by avoiding gender nonconformity or physical contact with same-sex friends) and even to enact sexual stigma against others.”¹²⁷ Felt stigma may encourage some LGBTQ+ people to conceal their identity or socially isolate.¹²⁸ Another manifestation of self-stigma is “internalized sexual stigma . . . . Internalizing sexual stigma involves adapting one’s self-concept to be congruent with the stigmatizing responses of society.”¹²⁹ Finally, stigmas of a different flavor plague women and particularly poor women. Since time immemorial, women looking to leave marriages were cast as lustful and deviant.¹³⁰ To this day, poor women, in particular, are subject to commentary about their being imprudent and reckless.¹³¹ Consider, for example, the double standard of marriage as something that is necessary or ideal for mothering. A white celebrity in all her staged glory and living in an environment buttressed by endless support and resources can tell a story of her personal redemption and strength in her decision to be a single mother. The object of a “welfare mom,” however, is scrutinized as having subjected herself, her children, and society at large to her irresponsible decision to mother alone.¹³² Moreover, numerous studies have confirmed that the accumulation of stress present in a life of poverty has adverse health and mental health outcomes.¹³³

Withstanding the domination and control of racist, gendered, or heteronormative systems interferes with one’s esteem and mood states.¹³⁴ It also frustrates one’s locus of control.¹³⁵ In psychology, a locus of control refers to one’s perception that they control what happens to them and around them.¹³⁶ Someone with a strong internal locus of control can believe and actualize that they are the masters of their own destiny.¹³⁷ Individuals with external loci of control are left with the feeling that the world happens to them and that they are powerless to chart or change their path.¹³⁸ The requirements of divorce risk adding to accumulative stress, stigmatization, and a loss of control already experienced by vulnerable families. Additionally, any judgment or rejection during a divorce proceeding about not getting the separation “right” follows a litany of experiences and systems that tell them BIPOC, LGBTQ+, and under-resourced families are not getting family “right.”¹³⁹

2. Getting Family and Sex “Right”

Consider, specifically, the requirement for inquiry into a couple’s decision to cohabitate during a period of separation. As previously discussed, anyone might be annoyed or embarrassed by offering a virtual stranger in open court an account of your bed and board choices, but these requirements and inquiries present particular insult to subordinated populations. Separate and apart inquiries specifically are an intrusion into the inner workings and decision-making in a private realm. For subordinated populations in particular, this private realm is a last bastion of dignity. The experience of the intrusion into a private sphere can be particularly painful and acute for those who weather subordination in the public sphere.¹⁴⁰

As Crenshaw so astutely surmised,

There is . . . a more generalized community ethic against public intervention, the product of a desire to create a private world free from the diverse assaults on the public lives of racially subordinated people. The home is not simply a man’s castle in the patriarchal sense, but may also function as a safe haven from the indignities of life in a racist society.¹⁴¹

Racism’s chronic external, public assaults on dignity create resistance to, or sensitivity about, inquiry and critique of private family decisions that are nuanced and, therefore, more susceptible to racist misinterpretations and biased reasoning.¹⁴² People of color are not alone in distrusting inquiries into private realms or experiencing heightened discomfort during such inquiries. The LGBTQ+ community has borne bias in many spheres of life.¹⁴³ For too many members of the LGBTQ+ community, rejection and judgment started in their homes and families.¹⁴⁴ The rejection of LGBTQ+ children in homes and in school can turn violent.¹⁴⁵ Judgment and hostility in the workplace or public spaces is common too.¹⁴⁶ Far too often, LGBTQ+ families are cast as deviant, illegitimate, or confusing to children.¹⁴⁷ The home spaces and families designed by some members of the LGBTQ+ community are an expression of what is required to build the family or keep it safe from heteronormative hostility.¹⁴⁸ Family design can also reflect a conscious decision to reject gendered norms for a family’s financial and social arrangements.¹⁴⁹ These families may feature partners and children connected in diverse ways.¹⁵⁰ Historic lack of protection—or affirmative criminalization—for the family ordering of LGBTQ+ families leave many LGBTQ+ families with legacies of perceived vulnerability, a perception that can be particularly acute during divorce.¹⁵¹ Preliminary research regarding same-sex couples, for example, suggests that these couples feel a “heightened social scrutiny at the time of a relationship’s end.”¹⁵² LGBTQ+ families are not alone in structuring families that do not fit a rigid heteronormative paradigm—male head of household, female companion, children. Under-resourced communities, foreign-born families, and Black families are all more likely than white affluent families to live in multigenerational homes.¹⁵³ There are racial and ethnic disparities in marriage matters as well.¹⁵⁴ Lastly, there are growing disparities by class concerning modalities for child rearing.¹⁵⁵ When families operate outside of the norm, they raise the hackles of our system of supervision: a class-based system of white, heteronormative supervision.¹⁵⁶

Finally, consider where the inquiry into the private family sphere includes specific inquiry about sex. Domination and control of sex and sexuality is an old tool in the arsenal of oppression. Consider, for example, that there was a time when the law did not acknowledge marital rape as a crime. This was because sex was an “essential obligation of marriage,” and sex between married people was “private.”¹⁵⁷ Bound up in the protection of male entitlement to sex and freedom from scrutiny regarding how they pursued it was systemic acceptance of the domination of women. Eventually, the mantle of marriage could not disguise the violence of rape and the public came to see the law’s willful ignorance of the violence as tantamount to support.¹⁵⁸ The change in law, in turn, better reflected and resisted the dominance and control inherent in rape and acknowledged that dominance and control is no less dangerous and damaging in the context of a marriage. Legacies of domination and control explain why women, BIPOC, and LGBTQ+ persons face the most abuses of their sexual privacy and are vulnerable to critique of their sexual choices in public spheres.¹⁵⁹

Anti-racist scholars have also demonstrated the “sexualized nature of racial oppression.”¹⁶⁰ Since the time of slavery, when Black women were reduced “to a sexual object, an object to be raped, bred or abused,” and onward, Black women’s sexuality has been co-opted and weaponized against them.¹⁶¹ Meanwhile, the hyper-sexualization of Black men is “one of the most prevalent stereotypes in white America’s racial mythology.”¹⁶² Indeed, in family court proceedings and the child welfare context, one still sees that the sexual stereotypes of Black men and women result in the “devaluation” of mothers and the stereotype of the absent father.¹⁶³ The scrutiny of Black parents generally, and their sexuality specifically, is ingrained into our definition of the worthy and unworthy poor.¹⁶⁴

LGBTQ+ communities, meanwhile, have experienced state-sponsored hostility regarding private, consensual sexual expression for centuries. Consider “sodomy laws” for example, which “do not merely express societal disapproval; they go much further by creating a criminal class.”¹⁶⁵ Sodomy laws provide a particularly clear example that the law is often clumsy and mean in its desire and attempts to define, understand, and regulate relationships.¹⁶⁶ Separate and apart laws are no exception to this general rule. Laws and procedures that require probing into family constellations and sexual choices are not neutral or kind—not by design and not in effect. Meanwhile, how we define and dignify intimacy between people and to whom we extend corollary rights to privacy and liberty matters. It has significant implications for equality.¹⁶⁷ When we hone in on considerations of liberty and privacy interests, what also becomes clear with separate and apart laws, beyond the fact that they are bastions of bias and unkindness, is that they are not obviously even permissible.

III. SHE SAID IT’S REALLY NOT MY HABIT TO INTRUDE: INTIMACY, MARRIAGE, AND FAMILY

The legal grounds for doing away with separate and apart requirements and their invasive inquiries are hiding in the shadows where many rights important to families and those in relationship do. Our Constitution does not articulate positive rights, rights securing access to a given thing—education or housing or health care, for example.¹⁶⁸ The quintessential articulation of rights in the U.S. Constitution—the Bill of Rights—articulates a series of negative rights, or limits on the government. The Ninth Amendment does, however, remind us that the enumeration of certain rights “shall not be construed to deny or disparage others retained by the people.”¹⁶⁹ And so, against a scaffolding of governmental restraint and in combination with an explicit invitation to recognize rights of the people, we see a liberty interest the “exactness” of which is difficult to define, but which “[w]ithout doubt . . . denotes not merely freedom from bodily restraint but also the right of the individual to . . . establish a home and bring up children”¹⁷⁰ and an articulation of a privacy right “formed by emanations” from other constitutional guarantees.¹⁷¹ The articulation and application of these rights has been vital to those in families and relationships.¹⁷² These rights as discussed in the context of intimacy cases, right to marry cases, and family rights cases suggest that separate and apart requirements are on shaky constitutional ground.

A. Privacy: Intimacy

In 1965, the Supreme Court asked itself if our society could tolerate the police searching the “sacred precincts” of a marital bedroom for evidence of use of contraceptives. It answered its own question, declaring that “[t]he very idea is repulsive.”¹⁷³ The Court’s language in Griswold v. Connecticut, describing the image of a police officer in the bedroom in order to regulate the intimacy of two adults, was not hyperbolic rhetoric. Rather, the description was reminiscent of the actual encounter that Mr. and Mrs. Loving had with police in their bedroom in 1958 and foreshadowing of state action to come.¹⁷⁴ In 1988, an officer entered Michael Hardwick’s home with a (moot and invalid) warrant for his arrest on another matter, and, upon seeing him in his bedroom having sex, arrested him. Then, in 1998, police entered John Lawrence’s home on a report of a “weapons disturbance,” saw John Lawrence and Tyron Garner having sex, and arrested them.¹⁷⁵ All might have been lost for Lawrence as it was in Bowers v. Hardwick had the Court not recognized that the issue before it concerned “the most private human conduct, sexual behavior, and in the most private of place, the home.”¹⁷⁶ In so doing, the Court finally agreed that the question provoked by a law regulating sex between consenting adults was not a question of what an individual was doing in the privacy of his own bedroom, but rather what was the state doing there.¹⁷⁷ Griswold, Lawrence v. Texas, and their progeny tell us that the bedroom becomes a proxy for “the exercise of . . . personal rights.”¹⁷⁸ These cases also confirm that these rights exist within, but also extend beyond, marital relationships.¹⁷⁹

Danielle Keats Citron argues more specifically that it is “time to conceptualize sexual privacy clearly and to commit to protecting it explicitly.”¹⁸⁰ Citron’s advocacy concerns civil and criminal liability for those who attack and assault the sexual dignity of individuals through any range of behaviors, including nonconsensual pornography, coerced sex, nonconsensual capture of nude images, and so forth, but her analysis affirms concepts important for the issue at hand. Citron defines sexual privacy as “the behaviors, expectations, and choices that manage access to and information about the human body, sex, sexuality, gender, and intimate activities.”¹⁸¹ She argues that sexual privacy combines principles of equality, intimacy, and sexual agency and that recognition of such a right and protection under it allows people to “author [their] intimate lives and be seen as whole human beings rather than as just . . . intimate parts or innermost sexual fantasies.”¹⁸² Protecting the self-disclosure and vulnerability inherent in sex upholds principles of dignity and equality. While the concept of sexual privacy is developed and litigated, cover for the literal and figurative “bedroom” at issue in separate and apart inquiries is undeniably located in the penumbra of privacy interests.¹⁸³ The privacy rights here clearly establish that the state is not, when it comes to consenting adults, permitted to intrude on who is having sex¹⁸⁴ and to what end.¹⁸⁵

One cannot help but notice how these principles of privacy around sexual intimacy erode completely in the context of separate and apart requirements. Indeed, in the context of divorce, at least, the needle has moved since the early and deeply influential articulations of why privacy matters. Samuel Warren and Louis Brandeis, in their seminal contributions to the conversation of privacy, repeatedly emphasized protection of “thoughts, sentiments, and emotions,” not just the body and property.¹⁸⁶ They made their impassioned case for privacy following publicity and specifically photography of a wedding at which the many Boston elite were present. To their thinking, by photographing the wedding and making those pictures available for public view, the press was laying bare “the sacred precincts of private and domestic life.”¹⁸⁷ If photographing marital joy was so compelling to early proponents of privacy, how can seemingly superfluous inquiry at the time of a divorce not seem problematic? Consider, for example, in Bergeris v. Bergeris, from the year 2012—not 1812—in which we see a court probing the interactions of a couple to determine whether and what type of phone sex they had.¹⁸⁸ The probing occurred despite Maryland ostensibly being a no-fault jurisdiction. The probing occurred despite the procedural posture of the case, in which Ms. Jeanine Bergeris sought and received a protective order against Mr. Bergeris, and both parties had—at varying times in the history of the case—sought limited or absolute divorces from one another.¹⁸⁹ And, in which, the scrutiny of “sexually explicit telecommunications” forestalled the divorce of this couple, despite their having been locked in litigation for two years during which time one or both of them was seeking one.¹⁹⁰

Privacy for sexual intimacy is not the only substantive right important to families hanging out in the shadow of liberty interests.¹⁹¹ One can see declaration after declaration that “[t]here . . . exist[s] a ‘private realm of family life which the state cannot enter.’ ”¹⁹² As stated in Carey v. Population Services International, “[w]hile the outer limits of [the right of personal privacy] have not been marked by the Court, it is clear that among the decisions that an individual may make without unjustified government interference are personal decisions ‘relating to marriage, procreation, contraception, family relationships, and child rearing and education.’ ”¹⁹³ Accordingly, the Court has admonished laws that abridge the freedom of personal choice in matters of family life.¹⁹⁴ The family realm, as a site for making choices for and about one’s family, has been afforded both substantive and procedural protection.¹⁹⁵

B. Family Rights

Families’ privacy and liberty interests can link in important ways to their survival.¹⁹⁶ This liberty interest has translated into the law affording families’ choices dignity, respect, and a wide berth.¹⁹⁷ Family survival has, in turn, always included the notion of change or restructuring.¹⁹⁸ Any suggestion that families journeying through a divorce are no longer families or will no longer be families once the divorce is finalized is intellectually dishonest and demeaning. To begin, any argument that the end of a marriage means the end of a family does not track with common sense or with the Court’s recognition of many non-nuclear or bi-modal families.¹⁹⁹ Moreover, statutes adjacent to divorce, namely support, child custody, and property distribution statutes, confirm that divorced families will still be tied to one another through continued coordination, support, or cooperation, even while each spouse will be entitled to independence from the marriage. Property distribution statutes, for example, do not just consider spouses’ past contributions to marital property and past acquisitions of assets and income to design equitable distributions, but also consider spouses’ future opportunities for acquisition of assets and income, and forward-looking needs in terms of providing care for any children.²⁰⁰ Custody statutes will ask about the living arrangement and structure of care to which children are already accustomed while also asking questions about a parent’s willingness and capacity to shape new and presumptively shared custody arrangements going forward.²⁰¹ Alimony statutes call out spouses’ past contributions to the achievements of the other or running of the household, while also enumerating forward-looking considerations of spouses’ abilities to find employment or achieve financial independence. In this way, the jurisprudence around care of children, support, and division of property reflects the complex reality of divorced families: while the pathways for two divorcing individuals is diverging, there is a history that binds them and a tomorrow that involves them both.

Prior to any restructuring contemplated above, there is a limbo period during which spouses are contemplating divorce or are in the process of negotiating or litigating a divorce. What couples are in this stage is still married. And what the couple is doing at this stage is making choices. These choices might include choices about how to spend money, how to organize their affairs, and how to care for children and prepare them for their new reality. Couples’ status as (still) married and the choices they are confronting provoke liberty interests. Direct and easy application of family rights doctrine should forestall court inquiry into the private realm of their family dealings.²⁰² Parents of a child, for example, may make decisions to continue to share physical space and even a degree of intimacy as part of a larger vision of how to provide the best care for a child during a time of emotional and financial upheaval.²⁰³ This ability to decide how to raise one’s child is a clearly constitutionally protected interest.²⁰⁴ Families’ interest in childrearing, their interest in protecting their family, and their interest in creating social and legal order were considerations Justice Kennedy named as buttressed to the right to marry. He wrote: “marriage is inherent in the concept of individual autonomy”; marriage is an “intimate association,” a “union unlike any other in its importance to the committed individuals”; “the right to marry . . . safeguards children and families”; and finally, “marriage is a keystone of the Nation’s social order.”²⁰⁵ Taken together, these four principles put flesh on the bones of the interests bound up in marriage.

C. Right to Marry

In Obergefell, the Court had relatively recent occasion to write its latest love letter to the institution, agreeing with sentiments from Courts before it that marriage is the “relation . . . most important” in life,²⁰⁶ and that freedom to marry is a “vital personal right[] essential to . . . happiness.”²⁰⁷ And yet, our nation has a shameful history of denying access to marriage for all sorts of reasons. Until appallingly recently, many states had anti-miscegenation laws on their books. When, in 1968, Loving v. Virginia declared such laws unconstitutional, fifteen states in addition to Virginia had similar laws.²⁰⁸ And there was not a clear, unencumbered pathway for same-sex couples to marry until 2015.²⁰⁹

Loving, Obergefell, and their progeny clarify and confirm that the state may not “significantly interfere” with decisions to enter a marriage, but it is well understood that states can and do regulate marriage, both in terms of one’s entrance into it and exit from it.²¹⁰ With few remaining constraints, however,²¹¹ you can decide to be married and you can—relatively immediately—be married.²¹² In contrast, there is no prohibition against interference regarding the decision to divorce. The only restrictions on states are that they cannot deny access and opportunity to be heard to end the marriage and they must extend full faith and credit once a jurisdiction has pronounced a divorce.²¹³ Thus, states police both the gateway to marriage and the gateway to divorce, but they are neither the same gate nor do they swing with equal ease or open to equal breadths.²¹⁴

Those arguing for a “right to unmarry” take issue with the fact that the process to divorce is so encumbered, as compared to the process to marry.

The government promotes marriage by making it fast and easy, at least if it’s your first marriage. In states like Nevada, you can even get married on the spot. By contrast, divorce is slow and burdensome. It can take many months and inevitably requires many filings. Unlike marriage, which is essentially a ministerial act, divorce typically requires legal representation, multiple filings, court appearances, and considerable expense. You can get married on a lark, but getting divorced is always a bear.²¹⁵

They argue—quite convincingly—that all four Obergefell principles regarding marriage apply to a right to a prompt divorce. In its argument that the fundamental right to marry “must apply with equal force to same-sex couples,” the majority opinion relied upon four principles: (1) individual autonomy; (2) intimate association; (3) the promotion of familial relationships; and (4) social order.²¹⁶ In articulating these principles, the Obergefell Court declared that marriage “draws meaning from related rights of childrearing, procreation, and education” and that choices about marriage “shape an individual’s destiny.”²¹⁷ Proponents of the right to unmarry suggest that “[i]f it offends autonomy and dignity” to prohibit a given marriage, then surely it “offends autonomy and dignity” to bind someone to a marriage they no longer wish to be part of, particularly where that bind constricts their ability to marry another.²¹⁸

One can see that protecting the “choices” and “destiny” of those in a marriage means nothing—and in fact sets us back hundreds of years—if we then limit the acceptable choices to only those that reflect a willingness to stay bound to a marriage no matter the consequences to safety, psychology, or finances. But a stronger, or additional, argument might thread the needle a little differently. One can argue that the principles in Obergefell apply directly to a divorcing couple because, during a divorce proceeding, a couple is married. The operation of laws confirms this simple truth: until parties are actually divorced, they are married. They cannot, for example, remarry in the interim without risking the subsequent marriage being deemed polygamous.²¹⁹ Property acquired before a divorce is final can be deemed marital property, and property disposed of before the divorce can be seen as a party dissipating assets.²²⁰ Couples engaged in divorce proceedings should, therefore, be entitled to privacy in any intimate association they choose to maintain and deference to their sound discretion concerning child rearing and creation of stability and predictability, because doing so will indeed serve to preserve social and legal order.²²¹

These constitutional mandates taken in combination with one another are suggestive of a substantive due process right to be able to maintain privacy and demand state deference to familial decision-making during a divorce. These protected rights of family liberty and privacy should foreclose parties from having to submit to a hearing about their choices to engage in sex, share meals, or occupy similar space.²²² There may also be equal protection challenges embedded in the pronounced burdens that separate and apart laws place on to particular social groups.²²³ But, what both the typology of harms and the analysis of the rights and interests show more generally is that separate and apart requirements are baffling and problematic. They are a “solution” in search of an actual problem, which meanwhile ignores the legitimate needs of families in transition. Perhaps this should surprise no one because family law jurisprudence and the associated rights and obligations so rarely reflect the needs and interests of families and the individuals that make up those families; rather, they are a reflection of prevailing political forces and wills.²²⁴ Divorce, in particular, is “a lightning rod for deep-seated political anxieties that revolve[] around the positive and negative implications of freedom.”²²⁵ And we have a long and particularly brutal history of disregarding or distorting the familial rights and interest of subordinated groups.²²⁶

IV. MAKE A NEW PLAN, STAN

If we are to design laws and procedures that protect families, it is worth pausing to name, even in the most general sense, what matters to families. What appears to matter—sociologically, psychologically, and historically speaking—is (1) stability and predictability for raising their children and ordering finances and property; and (2) being afforded dignity and respect.²²⁷ Separate and apart requirements, and the invasive inquiries that some requirements provoke, defy these interests. They deny some families a path to the clean, expeditious exit that they need, despite it being “socially and morally undesirable to compel a couple whose marriage is dead to remain subject to its bonds.”²²⁸ For other families, it forces social arrangements that feel premature, unnatural, or disadvantageous to a family’s plan for transition. In contrast, the proposal herein better protects and reflects a commitment to stability and predictability and dignity and respect. Preliminarily, divorce law must be rid of separate and apart requirements. From there, one could more easily contemplate divorce as an administrative and civil—not judicial—matter, just as marriage is a civil and administrative matter. Divorce could then be bifurcated from subsequent adjudication of custodial, property, and support disputes where the circumstances and needs of a family require adjudication of those matters.

A. Stability and Predictability

In a manner relatively consistent since the Victorian era, divorce has been cast as the cause of many social ills—sex-crazed men and women, unrestricted by a commitment to have sex only in order to have children and then raise children together; vagabond children left by the aforementioned parents; impoverished women-led households.²²⁹ Today, conservative pundits insist that decline in marriage is the cause of children struggling in school, financial strife, and even crime and violence.²³⁰ Rather than unearth the complicated social realities and psychology that contributes to struggling marriages or undertake public policy reform to address social ills, it is far easier to posit marriage as “the solution to poverty, violence, homelessness, illiteracy, crime, and other problems.”²³¹

Yet, far from causing all social ills, divorce actually provides important social and financial recalibrations for many families.²³² As way of example, let us begin with an honest look at the ubiquitous claim of many pro-marriage and antidivorce activists: what about the children?!? Divorce obviously affects children, and studies have rather consistently concluded that the event of a divorce will produce measurable anxiety and depression for many children.²³³ What early studies failed to ask, however, was how long or pronounced that suffering would be; and moreover, how evidence of anxiety, depression, or clinical antisocial behavior was linked to the quality of family life prior to divorce.²³⁴ More nuanced studies reveal that children in marriages marked by high dysfunction suffer, so their antisocial behavior decreaseswhen parents dissolve unhappy marriages.²³⁵ Other studies suggest a positive relationship between divorce and measures of increased resilience across time.²³⁶ Paying attention to the experiences and outcomes for high-conflict families is particularly important when one considers the reality that divorce is also a means of escape from affirmatively abusive environments. Approximately, twenty-five percent of divorces are initiated in response to domestic violence.²³⁷ There is also an inverse correlation between divorce rates and domestic violence rates: “In the first five years after the adoption of no-fault divorce, divorce rates did indeed rise, but the domestic violence rates fell by about 20 to 30 percent, and wives’ suicide rate fell by 8 to 13 percent.”²³⁸

Even in situations that do not overtly threaten one’s personhood, divorce can increase predictability and stability because it opens the door to structuring parenting and finances.²³⁹ It is no secret that divorce can create economic hardship and that this hardship disproportionately affects female headed households, but it is not true that financial independence and competence would have been assured if the marriage had remained intact.²⁴⁰ Similarly, it is true that custodial disputes can be contentious and painful, but it is not true that marriages produce equal and adequate parenting.²⁴¹ Oftentimes, divorce recognizes the truth that some people reach more authentic or sustainable parenting and financial arrangements when they are apart. And indeed, it is only through divorce and not within an intact marriage that parties have a legal right to equitable distribution of property, claims for support, and claims of custody that are severable from that of the child’s other parent.²⁴² It is only in the context of divorce and not marriage that parties can seek intervention of the court to help them act on these rights.

Whether the assistance of the court or an internal reckoning gets them there, divorces can and do change people’s choices regarding their parenting and decisions to work or pursue training. After studying nearly fourteen hundred families, Mavis Hetherington describes custodial parents learning to round out their skills as parents; for example, a parent may learn more about discipline and control or strive for greater kindness and softness when the parent cannot offload some aspect of parenting on the other parent and must develop the skills themselves.²⁴³ She also writes about a group she calls “divorce activated fathers”²⁴⁴ who “begin to do all the things they were too busy to do before divorce or had relegated to their wives,” for example, soccer games and school plays.²⁴⁵ Other individuals studied reported that divorce ignited an opportunity or a motivation to go back to school, find work, or switch jobs. For many of these divorcees, these changes in their roles and ways they conceived of themselves in their families led to self-discovery and empowerment.²⁴⁶

B. Dignity and Respect

Perhaps precisely, because divorce is a pathway to refiguring one’s social, financial, and custodial relationships, leaving a marriage can be an important expression of agency and self-determination. Self-Determination Theory looks to human experience to understand what motivates people and posits that the ultimate goal for any intervention is inspiring a person’s optimal functioning. The theory suggests that three basic psychological needs are associated with increases in wellbeing: autonomy, competence, and relatedness. Autonomy refers to the need to have an independence of being; competence is defined as the desire to “master one’s environment”; and relatedness refers to the desire for meaningful social interactions. Scholars have described marriage as an “expressive resource” and that commitment to marriage is an expression of association and personhood. ²⁴⁷ David Cruz argues that that ability to hold oneself out in a relationship recognized by civil law, and not just social reality, is an expressive resource.²⁴⁸ He made this case in 2001 to argue for same-sex marriage, but the notion easily applies to divorce as well. A decision to divorce and an appeal to have that divorce recognized by law is an expression of needs and choices about the continuation of a union with another person and the associated intermingling of finances, property, child rearing, and habitation. Limiting an individual’s expression inside marriage to only those decisions and behaviors that commit to the marriage constrains one’s self-determination. Divorce can be a valid expression of one’s autonomy, decision-making, and desire for a different manner or source of relatedness. Divorce allows people the opportunity to resituate themselves in new relationships or outside of any relationship at all. Additionally, in ways unavailable to them in an intact marriage, those seeking divorce are able to pursue orders for equitable reallocation of property or for support that may alter financial and power dynamics with their spouses in important ways.²⁴⁹

It is not just the ability to divorce that matters for one’s self-determination; freedom from public scrutiny regarding the mode and manner of separation while divorcing has implications for one’s self-determination and mental health as well. Much of divorce reform acknowledges that judges have no business probing into families’ personal issues, because their doing so is beside the point if the couple themselves declared their marriage “dead” and because such probing produces “gut-wrenching pain” and injures families.²⁵⁰ Moreover, such probing is also fated to skew toward normative bias or whimsy, which in turn tend to ignore, silence, or diminish voices of those who fall outside the dominant narrative.²⁵¹ Nothing defeats a sense of autonomy and competence like being told “you might have meant x, y, or z by your words and actions, but we find your interpretation of your own words and actions less valuable than our own interpretation of your words and actions.” Moreover, certain requirements of separation ask families to distort or hide their truth by either rearranging themselves or avoiding certain disclosures, or they incentivize families to misrepresent themselves. Truth-telling, meanwhile, promotes social emotional health. Research suggests that truth-telling strengthens the connection between our prefrontal cortex—our “adult” brain—and our limbic brain—our “child” brain. Truth-telling is literally good for our brains.²⁵²

What if, instead of substituting our judgment about what the end of a marriage looks like or incentivizing distortion to satisfy our judgment, we simply said “okay” when a party said, “I wish to no longer be bound by marriage to this person?” What if our process reflected the reality that when couples are struggling deep in the heart of the matter about their choices—the good ones and the mistakes—they do not need or desire a judicial officer to ask them to wait or organize their life a certain way before deciding to divorce or while undertaking the divorce? What if we could avoid forcing “efficacious resolution of economic issues and custody” to take a back seat to the timeline of divorce by decoupling these issues from the right or opportunity to divorce? We would then be free to conceptualize alternative dispute resolution alternatives that can improve outcomes for the thorny issues of custody and economic issues.²⁵³

C. Prompt Administrative Divorce

This Article proposes that divorces should be administratively and promptly issued upon the filing of a request by an aggrieved party to a marriage; such that, thereafter, issues of support, equitable distribution, and custody can drive the manner and mode of adjudication or alternative dispute resolution. First, one must note that this proposal flips the script on divorce, suggesting that a pronouncement of a divorce can be disaggregated from and precede final resolution of matters of property, custody, and support. Currently, in all states, a divorce starts when a party files a complaint and serves it on the other; after which time the parties enter a “kind of purgatory,” a “pendente lite stage.”²⁵⁴ During this time, the court holds hearings or the parties submit agreements regarding temporary decisions on parenting, support, and use or access to certain marital property (such as homes or cars) while the case is pending. After (in some cases) protracted discovery or (in all cases) protracted waits due to court congestion, matters are calendared and heard in final hearings, or negotiated final agreements receive judicial blessing. The meat of these final hearings and agreements are the minutia and nuance of the same issues that were handled initially and temporarily, namely custody, support, and property.

The embedded notion is that a party cannot be divorced until matters of custody, support, and property are squared away. But why? It is a social and legal myth that parties cannot negotiate and contract regarding support and property or negotiate and mediate about parenting children outside of the bonds of marriage. As a matter of law, it is actually possible, for example, to grant a divorce and table or calendar matters of custody for a final hearing. Socially, we know full well that many children are raised in households by unmarried parents or are raised in and by two separate households.²⁵⁵ Moreover, even in the current regime, the litigation of custody, support, and property issues often persists and survives after the dissolution of marriage via motions to modify or motions to compel.²⁵⁶

In addition to disaggregating the divorce itself from resolution of corollary matters, this proposal conflates or includes the concept of shortening waiting periods with freedom from requirements during that waiting period. Any proposal to promptly issue divorces and do away with separate and apart requirements can find comfort in the fact that plenty of states do not have them, and the world appears to have kept on spinning. States such as Alaska, Nevada, New Hampshire, Wyoming, South Dakota, and Idaho have short waiting periods for divorce.²⁵⁷ These same states do not have involved requirements for demonstrating separation in order to qualify for the divorce.²⁵⁸ When one cross-checks “quickie” divorce states against other states, one is struck by the fact that nothing is striking. To begin, aside from Nevada, which has a distinct explanation for being an anomaly,²⁵⁹ divorce rates in these other low-bar states are on par with other states, and even Nevada is not alone in being a high divorce rate state.²⁶⁰ But, then again, separation periods and normative standards for periods of separation are about creating predictability for children and economic and psychological security for families. So then, surely the citizens of Alaska, Nevada, New Hampshire, Wyoming, South Dakota, and Idaho must be floundering in a state of civic and familial chaos. But no. No, they are not: measures of social services consumption, child welfare statistics, school performance data, and so forth are all unremarkable compared to other states with more stringent divorce requirements.²⁶¹

The question then becomes what, if anything, should the requirements be for the administrative pleadings and requests for divorce. Here again, we see examples of jurisdictions offering opportunities for “summary dissolution,” “streamlined dissolution,” or “simplified dissolution” to offer parties efficient, less public, and more cost-effective dissolution of their marriage.²⁶² These processes still require judicial approval, but they do not contemplate a trial and instead invite parties to craft their own agreements. States allowing these dissolutions may impose limits on assets, requests for spousal support, or length of marriage, or they may be limited to cases in which there are no children. France has taken this approach one step further and allowed matters that can be handled summarily to move forward without judicial involvement at all.²⁶³ The same is true in Australia, where uncontested divorces with no children can be obtained by administrative procedure through the mail.²⁶⁴ Denmark similarly allows for an administrative procedure in uncontested cases.²⁶⁵

An expeditious administrative process supports the goals of creating a system that respects the families utilizing it, as well as the goal of creating predictability and security for families. To begin, an administrative process that separates requests to divorce from requests for the court’s assistance with property, support, and custody better reflects several important realities beleaguering the family courts and harming the families who are forced to engage with the courts. Family courts are overcrowded and inefficient.²⁶⁶ Family courts also deal with vast numbers of pro se litigants.²⁶⁷ As compared to represented parties, pro se litigants are more likely to have substantive or procedural missteps such as missed deadlines or deficient filings. As a family court judge and two practitioners put it,

This perfect storm created by a void of knowledge of procedural, substantive, and evidentiary law on the part of individuals stuck in a system to deal with unhappy, very personal, and, at times, highly conflicted matters results in an unnecessary overuse of judicial resources and a growth of the backlog in the court’s docket.²⁶⁸

Under this new proposal, certain divorce scenarios would come off the court docket all together, clearing room for those matters that require more time and attention. Other matters could come before the court, not automatically upon the filing of a divorce, but rather when the families’ own needs and energies direct them to file. Some parties may feel they need court intervention to understand, negotiate, and contract around their property interests, support needs, or child custody issues; some parties may not.²⁶⁹ Some parties may choose to merge and incorporate settlement agreements into court orders, while other parties may not.²⁷⁰

Courts and legislatures recognize that the more process a law requires or inspires, the greater the delay and that the greater the delay, the greater the agony.²⁷¹ Social science literature—and if we are honest with ourselves, our own lived experiences—buttress this conclusion. People do not like to wait; and waiting for uncertain time periods, for an uncertain outcome, is the worst kind of waiting.²⁷² People become agitated and irrational under these conditions; “[w]aiting in ignorance creates a feeling of powerlessness, which frequently results in visible irritation and rudeness.”²⁷³ The psychology of waiting is often studied in connection with customers waiting in line, so one must consider how these same psychological tendencies toward frustration and anger will be amplified when the matter at hand—a divorce—is more socially emotionally fraught than a trip to a customer service center. Consider, for example, patients asked to wait for medical procedures due to COVID-19 protocols and barriers. Here, patients showed marked symptoms of mental distress as they waited to undergo their procedures.²⁷⁴ To add insult to injury, the psychic toll did not just cause suffering in the patient, but it also adversely impacted patients’ trust in the health care system.²⁷⁵ As it turns out, such findings do and have translated onto the legal system generally and family courts specifically. Delay upon delay with uncertainty about the divorce risks exacerbating the social-emotional stress a family is under and threatens to diminish litigants’ ability to work together and confidence in the legal system.²⁷⁶

Simplifying and truncating the line between wanting a divorce, filing for a divorce, and getting a divorce has advantages for almost every type of couple who the family court sees. The law lacks teeth on the issue of divorce itself; so much of divorce law is actually about regulating or apportioning property and money among family members to support the individuals and bimodal family constellations arising out of breakdowns in the married, nuclear family.²⁷⁷ Some couples struggle financially to create separate households or face contentious divorces. Without the benefit of advocacy and assistance, separation periods are rarely productive and simply impede the couples’ full opportunity to use the resources of the court and the force of the law to plan and prepare for a new future. A prompt administrative divorce clears the way for these couples to immediately seek final resolution for the matters that will help them prepare financially and logistically for their next chapter. In contrast, some resourced and represented couples are able to negotiate agreements about property, support, and child support. These couples do not need active involvement of the court to broker agreements, but they need the court’s prompt attention to finalize them. For these resourced couples, the murkiness created by periods of separation prior to divorce can create confusion around what holdings or debts are marital property.²⁷⁸ Clearly delineating the moment of divorce from the period of negotiation and possible adjudication regarding property and support interests clarifies which choices and conduct were “marital” and which were not.²⁷⁹ Still other couples have “simple” cases where they own little to no property and have no children. Despite shifts in divorce law leaving the courts with little to no authority over the matter of divorce itself, these couples must queue up, adding to the clogged docket, missing work for interim court appearances, and waiting—sometimes years—for a pronouncement of what they themselves have known all along: their marriage is over.²⁸⁰ The current divorce system not only creates all these inefficiencies and delays, thus forestalling family problem-solving, but it also decentralizes the problem-solving.

Court filings automatically trigger the involvement of bureaucratic authority, which can be demoralizing or unnecessary for many families. Administrative trends reflect a jurisprudential reality that divorces seem less like a “legal matter in need of adjudication and more a private matter subject to administrative regulation.”²⁸¹ Administrative divorce trends, meanwhile, also mirror the parallel practice of alternative dispute resolution (“ADR”) trends. Many states, even those without summary dissolutions on the books, allow families to use ADR such as mediation to settle their disputes before seeking judicial approval. Such practices promote parties’ self-determination over personal matters and their everyday lives.²⁸² One argument is that court processes and the role of judges are often about rewarding and punishing or incentivizing and barring. These frames contemplate one winner and one loser. The frames are not comfortable or appropriate for people trying to share property equitably or contemplate some sort of partnership to raise children.²⁸³ Others point out that the process of re-conceptualizing family relationships in a new family system is slow, iterative, deeply personal, and ideally collaborative. The divorce process is not currently designed with the space to negotiate, grieve, try, fail, and try again. A system that reduces dockets and narrows issues before the tribunal might better foster opportunity to design systems and processes more respectful of, and responsive to, families’ needs.

CONCLUSION

The origin story of separate and apart requirements is a legacy of the bizarre and lasting stalemate between what people were doing inside unhappy or unhealthy marriages and the technical lawful authority to divorce.²⁸⁴ Surely now we can begin to narrow that divide between law and society, because after all, the train has left the proverbial station.

About half of all Americans over the age of 18 are married, but an increasing number of them have been married before.Over the past ten years, the number of cohabiting adults over the age of 50 has increased dramatically, from 2.3 to 4 million. More than one-quarter of married men in their seventies report having had an intimate relationship with someone other than their spouse. The grey divorce rate—the divorce rate for those age 50 and older—doubled from 1990–2015, although it remains significantly lower than the rate for those under 50. Almost a third of Americans (30%) have a step- or a half-sibling. . . . As many as 5% of Americans are polyamorous, having serious intimate relationships with more than one person at the same time. Approximately 40% of children are born to unmarried mothers, but more than a third of those mothers are cohabiting at the time they give birth.²⁸⁵

“[S]tatistics have normative as well as empirical implications.”²⁸⁶ One can see these implications playing out on our TV screens, in our neighborhoods, our schools, our work, and our social spaces. Families increasingly “do” family all sorts of ways—ways that suggest that family is a beautifully nuanced thing. Bound up in this, is the reality that marriage must also be a beautifully complicated thing—or at least something that is about more than sex and meals. It is illogical to hover the magnifying glass on these issues when a party seeks to end their marriage. Moreover, this piece suggests it is acutely painful and harmful to certain populations and constitutionally precarious to do so. A simple conclusion flows from all of this: these requirements do not make sense. They require performance and permission that is neither necessary in life nor should be acceptable in law.

96 S. Cal. L. Rev. 77

Download

Claire P. Donohue*

* Assistant Clinical Professor, Boston College Law School, Director of Interdisciplinary Practice and Family Law Professor. With thanks to the members of the Boston College Summer Writers’ Workshop for their comments on this project; and thanks also to Laura Robinson for her tireless enthusiasm, even for my most tedious asks; and to Karen Breda, Boston College Law School Librarian and Lecture, who I am convinced, could find anything anyone ever asked for.

Fractionalization to Securitization: How the SEC May Regulate the Emerging Assets of NFTs

Lauren Au*

Blockchain technology opened the world to a variety of new technological advances that reshaped the way humans interact and transact

Ditching Daimler and Nixing the Nexus: Ford, Mallory, and the Future of Personal Jurisdiction under the Corporate Consent and Estoppel Framework

Shauli Bar-On*

While personal jurisdiction is intended to assess whether a defendant should be forced to defend a lawsuit in a location

Toward a New Fair Use Standard: Attributive Use and the Closing of Copyright’s Crediting Gap

John Tehranian*

A generation ago, Judge Pierre Leval published Toward a Fair Use Standard and forever changed copyright law. Leval advocated for the primacy

The Invention of Antitrust

Herbert Hovenkamp*

The long Progressive Era, from 1900 to 1930, was the Golden Age of antitrust theory, if not of enforcement. During

Fifty Ways to Leave Your Lover: Doing Away with Separation Requirements for Divorce

Claire P. Donohue*

Despite the evolution of no-fault divorces, which were intended to remove certain barriers to divorce and essentially make any divorce

Justice Breyer’s Friendly Legacy for Environmental Law

April 2023April 2024Richard J. Lazarus*

Environmentalists did not cheer President Bill Clinton’s decision in May 1994 to nominate then-First Circuit Judge Stephen Breyer to fill Justice Harry Blackmun’s seat on the Supreme Court. Just the opposite. Many instead expressed serious concerns about Breyer’s impact on environmental law were he to be confirmed, and openly questioned whether a Justice Breyer might be “hazardous to our health.” This Article considers whether, in light of Justice Breyer’s actual record over the past twenty-seven years on the Court, environmentalist concerns about him at the time of his nomination were realized. The Article concludes they were not. Justice Breyer was instead friendly to environmental protection concerns even if he fell shy of being an unqualified friend on the bench. In almost all of the most important environmental cases of the past twenty-seven years, he was a reliable vote joining the majority in the big cases environmentalists won—often providing the critical fifth vote. And although Justice Breyer on a handful of occasions was less a reliable vote in dissent with liberal justices sounding the alarm in the big cases environmentalists lost, in none of those cases was his vote dispositive of the outcome. For this reason, although environmentalist concerns at the time of Justice Breyer’s nomination were reasonable, and had the potential to cause the very problems environmentalists identified, they proved largely insignificant in actual application. Finally, Justice Breyer’s actual record on the Court suggests the wisdom of rethinking what it means to be a “dream” justice for environmental law. Most simply put, the best Justice for environmental law may not be a Justice who always votes in favor of the outcome favored by environmentalists in individual cases.

INTRODUCTION

Environmentalists did not cheer President Bill Clinton’s decision in May 1994 to nominate then-First Circuit Judge Stephen Breyer to fill Justice Harry Blackmun’s seat on the Supreme Court. Just the opposite. While Justice Breyer had his defenders, environmentalists mostly expressed serious concerns about Justice Breyer’s impact on environmental law were he to be confirmed. And some denounced him on that ground, worrying that he might be “hazardous to our health”¹—a concern strikingly similar to that which had been expressed a year earlier by attorneys advising President Clinton when the President had first considered Justice Breyer for a vacancy on the Court.² Yet, ironically, although those concerns had abruptly derailed Justice Breyer’s nomination only hours before its expected announcement in 1993, they became a major reason why the President chose Justice Breyer over the President’s first choice for the nomination in 1994. Senate Republican leaders threatened to wage an all-out campaign against the President’s first choice, Secretary of the Interior Bruce Babbitt, because of his reputation as an unabashed environmentalist.³ And that same Republican leadership promised smooth sailing if Justice Breyer were instead the nominee because Justice Breyer had expressed concern about unduly costly environmental protection requirements and therefore was perceived, unlike Babbitt, as pro-business.⁴

Justice Breyer’s recent retirement from the Court after twenty-eight years⁵ provides an opportune moment to reflect on his legacy for environmental law based on his actual record—as reflected in the votes he has cast and the opinions he has written. More specifically, this Article considers whether Justice Breyer’s record on the Court confirms or contradicts the expectations of his supporters and detractors more than a quarter century ago.

To that end, the Article is divided into three Parts. Part I reviews the events surrounding Justice Breyer’s nomination and the role that environmental law then played in securing both his nomination and confirmation. Part I includes discussion of previously undisclosed information long buried in the official archival papers of President Clinton related to the decision not to nominate Justice Breyer in 1993. Part II reviews Justice Breyer’s record in environmental cases before the Court, with special emphasis on opinions he wrote in those cases, whether majority, concurring, or dissenting.

Part III considers whether, in light of Justice Breyer’s actual record on the Court, environmentalist concerns—shared by some advising the White House—about Justice Breyer were realized. Although those concerns were understandable and pertained to the very problems that White House advisors and environmentalists identified in 1993 and 1994, they have proven largely insignificant in actual application. In the vast majority of environmental law cases heard by the Court during Justice Breyer’s tenure to date, Justice Breyer has both displayed heightened sensitivity to environmental protection concerns and voted in a manner sympathetic to environmentalists, without expressing any concern about environmental protection requirements being too demanding. And, in those relatively few cases in which his concern with unduly stringent environmental law was relevant to a legal issue’s resolution, Justice Breyer’s views made no difference to the outcome of the case, nor has he written an opinion of the Court in any of those cases, instead at most writing a separate concurring opinion of no legal effect. In only one relatively unimportant case that defied application of liberal or conservative ideology did he ever supply the decisive fifth vote against environmentalists,⁶ likely because the Court has been consistently dominated by at least five more conservative Justices ever since he joined the bench.

I. WHITE HOUSE ADVISOR CONCERNS AND ENVIRONMENTALIST OPPOSITION TO JUSTICE BREYER’S NOMINATION

President Clinton’s decision to nominate then–First Circuit Judge Breyer to fill Justice Harry Blackmun’s seat in May 1994 was remarkable because only one year earlier, the President had decided at the last minute not to nominate Judge Breyer to fill Justice Byron White’s seat on the Court. In June 1993, Judge Breyer had been the expected nominee, bolstered by the strong support of Massachusetts Senator Ted Kennedy.⁷ But in what was dubbed by the New York Times as “A Surprise Choice,” Clinton instead tapped D.C. Circuit Judge Ruth Bader Ginsburg to fill the opening.⁸

According to press accounts at that time, Judge Breyer’s nomination stumbled in the final day and hours because of reports that he had failed to pay Social Security taxes on the wages of a part-time housekeeper.⁹ Judge Breyer’s problem was an especially sensitive one for the White House because the President had recently suffered repeated embarrassment when his two successive choices to serve as the first female Attorney General were abandoned because of problems with their domestic household employees: the first had reportedly failed to pay taxes for a childcare provider and the other had failed to inform the White House that she had once hired an illegal alien as a household employee (although, in her defense, that hiring was not itself unlawful when it occurred).¹⁰ And, as much as the Clinton White House sought to distinguish Judge Breyer’s circumstances, the specter of excusing conduct from a man that resembled conduct that had only recently derailed two female cabinet picks apparently cratered Judge Breyer’s nomination.¹¹ Judge Breyer also reportedly interviewed poorly with the President not long before Ginsburg was announced—which Judge Breyer’s supporters blamed on painkillers he was taking while recovering from a serious bicycle accident for which he had recently been hospitalized.¹²

But what was not reported at the time is that White House concerns about Judge Breyer were also substantive in nature.¹³ In early June 1993, White House Counsel Bernie Nussbaum asked Joel Klein, a highly skilled and trusted D.C. private sector attorney, to conduct a confidential review of both Judges Ginsburg and Breyer to assist the President’s decision. Over the course of just a few days, Klein orchestrated detailed reviews of the record of both judges by forty lawyers at six law firms. And Klein provided Nussbaum a comparative analysis of the two judges in memos dated June 10 and June 11¹⁴—precisely the time period when President Clinton pivoted away from Judge Breyer in favor of Judge Ginsburg, whose Senate confirmation Klein then championed in his new role as Deputy White House Counsel.¹⁵

The reviews were devastating to Judge Breyer’s prospects. As summarized by Klein to Nussbaum, “Judge Ginsburg more closely meets the President’s articulated standards for the Supreme Court than does Judge Breyer.”¹⁶ Her “work has more of the humanity that the President highly values and fewer of the negative aspects that will cause concern among some constituencies.”¹⁷ “She has written more, and consistently, about the human condition and the plight of the disadvantaged, and she has done so with obvious conviction and commitment.”¹⁸ In a draft memo to Nussbaum on Judge Breyer, while describing Judge Breyer as “a brilliant jurist” with “the potential to rank with the most distinguished judges in our past,”¹⁹ Klein also described the “dispassionate” nature of his writing and how “he does not wear his heart on his sleeve.”²⁰ Klein added that Judge Breyer’s views on government regulation such as environmental risk regulation were “conservative” and in a recent book he authored on risk regulation, he had proposed “a government-wide cost/benefit approach” akin to what Republicans then favored and to those regulatory reforms supported by the prior presidential administration.²¹

Those attorneys who reviewed Judge Breyer’s writings for Klein stressed Breyer’s apparent lack of sensitivity to the human stakes of economic regulation and environmental protection requirements. In one memo dated June 8, and sent to Klein on June 9, two reviewers described the “bloodlessness” evidenced by Judge Breyer’s penchant for “analyz[ing] every problem he is considering within a framework [so] bounded by economic theory or rules of logic that the result seems devoid of emotion and even . . . humanity.”²² The reviewers harshly contrasted Breyer’s writings with those of Bruce Babbitt—the candidate favored by environmentalists who was then serving as Secretary of the Interior—which were described as “lucid, exuberant and wide-ranging.”²³ A second memo, prepared on June 7, similarly described Judge Breyer as a “cold fish,” “bring[ing] no passion or insight” and as so “lack[ing] of vigor in his jurisprudence that one suspects he does not have (or refuses to utilize) any innate sense of justice.”²⁴ The memo concluded that Judge Breyer was “certainly a judicial conservative” and “[c]onservatives will be thrilled if Judge Breyer is appointed. . . . Nothing in Judge Breyer’s opinions suggests that he would be a great Supreme Court Justice.”²⁵ “In no way is he a ‘man of the people,’ as some other candidates have been.”²⁶

These concerns had a clear impact. Then-Associate White House Counsel Ron Klain made an explicit reference to the concern in his June 11, 1993, memorandum to White House Counsel that listed the questions that the President could ask Judge Breyer in their interview.²⁷ One question asked Judge Breyer to respond to the claim that his “writings suggest an over-emphasis on economics: putting a cost on lives, for example.”²⁸ Another question more pointedly asked him to “respond to the criticism that his opinions are ‘bloodless.’ ”²⁹ President Clinton’s interview with Judge Breyer that same day did not go well,³⁰ and a few hours later, Klein called Judge Ginsburg to let her know she should be available for a meeting with President Clinton. She met with the President two days later, Sunday, June 13, and the President called her later that same night to offer her the job.³¹

What resurrected Judge Breyer’s prospects a year later and secured President Clinton’s nomination was the President’s desire to avoid a Senate confirmation battle. Clinton’s apparent first choice in 1994, as it had first been in 1993, was Bruce Babbitt.³² Liberals in the Democratic Party strongly endorsed Babbitt, as did environmentalists, in 1994 because of his progressive views and his championing of environmental protection causes both as Governor of Arizona and Interior Secretary.³³ Indeed, environmentalists had a year earlier been so enthusiastic about Babbitt that some had actually opposed his nomination to the Court to replace Justice White because they did not want to lose his leadership at Interior—in retrospect a decision they may well regret.³⁴

But it was that same environmentalist enthusiasm for Babbitt that ended up sinking his possible nomination to replace Justice Blackmun in 1994. When the White House let leak to the news media that the President had settled on Babbitt and would announce his nomination shortly,³⁵ Republican Senate leadership preemptively announced that they would vigorously oppose Babbitt—because of his reputation as an ardent environmentalist.³⁶ Simultaneously, the Republican Senate minority leader, Senator Bob Dole, predicted “smooth sailing” were President Clinton to nominate Judge Breyer instead.³⁷ The President blinked, and Judge Breyer became the nominee. Judge Breyer also overcame Clinton’s earlier doubts, upon meeting him a summer before, that he lacked energy, by literally taking a run with the President along the Capital Mall to establish his physical stamina for the job.³⁸ The White House’s concerns of a year earlier, regarding his lack of humanity and affinity for regulatory reform, apparently disappeared—these qualities had been transformed from a political liability into a political virtue.

Most liberal Democrats in Congress muted their displeasure with the Judge Breyer choice, presumably to avoid breaking publicly with their own President, following twelve years of Republican administrations. But some of the more progressive Democrats and hardcore environmentalists did not shy away from sharply criticizing the nominee, revealing their obvious frustration that the President had let pass a potentially historic opportunity to have an acclaimed environmentalist join the High Court.³⁹

The focus of their criticism was a common theme evident in Judge Breyer’s scholarship, work experience, and judicial opinions in favor of reform of excessively burdensome regulations. These were the same concerns that advisors to the White House had stressed in 1993. As counsel to the Senate Judiciary Committee, on leave from Harvard Law School in the late 1970s, Breyer had worked effectively in a bipartisan fashion with both Democrats and Republicans in support of legislation that deregulated the airline industry. He favored such deregulation on the ground that regulation imposed unnecessary costs on industry and impeded the operation of free market forces that could on their own lead to better products and services for lower prices than burdensome government regulation might achieve.⁴⁰

Indeed, Breyer had so impressed Senate Republicans with his support of regulatory reform that they endorsed President Carter’s nomination of Breyer to serve on the First Circuit even though that nomination occurred on November 13, 1980—a time when a nomination for a life-tenured position should have been dead on arrival in Congress. After all, the date of the nomination was only nine days after Carter had lost the Presidency to Ronald Reagan and the Democrats had lost the Senate to the Republicans. Confirming Breyer to the First Circuit during a congressional lame duck session would accordingly mean the elimination of an important federal appellate court vacancy that would otherwise have been available for a Republican President and Republican Senate to fill a couple months later. Yet, because of Senator Kennedy’s clout and significant Republican leadership support for Breyer, rooted in his work on regulatory reform as a Senate staffer, Breyer was confirmed as a federal appellate judge less than a month later in December 1980.⁴¹

As an appellate judge, moreover, Judge Breyer continued to be a proponent of regulatory reform, including for environmental protection rules. In both his judicial rulings and his extra-judicial writings, Judge Breyer expressed concern about the possible harm caused by irrational environmental regulations with compliance costs that far exceeded their benefits.

One of Judge Breyer’s most prominent opinions for the First Circuit was United States v. Ottati & Goss, in which the court upheld a trial court ruling that had rejected the proposed remedy by the Environmental Protection Agency (“EPA”) to clean up a hazardous waste site on the ground that the public health benefits did not warrant the high cleanup costs.⁴² The ruling was remarkable at the time because it was so unusual for a court not to defer to EPA’s judgment about the extent of cleanup needed to reduce risks from hazardous wastes. Although the force of Judge Breyer’s opinion for the First Circuit was a bit muted because the appellate court was simply affirming the trial court’s ruling against EPA—concluding that “[w]e cannot say that the district court was ‘clearly erroneous’ or unreasonable”⁴³—his opinion seemed to join the lower court in ridiculing EPA’s decision to base its cleanup remedy on the proposition that a child would eat contaminated soil over a sustained period of time.⁴⁴ And, while declining to impose sanctions on EPA, Judge Breyer’s opinion gratuitously took the occasion to “wonder about the government’s priorities in the face of other, apparently more serious, environmental demands for ‘cleanup’ time and effort.”⁴⁵

Judge Breyer, moreover, did far more than just author the opinion. Outside of his judicial role, he trumpeted its policy themes regarding how environmental risks should be regulated. He used the Ottati & Goss case as the basis for his 1992 Oliver Wendell Holmes Lectures at Harvard Law School, which he then published as a book entitled Breaking the Vicious Circle: Toward Effective Risk Regulation the following year.⁴⁶ In that book, Judge Breyer identified why and how government regulation of risks, including environmental risks, had a tendency to require excessive expenditures to reduce the “last ten percent” of risks,⁴⁷ here too referring to the facts of the Ottati & Goss case as an illustrative example.⁴⁸ Judge Breyer more broadly proffered the question whether determining the acceptable level of environmental risk was best answered by a political process vulnerable to accommodating the public’s tendency to overreact to environmental risks. And, answering his own question, Judge Breyer concluded that such public policy questions were better answered by expert, technical agencies removed from the pressure of politics and popular opinion.⁴⁹

What simultaneously made Judge Breyer’s book so popular with the Republican Party and regulated business and so unpopular with Democratic Party progressives and environmentalists was its embrace of the rhetoric of regulatory reform. Regulatory reform had in fact been an express and significant part of the agenda of the administration of President Jimmy Carter and EPA when Judge Breyer first endorsed it in his work at the Senate in 1980.⁵⁰ Regulatory reform then was a more benign political issue, and it enjoyed support on both sides of the political aisle. In March 1978, President Carter issued an executive order, developed in part by EPA leadership, that sought to “reform” the process for developing “significant regulations” in order to eliminate regulations that “impose unnecessary burdens on the economy.”⁵¹ Political appointees at EPA during the Carter administration favored opportunities to employ economic analysis to ensure that the Agency was directing its limited resources to the most serious environmental issues and taking advantage of market incentives to reduce pollution in general.⁵² The Reagan administration announced from the outset that it would similarly champion a regulatory reform agenda that took more account of the costs of environmental protection.⁵³

But by 1993, the term “regulatory reform” had become a highly partisan term, tainted by efforts during three Republican administrations to cut back on environmental protections under the guise of cost-benefit analysis and economic efficiency. Reagan administration officials at the Office of Management and Budget and EPA used the rhetoric of regulatory reform and cost-benefit analysis but in a wholly skewed fashion to justify deregulation based on exaggerated estimates of regulatory costs coupled with underassessments of the benefits of environmental protection.⁵⁴ Environmentalists vehemently opposed those efforts.⁵⁵

Even prominent supporters of President Reagan openly commented at the time that his environmental appointees had so bungled the regulatory reform effort that they had undermined it. The President’s own chair of the Council of Economic Advisors, a stalwart champion of regulatory reform, publicly declared: “We will be lucky if by January 1985, we are back where we were 1981 in terms of the public’s attitude toward” regulatory reform.⁵⁶

That is why Judge Breyer’s promotion of regulatory reform rhetoric in his 1993 book set off alarm bells throughout the environmental community, and to those reviewing his writings for the White House Counsel in 1993, to a degree that would not have happened in the late 1970s during the Carter administration. But, in light of how much the political debate had shifted since 1980 when Breyer was working on deregulation on the Senate Judiciary Committee, his 1993 publication was either politically tone-deaf or deliberately designed to position Judge Breyer for promotion as a justice with bipartisan support. With Judge Breyer, the former is a distinct possibility. However, whatever the actual motivation, the publication of Judge Breyer’s book coincided with the openings of two seats on the Supreme Court in successive years and played a central role in whether he would be nominated and confirmed.

Republicans and the business community became his cheerleaders while environmentalists expressed serious concerns. The latter’s criticism could be scathing. “[I]t is clear that [Judge Breyer] is no fan of health and environmental regulation.”⁵⁷ If he “had been a member of Congress, he would not have supported many of the current health and environmental statutes.”⁵⁸ They accused him of blithely accepting the economic
analysis of right-wing think tanks to belittle the risks addressed by government regulation,⁵⁹ minimizing “the risks posed by toxic chemicals
in the environment,”⁶⁰ “reject[ing] a policy of erring on the side of safety . . . because it leads society to spend too many dollars chasing after what he believes to be trivial risks,”⁶¹ failing to recognize the limits of cost-benefit analysis,⁶² and of “even accept[ing] the highly dubious ‘richer is safer’ argument against stringent regulation of activities that pose health and safety regulations.”⁶³

Consumer advocate Ralph Nader pulled no punches in testifying against Judge Breyer’s confirmation. Nader described Judge Breyer as an “extremist.”⁶⁴ According to Nader, Judge Breyer was “ridden with fantasy” and “insensitive on the ground to the health and safety needs of the American people.”⁶⁵ Judge Breyer, Nader concluded, “appears to seriously question many health and safety laws that he will be expected to interpret impartially as a Justice of the Supreme Court.”⁶⁶

There were, of course, supporters too. The renowned scholar Professor Cass Sunstein, a close professional colleague of Judge Breyer, casebook co-author, and the leading legal scholar in support of the central role of cost-benefit analysis for rational regulation, testified in favor of Judge Breyer’s nomination.⁶⁷ More moderate legal academics contended that Judge Breyer would be a “friend” to environmental law and that, even though “in sheer numbers, his rulings against environmental groups probably exceed his rulings in their favor”—only because they lose most of their cases—the judge “ha[d] shown a sensitivity and appreciation for environmental issues.”⁶⁸

Only one environmental public interest organization affirmatively supported Judge Breyer’s nomination—the Conservation Law Foundation, headquartered in Massachusetts—perhaps because of his favorable ruling in a case they had brought to clean up Boston Harbor, because of geographic allegiance to Judge Breyer, or perhaps even more likely, because of institutional loyalty to his principal political sponsor, Massachusetts Senator Ted Kennedy.⁶⁹ But even that organization’s letter was noticeably understated. It was only two paragraphs long and addressed merely “To Whom It May Concern.”⁷⁰ The most the organization’s director could muster in his opening sentence was that “Stephen Breyer has fashioned a remarkable record on environmental matters that have come before the First Circuit Court of Appeals.”⁷¹ The word “remarkable,” is, of course, itself remarkable for what it does not say in a letter that purports to be an endorsement.

During his own Senate testimony, Judge Breyer plainly sought to assuage concerns by walking back from the deregulatory import of some of his writings. While conservative Republican Senator Strom Thurmond stressed how he “was pleased to learn of [Judge Breyer’s] concerns with excessive regulation,”⁷² Judge Breyer asserted that the “role of economics” was necessarily “much more limited” in application to “health, safety, and the environment . . . because, there, no one would think that economics is going to tell you how [much] you ought to spend helping the life of another person.”⁷³ He also characterized the book as “a plea . . . not to cut back by 1 penny this Nation’s commitment to health, safety, and the environment” but only to “reorganize[e] that commitment” to ensure that money was spent on saving real lives rather than “on the statistical life that might not exist.”⁷⁴

The Senate voted overwhelmingly in favor of Judge Breyer’s confirmation to join the Court.⁷⁵ Eighty-seven senators voted in favor, only nine were opposed, and four did not vote.⁷⁶ And the only votes opposed were a smattering of conservative Senate Republicans.⁷⁷ No liberal Democrat opposition challenged their own President’s nominee notwithstanding any misgivings they might have harbored.78

II. JUSTICE BREYER’S RECORD IN ENVIRONMENTAL LAW CASES BEFORE THE COURT

Justice Breyer’s environmental law record consists of his votes in individual cases and his written opinions in a subset of those cases. During the past twenty-eight years on the Supreme Court, Justice Breyer has written more than five hundred opinions: (1) about two hundred opinions for the Court; (2) approximately one hundred concurring opinions; (3) just shy of two hundred dissenting opinions; and (4) approximately thirty opinions dissenting and concurring in part.⁷⁹

The Justice has written relatively few opinions in environmental cases, mostly for the straightforward reason that the Court does not decide that many environmental cases. Environmental law is, at least numerically, a relatively small part of the Court’s docket so long as one defines “environmental law” more narrowly as I am doing for the purposes of this inquiry.

My narrower approach considers only those cases that arise in a fact pattern in which environmental protection concerns are at stake and those stakes are not wholly incidental to the legal issue raised. That definition sweeps in both cases involving the construction and application of classic environmental laws like the National Environmental Policy Act⁸⁰ as well as those cases involving general cross-cutting legal issues such as the scope of congressional commerce authority in a case where environmental protection is at stake. A broader, and perfectly fair contrary approach would be to consider all cases that raise legal issues that, although not arising in an environmental protection setting in the case then before the Court, are likely to have significant implications in future cases that do.⁸¹ Consistent, however, with the author’s belief that there is a discernible “environmental” dimension to environmental law that is important for judges (and Justices) to consider,⁸² this Article relies on the narrower definition instead.

Based on that narrower definition, I have identified sixty-three “environmental law cases” decided by the Court during Justice Breyer’s tenure to date, in which he participated in sixty-one due to his recusal in two cases in which his brother, also a federal judge, participated in the case in the lower courts. This subset of cases fulfilled two conditions: (1) each case raised legal issues in the environmental protection context; and (2) that context was not wholly incidental to the legal issue being considered by the Court.⁸³ For instance, I excluded from my sample a case like Alaska v. United States,⁸⁴ a 2005 original action case in which the State of Alaska and the United States were disputing ownership over certain submerged lands in Glacier Bay. I also excluded the Court’s recent decision in BP P.L.C. v. Mayor of Baltimore,⁸⁵ concerning the scope of judicial review of a district court ruling not to allow removal of a state court case to federal court. In neither of those cases, or those like them, do the environmental stakes play any non-incidental role in the Court’s resolution of the legal issue to be decided.⁸⁶ But I included cases like Article III standing and regulatory takings cases because, as I have elaborated in previous scholarship,⁸⁷ the environmental dimension of those cases should bear on the application of the relevant legal standards even if, as described below, individual Justices and sometimes a majority of the Justices too often fail to grasp its relevance.

A. Justice Breyer’s Votes

In 2000, I published an article that tried to develop a rough quantitative basis for comparing how individual Justices voted in environmental law cases and for assessing whether certain Justices and the Court as a whole were more or less responsive to the need for environmental protection. The article argued more broadly in support of the thesis that there was a uniquely “environmental” dimension to environmental law relevant to judicial decision making—for instance, how such concerns might provide a proper basis for rethinking what constitutes a “concrete injury” for Article III standing purposes, a “property right” in natural resources for regulatory takings purposes, an “intelligible principle” for nondelegation doctrine purposes, an “economic activity” in Commerce Clause analysis, or the degree of judicial deference owed a federal agency in both technical assessments and statutory interpretations.⁸⁸ While concluding that the Court overall had displayed “apparent apathy or even antipathy towards environmental law,”⁸⁹ I concluded that some individual Justices had shown more sensitivity than others to how environmental protection concerns could be relevant to how the legal issues before the Court should best be decided.⁹⁰

My 2000 analysis relied on a scoring system somewhat analogous to that employed by the League of Conservation Voters in scoring members of Congress on environmental matters,⁹¹ but now applied to each Justice. A Justice was awarded one point for each outcome that I classified as “pro-environmental protection,” resulting in each Justice receiving an “EP score,” based on the percentage of pro-environmental votes the Justice cast out of those in the sample of environmental cases in which that Justice participated. Although nominally quantitative in its ultimate yield, I freely admitted at the time the “inevitabl[e] arbitrariness and sometimes downright foolishness in attempting any such ‘pro’ or ‘anti’ policy assignments to Supreme Court rulings, especially assignments that purport to be binary in nature.” ⁹²

The problems are obvious. First, I defined as “pro-environmental” the legal position favored by environmentalists in each case. An environmental advocate, however, trying to win a particular case may in fact be making an argument that leads to a win in that case but to losses in other future environmental cases. For instance, the advocate may be arguing against deferring to an expert agency’s judgment because, in the case before the Justices, an argument against such deference may be needed to secure a win. But if the advocate prevails in that case, the precedent established may cause environmentalists in the future to lose far more than they win if it turns out that judicial deference to agency expertise is more advantageous to environmental protection concerns over the longer term.

Second, the legal position favored by environmentalists in a particular case may be very weak on the merits and warrant rejection. There is no necessary correlation between a legal argument favoring environmental protection and its being a meritorious argument. Not every argument in favor of environmental protection is necessarily a strong legal argument that a judge or Justice should accept.

Indeed, there is good reason to worry that the Justices tend to hear cases in which the legal arguments favoring environmental protection are disproportionately weaker. As I have detailed elsewhere,⁹³ the Court’s decision-making at the jurisdictional stage for most of the past fifty years that define the modern environmental law era has been skewed against environmentalists. The vast majority of the cases in which the Court has granted review are cases in which the position favored by environmentalists prevailed in the courts below. The Court has taken relatively few cases in which the environmentalists lost and then sought the Court’s review, especially those in which the federal government was the prevailing party.⁹⁴ The Court has, in effect, cherry-picked the cases in which environmentalists may have won based on potentially weak arguments while not being similarly ready to review cases in which business interests have won on weak grounds.⁹⁵

I accordingly warned in 2000 against drawing any conclusions based on EP scores apart from those at the two extremes—either very high or very low. Because of the obvious limits to such scoring, only such extreme discrepancies in scores might offer a fair basis for positing that the Justice in question was more or less “likely to rule in favor of or against an environmentally protective outcome because of that outcome’s environmental dimension.”⁹⁶ Those same limits are also a reason not to be surprised when even the highest EP score is not that high—the potential result of a skewed merits docket.

In 2000, Justice Breyer had served on the Court for only six years, and his EP score back then of 66.6 after six years of service was in fact one of the highest of those then on the Court. It far exceeded Justice Scalia’s strikingly low score of 13.8 and the scores of Justices Thomas (20.0) and Kennedy (25.9). But, of course, his score came nowhere close to Justice Douglas’s, who retired from the Court in 1975. An environmentalist hero and former member of the Board of the Sierra Club,⁹⁷ Douglas boasted of a score of 100—apparently no matter the legal issue presented, and perhaps even the relative strength of the competing arguments, Douglas always voted in favor of the outcome supported by environmentalists. Justice Breyer’s EP score of 66.6 was also higher than Justices Brennan (58.3), Marshall 61.3), Blackmun (58.3), Stevens (50.6), Souter (57.1), and Ginsburg (63.6) but not to any significant extent, especially because both Justice Breyer and Ginsburg had both served for far fewer years than Justices Brennan, Marshall, and Stevens and therefore reflected very different cases too. Justices Brennan, Marshall, and Stevens were accordingly being measured based on cases in which it might have been harder on the merits to vote for the side favored by environmentalists.⁹⁸

Two decades later, the number of environmental cases in which Justice Breyer has participated has naturally risen, and interestingly his new EP score (62.3) is essentially the same as before and as the two other Justices with high scores—Sotomayor (64) and Kagan (68). Yet Justice Breyer’s score is sufficiently higher than former Justices Scalia (23.4) and Kennedy (36.0) and current members such as Chief Justice Roberts (20), Justice Thomas (20.6), and Justice Alito (10.5) for their overlapping cases since 1994 to suggest significant differences in the application of law to environmental protection. Justice Alito’s score of 10.5 is astoundingly low. The only cases in which Alito voted on the side supported by environmentalists were in four cases that the Court decided unanimously in their favor.⁹⁹

On the other end of the spectrum, although former Justices Stevens, Souter, and Ginsburg’s scores in 2000 were a tad lower than Justice Breyer’s at that time, all of their scores became higher—Stevens (78.4), Souter (80.6), and Ginsburg (71.9)—than Justice Breyer’s for the cases on which they overlapped while serving on the Court. The gap between Justice Breyer and both Justice Stevens and Souter is not especially significant but arguably enough to suggest a potential difference in their respective willingness to consider how the environmental protection dimension of the case might be relevant to their resolution of the legal issue before the Court and, for Justices Stevens and Souter, in favor of a more protective outcome.¹⁰⁰

Finally, not all environmental cases are, of course, equally important, and the individual votes are more significant in some cases than in others. Some cases are far more significant in terms of their import for environmental protection. Whether EPA has authority to regulate greenhouse gas emissions under the Clean Air Act¹⁰¹ is clearly more important than whether a certain river in Alaska is “public land” for the purposes of the Alaska National Interest Lands Conservation Act.¹⁰² And, because in some of those more important cases the vote was also closely divided, the vote of any one Justice in the majority is outcome-determinative. In that distinct respect, the individual vote of any single Justice in a five-Justice majority is more significant.

Based on this criteria, Justice Breyer’s votes in several cases were especially significant, including Alaska Department of Environmental Conservation v. EPA,¹⁰³ upholding the EPA’s authority to override Alaska’s issuance of a permit under the Clean Air Act; Kelo v. City of New London,¹⁰⁴ sustaining a local government’s exercise of its eminent domain power to condemn residential property to promote commercial development; Massachusetts v. EPA,¹⁰⁵ both upholding environmental-plaintiff standing and rejecting the EPA’s claim that it lacked authority to regulate greenhouse gas emissions under the Clean Air Act; and Murr v. Wisconsin,¹⁰⁶ rejecting a regulatory takings claim against a local environmental restriction on residential development. Those are all, moreover, cases environmentalists won.

By contrast, in only one of the sixty-one environmental law cases in which Justice Breyer participated and environmentalists lost did he provide the critical vote against their position.¹⁰⁷ Justice Breyer voted against the legal outcome favored by environmentalists on twenty-three occasions. In eleven of those cases, the Court ruled unanimously and in three others the vote was eight to one against the environmentalist position. Justice Breyer supplied the sixth and seventh vote for the majority in six cases and dissented in the last two. The only case in which Justice Breyer’s vote was outcome-determinative in a case that environmentalists lost during the past twenty-eight years was the Court’s ruling in June 2021 that the condemnation authority provided by the federal Natural Gas Act to recipients of a Federal Energy Regulatory Commission certificate of public convenience and necessity extended to the right to acquire state-owned property. Interestingly, the five-Justice majority was an unusual one, consisting of Justice Breyer, Chief Justice Roberts, who authored the Court’s opinion, and Justices Alito, Sotomayor, and Kavanaugh.¹⁰⁸

B. Justice Breyer’s Opinions in Environmental Cases

Justice Breyer has written opinions in nineteen environmental cases, which is a disproportionately large number of the sixty-one environmental cases in which he has participated. He has written three majority opinions for the Court,¹⁰⁹ six concurring opinions,¹¹⁰ and ten opinions either dissenting in full or in part.¹¹¹ Although Justice Breyer’s majority opinions are clearly the most significant because they alone announce binding legal precedent, the concurring and dissenting opinions may well be the most personally revealing because they largely resulted from the Justice’s own decision to write an opinion expressing his views rather than, as with majority opinions, an assignment from the senior Justice in the majority to write the official opinion of the Court.¹¹² The majority opinion, however, nonetheless can very much reflect the priorities and values of its author, especially whether the Justice chooses to write the opinion narrowly and tries to attract as many votes as possible or instead drafts the opinion in as sweeping a way as possible consistent with maintaining the bare minimum of five votes required for a majority.¹¹³

1. Justice Breyer’s Majority Opinions

Justice Breyer wrote the majority opinions in Ohio Forestry Association v. Sierra Club,¹¹⁴ Public Lands Council v. Babbitt,¹¹⁵ and County of Maui v. Hawaii Wildlife Fund.¹¹⁶ None is a headliner. Nor is that at all surprising, given that Justice Breyer remained the most junior Justice for his first twelve years on the Court,¹¹⁷ which does not lend itself to especially high-profile opinion assignments from his more senior colleagues. That status is also likely why it was not until 2019 that Chief Justice Roberts assigned Justice Breyer a moderately more important environmental case, County of Maui, though still far short of a blockbuster.

All three Court opinions by Justice Breyer evidence his essential pragmatism, a catchword that the White House promoted when he was nominated and that was accordingly captured in the first New York Times headline announcing his nomination.¹¹⁸ His pragmatism was similarly the theme of favorable testimony provided before Congress by one of his leading academic supporters.¹¹⁹

Ohio Forestry is a classic opinion assigned to a junior Justice. Indeed, it might well be classified as one of the “dogs” of the docket that Term, a term of art the Justices use informally in referring to the kind of case no Justice has any particular interest in writing.¹²⁰ At issue was whether the Sierra Club’s challenge to the Forest Service’s plan for managing the Wayne National Forest in Ohio was justiciable.¹²¹ The Court ruled unanimously that the lawsuit was not ripe for review on the ground that the plan did not itself create any adverse effects of a “strictly legal kind” because it did not purport to authorize any particular action within the forest.¹²² It would be far more sensible, Justice Breyer’s opinion for the Court reasoned, to wait until “the Plan is implemented” which would allow the reviewing court to benefit from “further factual development” of the issues.¹²³

Justice Breyer’s opinion for the Court evidences significant sensitivity to the administrative preferences of the federal agency and to the resources of the federal judiciary. The ruling that the case was not ripe emphasizes how allowing the Sierra Club’s lawsuit to proceed would have “require[d] time-consuming judicial consideration of the details of an elaborate, technically based plan . . . without benefit of the focus that a particular logging proposal could provide.”¹²⁴ There is obvious force to the Court’s concern. But the opinion evidences no comparable consideration of the litigation resource challenges that a public interest organization like Sierra Club faces in trying to oversee a series of site-specific logging proposals over time. What the Court posits as the better approach may well be better in theory, but the challenges of such constant site-specific oversight may in fact be preclusive as a practical matter of any meaningful review of future logging decisions in a national forest.

To that same end, the Court declined to consider a series of other ways that the Forest Plan could immediately harm the Sierra Club and its members. The Court reasoned that Sierra Club’s argument “suffer[ed] from the legally fatal problem that it ma[de] its first appearance [before the] Court in the briefs on the merits.”¹²⁵ That is a fair point, and the Court is not at all out of bounds in strictly applying administrative law exhaustion principles in denying consideration of Sierra Club’s argument. Yet, here too, the ruling ignores the practical limits of a resource-strapped public interest organization maintaining lawsuits in an effort to ensure other unrepresented interests are given voice. An organization like the Sierra Club is hard-pressed to monitor all the site-specific decisions that Forest Service personnel are making on a daily basis throughout a national forest. For this reason, the Club’s only practical recourse may be to persuade a court, as they tried unsuccessfully to accomplish in the Ohio Forestry case, to establish some guidelines for the exercise of Forest Service personnel discretion in the future. And the Court’s lack of sensitivity to that practical limitation contrasts unfavorably with the many ways that the Justices, in my experience both litigating for and against the United States, routinely allow the federal government to raise new arguments and bring to the Court’s attention new facts not considered below, because of the Justices’ awareness of the practical limits in the government’s ability to oversee all of its lower court litigation.

The Court’s providing such practical flexibility to the United States makes great sense. Otherwise, the Court would be making significant pronouncements of law affecting the country based on incomplete arguments and flawed factual assumptions. And, given the thousands of cases the federal government handles in the lower courts, it is exceedingly limited in its ability to ensure that all the best arguments are made in the timeliest manner. The Court, however, could demonstrate some sensitivity to the practical needs of environmental citizen suit litigants too. Justice Breyer’s opinion for the unanimous Court in Ohio Forestry evidences no such awareness of the problem.

Public Lands Council v. Babbitt¹²⁶ was a logical sequel to Ohio Forestry. Again, Chief Justice William Rehnquist assigned the junior Justice Breyer the task of writing an opinion for a unanimous Court in another public lands administrative law case that was likely of little, if any, interest to the Justices. The major difference was that, rather than environmental plaintiffs challenging the Forest Service’s management of a national forest, it was commercial livestock interests challenging the Bureau of Land Management’s administration of grazing permits on public lands.

The basic result was the same. The Court concluded that the federal agency’s regulations governing the issuance of permits were valid under the relevant statutory language and that the commercial plaintiffs’ concerns that they might be harmed in how those regulations were applied in the future were largely premature.¹²⁷ The plaintiffs should instead wait, not unlike the environmental plaintiffs in Ohio Forestry, until the federal agency actually applied the regulations in a specific factual context that harmed them.¹²⁸ While the reasoning is similar is tone, there is still a significant practical difference between the two cases because the commercial party subject to a grazing regulation will naturally always know as soon as such harm happens in the future, which is not true for an environmental organization striving to learn of any possible site-specific decision to allow logging or other potentially harmful activity within a very large area of land such as a national forest.¹²⁹

It took twenty more years for Justice Breyer to write a third opinion for the Court in an environmental case, County of Maui v. Hawaii Wildlife Fund.¹³⁰ And, reflecting his more senior status by that time, the case is far more significant than either Ohio Forestry or Public Lands Council. It is not a mere unanimous toss-off. County of Maui instead presents a rather thorny and important question of statutory interpretation under the Clean Water Act—the type of question that nicely lends itself to Justice Breyer’s proclivity to pragmatic solutions.

The precise legal issue raised in County of Maui concerns how direct or indirect an addition of pollutants into navigable waters must be to constitute a “discharge” of a pollutant into navigable waters requiring a pollution permit under the Clean Water Act.¹³¹ In County of Maui itself, the municipal sewage treatment facility seeking to avoid the permit requirement injected contaminated water into a well located about a half mile from the Pacific Ocean—but the discharge naturally reached the Pacific within a few months through the ground water.¹³² The facility contended that no permit was required unless the pollutants were directly introduced into a navigable water body like the Pacific, meaning that the pollution was exempted from the Clean Water Act permit requirement if it travelled even just a few inches through groundwater or over the surface land before reaching the ocean.¹³³ The EPA agreed that any travel through groundwater placed the addition of pollutants outside the Clean Water Act but contrasted that any travel over surface land would depend on a more contextual analysis of directness.¹³⁴

In rejecting both those limits, Justice Breyer’s opinion for a six-Justice majority held that a Clean Water Act permit was required “when there is a discharge from a point source directly into navigable waters or when there is the functional equivalent of a direct discharge.”¹³⁵ The Court’s ruling displays Justice Breyer’s willingness to embrace a nuanced and accordingly ultimately vague legal test—such as “functional equivalence”—when he believes clearer legal rules fail to account for all the factors that should be relevant in solving a problem. The Court’s “functional equivalence” test rejects any hard-and-fast lines for when an addition of a pollutant is too indirect in favor of a multi-factor inquiry. The opinion candidly acknowledges that there are “too many potentially relevant factors applicable to factually different cases for this Court to use more specific language,”¹³⁶ while both highlighting seven relevant factors and underscoring that “time and distance will be the most important factors in most cases, but not necessarily every case.”¹³⁷ Interestingly, the Chief Justice expressed confusion at oral argument about what Justice Breyer’s “functional equivalence” test meant, when Justice Breyer then raised the possibility of such a test,¹³⁸ but nonetheless subsequently chose Justice Breyer to write the Court’s opinion.

The County of Maui ruling is significant for environmental law. Although the Court nominally vacated the lower court’s judgment favorable to the environmental plaintiffs and remanded the case to that court for reconsideration in light of its ruling, the functional equivalent test amounted to a clear win for the plaintiffs. They will do well under that test, as will environmental plaintiffs in a host of cases across the country who have brought Clean Water Act citizen suits against sources that discharge to navigable waters in a proximate but still indirect water through groundwater and over land. For instance, in the immediate aftermath of the County of Maui ruling, environmentalists targeted leakage of coal ash into navigable water bodies from power plants.¹³⁹ Although County of Maui does not rise to the front page headline status of a case like Friends of the Earth v. Laidlaw,¹⁴⁰ expanding Article III standing for environmental citizen suit plaintiffs, or Massachusetts v EPA,¹⁴¹ establishing the EPA’s authority to regulate greenhouse gases under the Clean Air Act, the case will make a big difference in application to lots of factual circumstances and represents an increasingly rare environmentalist victory as the Court’s own bench becomes more conservative.

2. Justice Breyer’s Concurring Opinions

As described above, Justice Breyer’s separate opinions are even more revealing because, unlike majority opinions that are assigned by the most senior Justice in the majority, one can be more confident that the Justice writing separately is expressing their own views. Justice Breyer wrote six concurring opinions, four of which both address significant legal issues and relate directly to the concerns raised by environmentalists when Justice Breyer was nominated. In each, Justice Breyer expressed views that promoted the very regulatory reform themes antithetical to many environmentalists. His doing so each time in a concurring opinion makes clear that these themes remained very important to him, just as environmentalists had feared at the time of his nomination.

First, in General Electric v. Joiner,¹⁴² decided in 1997, Justice Breyer wrote separately while also joining the majority ruling that upheld the trial court’s decision to exclude from jury consideration expert testimony proffered to demonstrate a link between the plaintiffs’ exposure to polychlorinated biphenyls (“PCBs”) and small-cell lung cancer.¹⁴³ In his separate opinion, Justice Breyer stressed that “modern life, including good health and economic well-being, depends upon the use of artificial or manufactured substances,”¹⁴⁴ presumably alluding to a chemical like PCBs, and the need for judges to use their gatekeeping authority to ensure that tort liability did not effectively “destroy” the “wrong” chemicals.¹⁴⁵ Such a concern with the potential for excessive tort liability to harm businesses was in the late 1990s a major talking point for business leaders seeking to curb large tort liability awards.¹⁴⁶

In Whitman v. American Trucking Associations,¹⁴⁷ decided in 2001, Justice Breyer’s concurring opinion was the one blight on an otherwise glorious day for environmentalists. In Whitman, Justice Scalia authored a unanimous opinion for the Court that repudiated what had been a major attack on the constitutionality of a central part of the Clean Air Act. The Court rejected the D.C. Circuit’s remarkable ruling that the Act violated the nondelegation doctrine by requiring the EPA to promulgate national ambient air quality standards requisite to protect public health without basing its determination of those standards on an intelligible principle such as cost-benefit analysis.¹⁴⁸

Scalia’s opinion not only rejected the notion that cost-benefit analysis was required to satisfy the nondelegation doctrine’s requirement of an intelligible principle, but it further ruled that the relevant provisions of the Clean Air Act barred the EPA from considering economic costs at all in promulgating the national standards.¹⁴⁹ It was a sweeping win for both environmentalists and the EPA. But what made their victory even sweeter still was that it was unanimous and written by Justice Scalia, the Court’s leading conservative.

Justice Breyer’s separate concurring opinion fell far short of dampening the victor’s spirits that day, but his words were nonetheless chillingly expressive of some of the worst fears of environmentalists upon his confirmation. He disputed Scalia’s powerful statement that the EPA could consider compliance costs only if Congress’s textual commitment to such consideration was “clear.”¹⁵⁰ According to Justice Breyer, “other things being equal, we should read silences or ambiguities in the language of regulatory statutes as permitting [rather than] forbidding” regulatory agencies from adopting “rational regulation” that considered a proposed regulation’s adverse economic effects.¹⁵¹

Even more telling, Justice Breyer conflated economic costs with public health, just as industry had long been arguing should be done. According to Justice Breyer, because an overly protective environmental protection requirement that returned society to the “Stone Age” would clearly not be “requisite to protect the public health,” “the EPA, in setting standards that ‘protect the public health’ with ‘an adequate margin of safety’” should be deemed to be able to weigh compliance costs against environmental benefits at least to guard against disproportionately high costs for only trivial benefits.¹⁵²

For environmentalists, Justice Breyer’s language sounded unsettlingly similar to the business community’s claims that a healthy society was a wealthy society and environmental protection laws that reduced business profitability were accordingly undermining rather than promoting public health.¹⁵³ Had Justice Breyer authored his concurring opinion in any context other than a concurring opinion with no legal import and when environmentalists were otherwise enjoying an enormous victory, it might have garnered far more attention and concern. But, on a day of widespread relief and celebration, few paid attention to Justice Breyer’s concurrence.

Justice Breyer’s concurrence in Bates v. Dow Agriculture Sciences LLC,¹⁵⁴ decided in 2004, strikes a similar concern about the adverse impacts of excessive government regulation. In Bates, however, the issue arose in the context of a federal preemption case, in which a pesticide manufacturer was arguing that federal pesticide regulation preempted state common law tort liability.¹⁵⁵ The majority opinion, which Justice Breyer joined, rejected the manufacturer’s more sweeping preemption theories, concluded that some state tort law liability might not be preempted, and remanded the case back to the lower courts for further proceedings.¹⁵⁶ Justice Breyer wrote separately to emphasize that the EPA, the federal agency charged with administering the federal pesticide statute at issue, enjoyed authority to promulgate regulations that effectively preempted state tort liability to avoid “a counter-productive ‘crazy-quilt of anti-misbranding requirements.’ ”¹⁵⁷

Finally, similarly sensitive to his perception of excessive environmental regulations was Justice Breyer’s separate concurrence in Coeur Alaska, Inc. v. Southeast Conservation Council¹⁵⁸ in 2009, in which the Justice provided the more conservative wing of the Court with its sixth vote in a major loss to environmentalists. At issue in Coeur Alaska was in effect whether a gold mine that was discharging toxic slurry into a lake three miles away could avoid having to comply with section 402 of the Clean Water Act, which would likely have barred the activity, by characterizing their toxic slurry as “fill,” thereby triggering section 404 of the Act, which separately and less restrictively regulates the addition of fill into navigable waters.¹⁵⁹ The gold mine had placed enormous volumes of toxic slurry into the lake, which was 51 feet deep, 800 feet wide, and 2,000 feet long.¹⁶⁰ And the EPA freely acknowledged that the slurry would kill all of the lake’s fish and nearly all of its aquatic life.¹⁶¹

To the environmental plaintiffs and the Ninth Circuit in its lower court ruling, the gold mine’s claim that it was “fill” rather than pollutant seemed like a blatant end run around the Water Act’s section 402 limitations on the addition of pollutants into navigable waters. But, relying on the EPA’s agreement that section 404 rather than section 402 applied, the Court ruled in industry’s favor.¹⁶² Justice Breyer agreed, declining to join Justice Ginsburg’s dissent, which both Justices Stevens and Souter joined. Exhibiting the same preference for deferring to expert technical agencies promoted by his 1993 book, Justice Breyer explained the reasons why he joined the majority: “I cannot say whether the EPA’s compromise represents the best overall environmental result; but I do believe it amounts to the kind of detailed decision that the statutes delegate authority to the EPA, not the courts, to make (subject to the bounds of reasonableness).”¹⁶³

The contrast between Justice Breyer’s willingness to defer to the EPA, notwithstanding the extreme results, and Justice Ginsburg’s dissent for herself and Justices Stevens and Souter was stark:

The Court’s reading . . . strains credulity. A discharge of a pollutant, otherwise prohibited by firm statutory command, becomes lawful if it contains sufficient solid material to raise the bottom of a water body, transformed into a waste disposal facility. Whole categories of regulated industries can thereby gain immunity from a variety of pollution-control standards.¹⁶⁴

Justice Ginsburg, unlike Justice Breyer, was not willing to assume that the EPA would ensure this loophole was not abused in future applications, especially given the dissent’s view that it had been abused in the facts of the case then before the Court.¹⁶⁵

3. Justice Breyer’s Dissenting Opinions in Part or in Full

Justice Breyer wrote separate opinions that dissented either in part or in full on ten occasions. In some, he concurred in part or in full with conservative majorities, and in others he dissented in part from liberal majorities. And on a few occasions, he dissented in full. The latter category tended to be those instances when Justice Breyer expressed views wholly favorable to the legal arguments of environmentalists.

Two of the cases involved tort liability. In Norfolk & Western Railway v. Ayers,¹⁶⁶ decided in 2003, Justice Breyer dissented from that part of the Court’s ruling that allowed tort plaintiffs who had been exposed to asbestos fibers and were suffering from asbestosis to recover for damages from their related reasonable fear of cancer.¹⁶⁷ Justice Breyer acknowledged that the legal issue was “a close and difficult one.”¹⁶⁸ But he dissented in part from Justice Ginsburg’s majority opinion because he was worried both about the “impossibility of knowing an appropriate compensation” for such fear¹⁶⁹ and that compensating victims for their fear might leave too little money remaining later on for victims who ultimately suffered from cancer.¹⁷⁰ On the other hand, in Exxon Shipping Co. v. Baker,¹⁷¹ decided in 2008, Justice Breyer dissented from a majority ruling limiting punitive damages from the Exxon Valdez Alaska oil spill based on his view that the punitive damages awarded the oil spill victims in that case need not be reduced.¹⁷²

Unlike in Norfolk and Exxon Shipping, in Winter v. Natural Resources Defense Council, Inc.,¹⁷³ Entergy v. Riverkeeper,¹⁷⁴ and Utility Air Regulatory Group v. EPA,¹⁷⁵ it was Justice Breyer’s partial concurrences with a conservative majority rather than his dissent from a liberal majority that were telling. In each, Justice Breyer’s separate opinion reflected his pragmatism and general desire to provide federal expert agencies with flexibility absent excessive judicial second-guessing.

In Winter, Justice Breyer declined to join Ginsburg’s dissent from the majority ruling overturning a lower court injunction of U.S. Navy exercises that the environmental plaintiffs alleged were harming marine mammals.¹⁷⁶ Justice Breyer concurred in part with the conservative Justices who made up a majority and concluded that the plaintiffs’ evidentiary support was too “weak or uncertain” to justify the “seriousness of the harm” that the Navy maintained the injunction would do “to the Navy’s ability to maintain an adequate national defense.”¹⁷⁷

In Entergy, Justice Breyer returned most explicitly to his argument, reflected in his 1993 book and earlier concurring opinion in American Trucking, that cost-benefit analysis is essential in setting rational environmental protection standards. Justice Stevens, joined by Justices Souter and Ginsburg, dissented from the majority ruling that the Clean Water Act permitted the EPA to engage in cost-benefit analysis in deciding the extent to which a power plant’s cooling water intake structure must minimize its adverse environmental impact.¹⁷⁸ Justice Breyer, however, agreed that such analysis was permissible, arguing that “an absolute prohibition would bring about irrational results.”¹⁷⁹ While suggesting some limits on how demanding such a cost-benefit analysis could be, he also cautioned that “in an age of limited resources available to deal with grave environmental problems, . . . too much wasteful expenditures devoted to one problem may well mean considerably fewer resources available to deal effectively with other (perhaps more serious) problems.”¹⁸⁰

In Utility Air Regulatory Group, Justice Breyer again concurred in part with a conservative majority but this time to a very different policy end. The majority opinion, authored by Justice Scalia, ruled that the term “any pollutant” under the Clean Air Act did not extend to greenhouse gas pollutants as applied to one significant part of the Act.¹⁸¹ Justice Breyer proffered a different approach that, like the majority, avoided the EPA’s being compelled to regulate sources that the EPA agreed would be administratively impractical, but by interpreting instead the term “any source” in a manner that would ultimately provide the EPA more discretionary authority to choose how best to regulate greenhouse gas emissions. As described by Justice Breyer, “[t]he Court’s decision to read greenhouse gases out of the [Prevention of Significant Deterioration] program drain[ed] the Act of its flexibility.”¹⁸² And Justice Breyer’s preferred approach “le[ft] the EPA with the sort of discretion as to interstitial matters that Congress likely intended it to retain.”¹⁸³

On the other hand, the Justice’s practical approach prompted him to dissent in full from the Court’s ruling in U.S. Fish & Wildlife Service v. Sierra Club,¹⁸⁴ decided in March 2021, in favor of a federal agency’s decision not to release a document to environmental plaintiffs under the Freedom of Information Act.¹⁸⁵ That Act requires agencies to release to the public final agency decision-making documents unless they are deliberative in nature, reflecting Congress tempering its desire for public disclosure with a competing desire not to unduly chill those candid internal exchanges of ideas that might not occur if the participants knew all their thinking would later be made part of the public record.¹⁸⁶

Pursuant to the Endangered Species Act, the Interior Department’s Fish & Wildlife Service and the Commerce Department’s National Marine Fisheries Service provide formal “biological opinions” to any federal agency whose proposed action may “adversely affect” an endangered or threatened species or its critical habitat.¹⁸⁷ In U.S. Fish & Wildlife Service, the two Services were preparing biological opinions on a proposed rule by the EPA under the Clean Water Act to regulate power plant cooling water intake structures because of the potentially adverse impact of those structures on aquatic wildlife.¹⁸⁸

Had the Services provided the EPA with a final biological opinion, the parties would not have disputed that such a final opinion would have been subject to public disclosure.¹⁸⁹ In the case, however, the two Services never submitted a final biological opinion because the EPA ultimately rescinded its initially proposed rule after receiving an early draft of the Services’ biological opinion that had not yet been formally approved by either Service for submission.¹⁹⁰ In challenging the EPA’s final cooling water intake structure, Sierra Club sought a copy of the informal draft biological opinions on the original proposed rule, which it hoped to use to attack the final rule.¹⁹¹

The majority easily concluded, based on FOIA’s text, that such draft biological opinions—especially ones never approved by the relevant officials of the two services—need not be disclosed because they lacked any formal legal status within the ESA: “The deliberative process privilege protects the draft biological opinions at issue here because they reflect a preliminary view—not a final decision—about the likely effect of the EPA’s proposed rule on endangered species.”¹⁹² Such preliminary assessments were, the Court concluded, “both predecisional and deliberative.”¹⁹³

In dissent, Justice Breyer naturally took a more practical approach, looking not at the formal name of the relevant document, but its function in the agency decision-making process. Because, Justice Breyer reasoned, “[t]he function of a Draft Biological Opinion finding jeopardy [of an endangered species] . . . is much the same as that of a Final Biological Opinion” and triggers the same process within EPA, the same reasons that justify public release of the final biological opinion apply with equal force to the draft.¹⁹⁴ However, because it was not clear whether the biological opinions at issue in the record were “drafts” or merely “Drafts of Draft Biological Opinion,” because they had never been approved by all relevant officials in the two Services, Justice Breyer contended the case should be remanded back to the court of appeals to decide their status.¹⁹⁵

In short, in some instances Justice Breyer’s lack of commitment to formalism supports policy ends favored by environmentalists, as in U.S. Fish & Wildlife Service. But in other instances, as in Coeur Alaska, he frustrates environmentalists by allowing the government to avoid what seems, on the face of the relevant statutory language, to be a clear transgression of congressional purpose.

However, Justice Breyer’s support for the constitutionality of environmental restrictions has been unqualified in Fifth Amendment takings cases. He has participated in ten Fifth Amendment takings cases while on the Court. And he voted against the regulatory takings claim in nine of those cases¹⁹⁶ and against a per se physical takings claim in the tenth case.¹⁹⁷ In one of those regulatory cases, Palazzolo v. Rhode Island,¹⁹⁸ he filed a separate dissent to underscore his agreement both with Justice Ginsburg that the case was not ripe and with Justice O’Connor that it was relevant to regulatory takings analysis whether the landowner was challenging a land use restriction that existed at the time of their purchase of the property.¹⁹⁹ And in Cedar Point Nursery v. Hassid,²⁰⁰ Justice Breyer filed a dissenting opinion on the ground that the majority erred by analyzing the state regulation of land use as a per se physical taking rather than as a possible regulatory taking.²⁰¹

Finally, Justice Breyer also earned high marks from environmentalists for his support of environmental plaintiff Article III standing. Like regulatory takings, Article III standing has been a persistent issue in environmental law since the 1970s.²⁰² Justice Breyer voted in favor of environmental plaintiffs in the two most important standing cases during his tenure on the Court, Friends of the Earth v. Laidlaw²⁰³ and Massachusetts v. EPA,²⁰⁴ and he authored the opinion for himself and three other Justices dissenting from the Court’s ruling against the plaintiffs’ standing in Summers v. Earth Island Institute²⁰⁵ in 2009.²⁰⁶ Justice Breyer took direct issue with the majority’s ruling that the environmental plaintiffs had failed to demonstrate a concrete injury in their lawsuit challenging the U.S. Forest Service’s salvage sale of timber from a national forest.²⁰⁷

III. ASSESSING JUSTICE BREYER’S ENVIRONMENTAL LAW LEGACY: FRIEND OR FOE OF ENVIRONMENTAL PROTECTION?

Justice Breyer is likely the only Justice ever chosen because of his perceived views on environmental law. But, ironically not because he was viewed as an ardent environmentalist. Just the opposite. He was thought to be a Justice who would instead be more sensitive to business than to environmental protection concerns.

That is both why Breyer failed to secure the nomination to the Court in 1993 and environmentalists opposed his selection in 1994—strongly favoring Secretary of the Interior Bruce Babbitt. And it is also why Republican Senate Leadership, including the Senate Minority Leader Bob Dole and the Judiciary Committee’s Ranking Minority Member Orrin Hatch, informed the White House that they would fight Babbitt’s nomination and promised a smooth confirmation process for Judge Breyer. As described at this Article’s outset, President Clinton chose not to fight, notwithstanding environmentalists’ warnings that Judge Breyer would be bad for environmental law and even “hazardous to our health.”²⁰⁸

So, which has Justice Breyer turned out to be—friend or foe? The answer seems relatively clear: friendly, if still shy of an unqualified friend. As reflected in a rough sense in his EP score, especially compared to those of his colleagues on the bench, Justice Breyer has voted in favor of results supported by environmentalists far more than most of the other Justices on the Court. And, in almost all of the most important environmental cases of the past twenty-eight years, he was a reliable vote joining the majority in the big cases environmentalists won—often providing the critical fifth vote—and no less a reliable vote in dissent with liberal justices sounding the alarm in the big cases environmentalists lost—as he did in West Virginia v. EPA, the very last environmental case decided by the Court when Justice Breyer was on the bench.²⁰⁹ These cases include major cases decided under framework environmental laws like the Clean Air and Clean Water Acts, as well as those involving major issues of constitutional law, such as Article III standing, congressional Commerce Clause authority, and regulatory takings. Justice Breyer has been a reliable, forceful vote for environmental protection in the biggest cases that mattered the most.

Environmentalist concerns about Justice Breyer’s support for regulatory reform proved overblown in application, but not because they were wrong about his views on the central role that cost-benefit analysis should play in setting environmental standards and his willingness to believe that such standards are unduly protective. They weren’t incorrect. Especially in his concurring opinions and in a scattering of his votes, Justice Breyer made clear, just as they feared, his belief in the essential role of cost-benefit analysis as well as his receptivity to concerns that environmental protection requirements may be so exceedingly expensive as to undermine, rather than promote, public health.²¹⁰

However, in no case did Justice Breyer’s distinct views make a difference. He never once provided the critical “fifth vote” in any case in which he expressed those policy preferences for cost-benefit analysis.²¹¹ And his concurring voice was of no legal effect at all. Of course, had the makeup of the Court when those cases were decided been tilted slightly more to the left, Justice Breyer’s vote might well have made a critical difference, just as environmentalists had worried it would. But that concern was never realized in almost three decades.

Justice Breyer has also proved far less dogmatic in his views than assumed by his detractors at the time of his nomination. While supporting EPA’s authority to use cost-benefit analysis in his separate concurring opinion in Entergy, Justice Breyer agreed with the environmental respondents that Congress had intentionally curbed EPA’s ability to rely on cost-benefit analysis in the Clean Water Act.²¹² Where Justice Breyer departed from the environmental respondents was his view that EPA nonetheless was permitted to take such analysis into account so long as the agency did so in a very limited way: to guard against costs wholly disproportionate to environmental benefits—a far more modest invocation of cost-benefit analysis than that sought by industry.²¹³ Justice Breyer also later fully joined Justice Kagan’s forceful dissent in Michigan v. EPA,²¹⁴ which criticized the majority for concluding that EPA was required to consider potential compliance costs in determining whether regulation of toxic mercury emissions from power plants was “appropriate.” Justice Breyer agreed with the other Michigan dissenters that Congress had instead instructed EPA to base its threshold determination of the appropriateness of emissions controls only on the extent of environmental harm posed by such emissions. None of the Justices, including the four dissenters, disputed that the Clean Air Act required EPA to consider control costs in subsequently determining the extent of emissions reduction subsequently required of power plants.²¹⁵

Finally, critics of Justice Breyer’s nomination to the Court failed to appreciate how Justice Breyer’s pragmatism and openness to consideration of regulatory costs and cost-benefit analysis might prompt the Justice to favor upholding EPA regulations those critics favored. In two very significant Clean Air Act cases, EPA v. EME Homer City Generation in 2014 and the recently-decided West Virginia v. EPA, EPA’s legal arguments in favor of the regulations at issue—the Clean Air Interstate Rule in EME Homer²¹⁶ and the Clean Power Plan in West Virginia²¹⁷—were weakened by the absence of clear support in the relevant statutory text. But what strengthened each of those EPA regulations was that both sets of ambitious regulations justified their broad reading of that language by the extent to which it permitted the EPA to take costs and benefits into account. In short, the kinds of economic analysis Justice Breyer favored allowed EPA to adopt more, not less, demanding environmental protection requirements.

To be sure, Justice Breyer has been no Justice Douglas. He has not voted in favor of the position favored by environmentalists in all cases. But nor is it clear that the nation, including environmentalists, should necessarily want such a Justice on the Court. Such a Justice might be a very good environmentalist, but not an especially good judge.

As described above, in eleven of the twenty-three environmental cases in which Justice Breyer voted against the position favored by environmentalists, all the Justices voted the same way.²¹⁸ None dissented, neither Justice Souter, Stevens, Kagan, nor Sotomayor. All the other most progressive Justices on the Court agreed that there was no, or at least too little, merit to the legal position favored by environmentalists. In three more of those twenty-three cases, environmentalists lost by a vote of eight to one. Perhaps a Justice who dissented in those cases could be credited with perceiving actual strength in legal arguments that the others were missing. But it is also quite possible that they would be engaging in the very kind of ideologically driven judicial decision-making that environmentalists correctly condemn in those Justices with very low EP scores like Justices Alito and Scalia. Even those of us who care deeply about environmental protection, and worry no less deeply about the failings of our elected branches of government, should not see a judiciary that decides cases strictly on personal ideology rather than fair consideration of the actual strengths of the competing legal arguments as the proper solution to those failings.

Finally, contrary to the predictions of those advising President Clinton in June 1993, Justice Breyer has not remotely proven to be a dispassionate, “bloodless,” “cold fish” Justice lacking any “innate sense of justice.”²¹⁹ To be sure, Justice Breyer is no Justice Sonia Sotomayor—a Justice whose writings evince a compassion for victims of injustice without ready modern parallel. He is a committed pragmatist. But his striking pragmatism should not be mistaken for a lack of passion. He has proven himself deeply committed to social justice and the fundamental role of the judiciary in its pursuit. He has made that philosophy clear in both his judicial opinions and in his writings outside of the Court, especially his 2005 book, Active Liberty, in which the Justice contends that judges should not merely attend to the need to ensure that individuals are free from governmental coercion but also ensure they enjoy freedom to participate fully in government itself, including the right to vote.²²⁰

CONCLUSION

Justice Breyer was certainly not environmentalists’ dream pick in 1994. And they had good reason to be concerned. But he has proved in actual practice to be an outstanding jurist for the nation and an excellent Justice for environmental protection law.

More fundamentally, Justice Breyer’s record on the Court suggests the wisdom of rethinking what it means to be a “dream” justice. Should it mean having a Justice who shares one’s ideological preferences on certain issues like environmental protection and will vote accordingly? Or should it mean having a Justice whose votes are rooted in a broader understanding of the proper role of the courts in interpreting law and deciding cases, including the central role our Constitution assigns to the judiciary to safeguard certain individual and collective rights? While the former Justice may reliably receive an EP score of 100, the latter is the better Justice, even if that means they sometimes will, as they should, rule in ways that disappoint.²²¹

APPENDIX A

ENVIRONMENTAL LAW CASES OCTOBER TERM 1994–OCTOBER TERM 2021

Case Name	Citation	EP Designation
Dolan v. City of Tigard	512 U.S. 374 (1994)	Dissent
Babbitt v. Sweet Home Chapter Communities for A Great Oregon	515 U.S. 687 (1995)	Majority
Meghrig v. KFC Western	516 U.S. 479 (1996)	Dissent
General Electric v. Joiner	522 U.S. 136 (1997)	Dissent
Steel Company v. Citizens for a Better Environment	523 U.S. 83 (1998)	Dissent/Concur
Ohio Forestry Association v. Sierra Club	523 U.S. 726 (1998)	Dissent
United States v. Bestfoods	524 U.S. 51 (1998)	Majority
City of Monterey v. Del Monte Dunes at Monterey, Ltd.	526 U.S. 687 (1999)	Concur
Friends of the Earth v. Laidlaw Environmental Services, Inc.	528 U.S. 167 (2000)	Majority
Public Lands Council v. Babbitt	529 U.S. 728 (2000)	Majority
Solid Waste Agency of Northern Cook County v. United States	531 U.S. 159 (2001)	Dissent
Whitman v. American Trucking Associations	531 U.S. 457 (2001)	Majority
Palazzolo v. Rhode Island	533 U.S. 606 (2001)	Dissent
Tahoe-Sierra Preservation Council v. Tahoe Regional Planning Agency	535 U.S. 302 (2002)	Majority
Norfolk & Western Railway v. Ayers	538 U.S. 135 (2003)	Majority
Alaska Department of Environmental Conservation v. EPA	540 U.S. 461 (2004)	Majority
South Florida Water Management Dist. v. Miccosukee Tribe	541 U.S. 95 (2004)	Majority
Engine Manufacturers Association v. South Coast Air Quality Management District	541 U.S. 246 (2004)	Dissent
Department of Transportation v. Public Citizen	541 U.S. 752 (2004)	Dissent
Norton v. Southern Utah Wilderness Association	542 U.S. 55 (2004)	Dissent
Bates v. Dow Agrosciences	544 U.S. 431 (2005)	Majority
Lingle v. Chevron U.S.A., Inc.	544 U.S. 528 (2005)	Majority
Kelo v. City of New London	545 U.S. 469 (2005)	Majority
S.D. Warren v. Maine Board of Environmental Protection	547 U.S. 370 (2006)	Majority
Rapanos v. United States	547 U.S. 715 (2006)	Dissent
Massachusetts v. EPA	549 U.S. 497 (2007)	Majority
Environmental Defense Fund v. Duke Energy Corporation	549 U.S. 561 (2007)	Majority
United Haulers Association v. Oneida-Herkimer Solid Waste Management Authority	550 U.S. 330 (2007)	Plurality
United States v. Atlantic Research Corporation	551 U.S. 128 (2007)	Majority
National Association of Home Builders v. Defenders of Wildlife	551 U.S. 644 (2007)	Dissent
Exxon Shipping Company v. Baker	554 U.S. 471 (2008)	Dissent in part
Winter v. National Resources Defense Council, Inc.	555 U.S. 7 (2008)	Dissent
Summers v. Earth Island Institute	555 U.S. 488 (2009)	Dissent
Entergy Corporation v. Riverkeeper, Inc.	556 U.S. 208 (2009)	Dissent
Burlington Northern & Santa Fe Railway Company v. United States	556 U.S. 599 (2009)	Dissent
Coeur Alaska, Inc. v. Southeast Alaska Conservation Council	557 U.S. 261 (2009)	Dissent
Stop the Beach Renourishment, Inc. v. Florida Department of Environmental Protection	560 U.S. 702 (2010)	Concurrence in part
Monsanto Company v. Geertson Seed Farm	561 U.S. 139 (2010)	Dissent
American Electric Power Company v. Connecticut	564 U.S. 410 (2011)	Dissent
Sackett v. EPA	566 U.S. 120 (2012)	Dissent
Arkansas Game and Fish Commission v. United States	568 U.S. 23 (2012)	Dissent
Decker v. Northwest Environmental Defense Center	568 U.S. 597 (2013)	Dissent
Koontz v. St. Johns River Water Management District	570 U.S. 595 (2013)	Dissent
EPA v. EME Homer City Generation L.P.	572 U.S. 489 (2014)	Majority
Utility Air Regulatory Group v. EPA	573 U.S. 302 (2014)	Concur/Dissent
Michigan v. EPA	576 U.S. 743 (2015)	Dissent
Federal Energy Regulatory Commission v. Electric Power Supply Association	577 U.S. 260 (2016)	Majority
Sturgeon v. Frost	577 U.S. 424 (2016)	Dissent
United States Army Corps of Engineers v. Hawkes Company	578 U.S. 590 (2016)	Dissent
Murr v. Wisconsin	137 S. Ct. 1933 (2017)	Majority
Weyerhaeuser Company v. United States Fish and Wildlife Service	139 S. Ct. 361 (2018)	Dissent
Sturgeon v. Frost II	139 S. Ct. 1066 (2019)	Dissent
Virginia Uranium, Inc. v Warren	139 S. Ct. 1894 (2019)	Majority
Knick v. Township of Scott	139 S. Ct. 2162 (2019)	Dissent
Atlantic Richfield Co. v. Christian	140 S. Ct. 1335 (2020)	All But Alito
County of Maui v. Hawaii Wildlife Fund	140 S. Ct. 1462 (2020)	Majority
United States Forest Service v. Cowpasture River Preservation Association	140 S. Ct 1837 (2020)	Dissent
United States Fish and Wildlife Service v. Sierra Club	141 S. Ct. 777 (2021)	Dissent
Guam v. United States	141 S. Ct 1608 (2021)	Majority
Cedar Point Nursery v. Hassid	141 S. Ct. 2063 (2021)	Dissent
Hollyfrontier Cheyenne Refining, LLC v. Renewable Fuels Association	141 S. Ct 2172 (2021)	Dissent
PennEast Pipeline Company v. New Jersey	141 S. Ct 2244 (2021)	Dissent
West Virginia v. EPA	142 S. Ct. 2587 (2022)	Dissent

APPENDIX B

ENVIRONMENTAL PROTECTION (“EP”) SCORES OF SELECTED INDIVIDUAL JUSTICES

OCTOBER TERM 1994–OCTOBER TERM 2021

Justice	Number of Cases	EP Points	EP Score
Breyer	61	38	62.3
Scalia	47	11	23.4
Stevens	37	29	78.4
Kennedy	50	18	36
Thomas	63	13	20.6
Souter	36	29	80.6
Ginsburg	57	41	71.9
CJ Roberts	40	8	20
Alito	38	4	10.5
Sotomayor	25	16	64
Kagan	25	17	68

95 S. Cal. L. Rev. 1395

Download

Richard J. Lazarus*

* Howard J. & Katherine W. Professor of Law, Harvard Law School. I would like to thank Professors Hope Babcock, Jonathan Cannon, Bill Funk, Steph Tai, and Susannah Weaver for their terrific comments on a preliminary draft of this Article. Kathryn C. Reed, Harvard Law School Class of 2022, provided outstanding research and editorial assistance. Although it has no bearing on my analysis, for the sake of disclosure I served as counsel for either one of the parties or an amicus in the following Supreme Court cases that fall within this Article’s scope of review: Dolan v. City of Tigard, 512 U.S. 374 (1994); Palazzolo v. Rhode Island, 533 U.S. 606 (2001); Tahoe-Sierra Pres. Council v. Tahoe Reg’l Plan. Agency, 535 U.S. 302 (2002); Norfolk & Western Ry. v. Ayers, 538 U.S. 135 (2003); S. Fla. Water Mgmt. Dist. v. Miccosukee Tribe, 541 U.S. 95 (2004); S.D. Warren Co. v. Me. Bd. of Env’t Prot., 547 U.S. 370 (2006); Entergy Corp. v. Riverkeeper, Inc., 556 U.S. 208 (2009); Monsanto Co. v. Geertson Seed Farm, 561 U.S. 139 (2010); and Murr v. Wisconsin, 137 S. Ct. 1933 (2017).

Justice Breyer’s Friendly Legacy for Environmental Law

Richard J. Lazarus*

Environmentalists did not cheer President Bill Clinton’s decision in May 1994 to nominate then-First Circuit Judge Stephen Breyer to fill

Should Humanity Have Standing? Securing Environmental Rights in the United States

Daniel C. Esty*

While courts around the world are increasingly recognizing rights of nature or the rights of individuals or communities to a

Standing for Rivers, Mountains—and Trees—in the Anthropocene

David Takacs*

In his well-known article, Should Trees Have Standing?—Toward Legal Rights for Natural Objects, Professor Christopher Stone proposed that courts grant

Fish, Whales, and a Blue Ethics for the Anthropocene: How Do We Think About the Last Wild Food in the Twenty-First Century

Robin Kundis Craig*

One of the lesser celebrated threads of Christopher Stone’s scholarship was his interest in the ocean—especially international fisheries and whaling.

Identifying Contemporary Rights of Nature in the United States

Karen Bradshaw*

The Rights of Nature movement is at the precipice of watershed social changes. Leaders of this international, Indigenous-led movement call