A Commons Approach to Data Governance
In 2017, data was the new .
By 2018, data was emphatically the new oil.
Now it鈥檚 2019. Could data be the new 鈥 community forest? Not exactly, but this metaphor might be worth considering if we鈥檙e serious about figuring out how to regulate and manage our vastly growing data resources. Before we can talk about how data is similar to a community forest, we should talk about why data is not similar to your house, your car, or any other type of private property.
We gravitate to the idea that we can 鈥渙wn鈥 our data. 鈥淚t鈥檚 my data鈥 is a common refrain in privacy conversations. Earlier this year, Sen. John Kennedy (R-La.) introduced a bill called the . And California Gov. Gavin Newsom, a Democrat, has called for a 鈥,鈥 which would allow Californians to be paid for their digital trails.
And yet, the idea that data is akin to private property is easy to debunk.
If you think of owning data in the same way you own a watch or a soda can, things get silly quickly. If I give you my watch, you have a watch and I no longer have one. If I give you 鈥榤y鈥 data 鈥 well, I still have my data too, so what then? I can prevent you from wearing my watch, but it鈥檚 near impossible to prevent all third parties from using my data. As a recent article correctly points out: 鈥淪uch a data ownership regime is not practicable.鈥
The takedown is all well and good, except that private property is far from the only property governance system, and ownership is far from the only type of property right. In fact, it is estimated that of the world鈥檚 most ubiquitous form of real property鈥攍and鈥攊s governed communally. So globally, private property rights are not only not the rule, they are the exception.
What happens then, if we broaden our frame? If data isn鈥檛 private property, what type of resource is it?
Anyone who has taken Property 101 in an American law school will remember the concept of joint tenancy, a governance regime in which property is divided up amongst a handful of people who each have equal rights and obligations in the property. It鈥檚 similar to renting an apartment with a friend.
But an even more flexible type of governance regime鈥攐ne we see in practice all over the world鈥攊s a 鈥渃ommons.鈥
A commons doesn鈥檛 have a uniform definition, but is understood as any natural or manmade resource that is or could be held, used, and managed in common by many people or by society. Grazing lands, public parks, playgrounds, roads, fisheries, forests, and air are all types of commons in the natural world.
Over the last half century, multiple economists, most prominently Nobel Prize winner Elinor Ostrom, have pioneered the field of commons governance in the natural world. Ostrom broke down the world of natural resources into four categories, governed by the twin axes of rivalry and exclusion. A resource is excludable if you can prevent others from using it. It鈥檚 rivalrous if it can be 鈥渦sed up.鈥
| Excludable | Non-excludable | |
|---|---|---|
| Rivalrous | Private Goods food, clothing, cars, parking spaces |
Common Goods (Common-pool resources) fish stocks, timber, coal |
| Non-rivalrous | Club Goods cinemas, private parks, satellite television |
Public Goods free-to-air television, air, national defense |
So, for example, a soda can is a private good because it is excludable (I can keep you from drinking my soda) and it is rivalrous (if I drink the soda, there鈥檚 none left for you). By contrast, a forest is a public good. I can鈥檛 exclude you from being in the forest, and my use of the forest doesn鈥檛 preclude your use. By contrast, a private park or a country club is a club good. My use of it doesn鈥檛 preclude your use, but I can exclude you from using it (either outright or by charging an entrance fee).
In Ostrom鈥檚 model, only the resources in the upper left hand quadrant are considered private property. The resources in the other three quadrants are governed as a commons.
What does this have to do with data? While the concept of the 鈥渃ommons鈥 was originally developed to govern natural resources, it applies nicely to data as well.
Looking at Ostrom鈥檚 quadrants, it quickly becomes obvious why most data does not fit the definition of a private good, despite the attempts of scholars to place it there. But by looking at data through the other quadrants, in particular the club good and public good quadrants, things start to become more interesting. While data lacks many of the characteristics of private property, it shares a number of important similarities with different types of commons.
Just like a commons:
- Data is a resource which many people can and do use simultaneously, and for varying purposes. Similar to a community forest, which can be used by different people for different reasons, data can also be used for multiple simultaneous reasons. For example, I may use data collected by Facebook to connect with my friends, Facebook may use data collected by Facebook to gather information for advertising purposes, and advertisers may use data collected by Facebook to sell me things.
- Data is a resource that is more valuable when packaged together rather than siloed or broken down into individually owned chunks. Think about the ecosystem services and carbon storage provided by the Amazon rainforest. Compare that to the value of big data for medical research and large, representative data sets for everything from medical research to google searches. Conversely, think about the missed opportunity created by data silos (well illustrated by Open Referral鈥檚 about the failures of community research directories that don鈥檛 share information with one another).
- Data is a resource that doesn鈥檛 easily get depleted. This is probably the most important differentiating aspect of data. In fact, data is generative, with new data being created every second through each of our interactions.
Looking at data through a 鈥渃ommons鈥 lens provides a way for us to distribute the benefits of this resource widely and equitably, without commodifying or privatizing it. For example, organizations such as the Open Data Institute, Digital Public and the Stanford Center on Philanthropy and Civil Society have written about data trusts, which are an example of a data commons. These trusts serve as a way to govern data as if it were a public or club good, as opposed to a private good, and ensure that users enjoy not only access to the good itself (data) but also share in its derived benefits of that good. Google鈥檚 parent company, Alphabet, has already put this model into practice with its , which makes use of a data trust managed by an independent agency.
Another example, the , allows researchers to access scientific data as long as they agree to share the outcome of their research with others in the commons.
And, applying Ostrom鈥檚 scholarship to data governance provides flexible structures that allow this resource to be managed by different communities in accordance with their unique rules, norms, and values, instead of being subject to international regulations. For example, scholars like NYU鈥檚 Kathy Strandburg and Villanova鈥檚 Brett Frischmann have begun to adapt Ostrom鈥檚 famous Institutional Analysis and Development framework into a 鈥樷 framework.
So, what problem does this help solve?
At the end of the 20th century, Elinor Ostrom鈥檚 work introduced a 鈥渢hird way鈥 to manage natural resources. Instead of relying on top-down government regulation, or bottom-up market privatization, Ostrom introduced a blended approach that maximizes flexibility instead of mandating a one size fits all approach.
The parallel holds true for data. The current debate over how to govern data centers on two poles: privacy, and private property. Each of these poles, while helpfully answering certain governance questions, introduces a dogma that is unhelpful when dealing with a resource that is diverse and changing quickly.
Introducing a third frame鈥攖he commons鈥攎ay unlock new pieces of the data governance puzzle, and help us solve the problem of how to regulate and manage this new and rapidly changing resource.