Ask a Data Ethicist:
Reusing data is a fundamental part of artificial intelligence and machine learning. Yet when we collect data for one purpose and then use it for another, we could be crossing both legal and ethical boundaries.
How can we address the ethics of reusing data?
Understand Your Data
Before we address the issue of reuse, we should first seek to understand the original context of our dataset. Why do we have this data? Where did it come from? Who collected it? For what purpose? What else do we know about it?
One tool that can help data scientists answer these questions is Datasheets for Datasets, first proposed by a group of researchers led by Dr. Timnit Gebru. A datasheet records vital information about a dataset: it addresses the kinds of social questions posed above as well as other technical details about the data. You can create your own datasheet or you can find an example of one in Ethically Aligned AI’s Ethics Toolkit.
Your organization might also have data governance software tools that can provide some or perhaps all of these details. Knowing your data lineage, data quality, and other pertinent details can help you to make better decisions about the fitness of the data as you assess its usefulness in the context of your new use case.
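For teams that want to keep this context next to the data itself, the questions above can be captured in a small machine-readable record. The following is a minimal sketch only: the `Datasheet` class, its field names, and the `approved_uses` idea are hypothetical illustrations, not the official Datasheets for Datasets template or any governance tool's schema.

```python
from dataclasses import dataclass, field


@dataclass
class Datasheet:
    """Hypothetical, minimal record of a dataset's original context."""
    name: str
    source: str              # where the data came from
    collected_by: str        # who collected it
    original_purpose: str    # why it was collected
    approved_uses: set[str] = field(default_factory=set)  # assumed: uses already reviewed
    notes: str = ""          # anything else known about the data

    def missing_fields(self) -> list[str]:
        """Return the context questions that are still unanswered."""
        required = ("source", "collected_by", "original_purpose")
        return [f for f in required if not getattr(self, f).strip()]

    def is_documented_use(self, proposed_use: str) -> bool:
        """True only if the proposed use is the original purpose or an approved use."""
        allowed = {self.original_purpose, *self.approved_uses}
        return proposed_use in allowed


# Illustrative values only; this dataset does not exist.
sheet = Datasheet(
    name="support_tickets_2023",
    source="customer support portal",
    collected_by="",  # unknown: a gap worth resolving before reuse
    original_purpose="resolving support requests",
)

print(sheet.missing_fields())
print(sheet.is_documented_use("training a churn model"))
```

A result of `False` from `is_documented_use` is not a verdict that the reuse is wrong. It simply flags that the new use case falls outside what was documented, which is the point at which a legal and ethical review should begin.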