In response to this research project at least 5 image training datasets were terminated, deactivated, or retracted. The State of AI Report 2020, a Nature magazine feature, and a U.S. Government Accountability Office (GAO) report cited this project's research in what may be turning point in how image training datasets are created and distributed.
- September 1: Altinget.dk writes about face recognition datasets in the context of data sharing within Europe (Danish)
- August 13: MIT Technology Review writes Deleting unethical data sets isn’t good enough
- August 9: Princeton researchers write about the problem of derivative datasets. Following the citation analysis approach of Exposing.ai, they find that even though datasets like DukeMTMC and MegaFace have been taken down, several datasets derived from them remain accessible and are still in use.
- Kate Crawford writes about Exposing.ai in Atlas of AI: Power, Politics, and the Planetary Costs of Artificial Intelligence
- April 2: NRC covers our Flickr search engine (Dutch)
New York Times writes about the launch of the new Exposing.ai search engine tool.
- State of AI Report 2020 highlights research from the MegaPixels project about the MS-Celeb-1M face dataseat in their analysis of recent developments in facial recognition. (slide 136)
- June 11: MegaFace dataset is now decommissioned. University of Washington has ceased distributing the MegaFace dataset citing the challenge has concluded and that maintenance of their platform would be too burdensome.
- The webpage for the Oxford Town Centre dataset has been taken down, and the dataset is apparently no longer being distributed.
MegaPixels (now Exposing.ai) contributes to Munich Security Conference Transational Security Report on illicit data flows
- May 31: Semantic Scholar appears to be censoring citations used in this project. Two of the citations linking the Brainwash dataset to research from the National University of Defense Technology (NUDT) in China have disabled. NUDT citation 1, NUDT citation 2, and the original paper show that the NUDT citation has been censored (see the references section on Semantic Scholar pages)
- May 28: The Microsoft Celeb (MS-Celeb-1M) face dataset website is now 404 and all the download links were deactivated. It appears that someone at Microsoft Research has shuttered access to the MS Celeb dataset. Yet it remains available, as of writing this, on Imperial College London's website and on https://msropendata.com/datasets/98fdfc70-85ee-5288-a69f-d859bbe9c737
- May 29, 2019: Stories about the UnConstrained College Students Dataset appeared on Engadget, AP News, New York Times, US News, Daily Dot, Washington Post, MSN, International Association of Privacy Professionals, The Denver Channel, Daily Mail, New York Post, Yahoo! News
- May 27, 2019: Denver Post writes about the UCCS dataset: CU Colorado Springs students secretly photographed for government-backed facial-recognition research
- May 22, 2019: Interview with CS Indy about the UCCS dataset UCCS secretly photographed students to advance facial recognition technology by J. Adrian Stanley