External seminar: Dark Data: Why What You Don’t Know Matters

David J. Hand (Imperial College London)

Monday 2nd November 2020, 13:00-14:00, Zoom


Dark data are data you don’t have. It might be that you want today’s data, but all you have is yesterday’s. It might be that certain types of cases are missing from your sample. It might be that the recorded values are inaccurate – no measuring instrument is perfect. It might be that the process of collecting the data changes the data themselves. It might be that you have only summary values, like averages, which tell you nothing about extremes. Or it might be that data have been collected and stored but not analysed – perhaps they were collected for regulatory compliance reasons. I outline a taxonomy of fifteen types of dark data, showing just how serious the consequences can be. But then I go further, showing strategies for coping with dark data, and even how to take advantage of them in a strategic application of ignorance.
(The book is available in electronic form in the University library and is worth a read: http://tinyurl.com/yxwtsotq)

