In Data We Trust
In Data We Trust
This chapter discusses issues of data quality and veracity in open datasets, using a variety of examples from the Irish data system. These examples include the Residential Property Price Register (RPPR), the Dublin Dashboard project, the TRIPS database, and Irish crime data. There are a number of issues with Irish crime data, such as crimes being recorded in relation to the police stations that handle them, rather than the location they are committed. There are also issues in the standardization of crime categorization, with some police officers recording the same crimes in slightly different ways, and also in timeliness of recording. Moreover, there are difficulties of retrieving data from the crime management software system. In addition to errors, every dataset has issues of representativeness — that is, the extent to which the data faithfully represents that which it seeks to measure. In generating data, processes of extraction, abstraction, generalization and sampling can introduce measurement error, noise, imprecision and bias. Yet internationally, there has been much work expended on formulating data-quality guidelines and standards, trying to get those generating and sharing data to adhere to them, and promoting the importance of reporting this information to users.
Keywords: data quality, data veracity, open datasets, Irish data system, Irish crime data, representativeness, data creation, measurement error, bias
Policy Press Scholarship Online requires a subscription or purchase to access the full text of books within the service. Public users can however freely search the site and view the abstracts and keywords for each book and chapter.
Please, subscribe or login to access full text content.
If you think you should have access to this title, please contact your librarian.
To troubleshoot, please check our FAQs, and if you can't find the answer there, please contact us.