Show simple item record

dc.contributor.authorCarson, Jana
dc.contributor.authorGordon, Shannon 13:51:20 (GMT) 13:51:20 (GMT)
dc.description.abstractThe University of Waterloo recognizes bibliometric data as an important piece of evidence-based research assessment, and recommends bibliometric data as one measure, among many, for capturing research productivity trends, and elements of research impact. Even when working from a basket of measures, bibliometric data remains complex and requires significant cleaning due to issues of name ambiguity. This session will explore an innovative collaboration between the Library and Institutional Analysis and Planning (IAP) to support the integrity of local, discipline-level bibliometric data by automating key data processes of an internal project. This session will introduce how bibliometric data is relevant to the University, the process used to gather and vet local bibliometric data, and the ways in which key data processes have been successfully automated using Python and a database to support efficient reporting. Given known challenges presented by name ambiguity, this collaborative framework makes it possible to support the integrity of local bibliometric data—a key step in supporting this and similar in-demand analyses at the University.en
dc.subjectResearch Productivityen
dc.subjectBig Dataen
dc.subjectData Cleanupen
dc.subjectUniversity Partnershipsen
dc.titleAutomating Big Data Cleaning: An Example Using Local Bibliometric Dataen
dc.typeConference Presentationen
dcterms.bibliographicCitationCarson, J., & Gordon, S. (2016). Automating Big Data Cleaning: An Example Using Local Bibliometric Data. Conference presentation as presented at WatITIs, Waterloo, On.en
uws.contributor.affiliation1Waterloo Libraryen

Files in this item


This item appears in the following Collection(s)

Show simple item record


University of Waterloo Library
200 University Avenue West
Waterloo, Ontario, Canada N2L 3G1
519 888 4883

All items in UWSpace are protected by copyright, with all rights reserved.

DSpace software

Service outages