DLF Forum 2015 has ended
Welcome to the 2015 DLF Forum! Community Notes folder: http://bit.ly/1kHKur8

@CLIRDLF | #DLFforum #ourDLF | #dlfLAC

Back To Schedule
Wednesday, October 28 • 9:00am - 10:30am
Statistical DPLA: Metadata Counting and Word Analysis • Think Globally, Act Locally: How Working with DPLA has Improved Our Collections

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Presentation 1
Statistical DPLA: Metadata Counting and Word Analysis
Building on work recently presented at DPLAFest, this presentation will highlight next steps in research into metadata quantification and metadata quality. The presentation will begin with a review of phase one of the project, which focused on element counts, top level word counts across fields and across collections, and techniques for visualizing these data. The focus of phase two is applying Natural Language Processing (NLP) techniques to understand word frequencies, common words, and rare / unique words across the DPLA text corpus.

Questions posed include:

• Are there naturally occurring clusters of words that differentiate DPLA providers?
• Are there differences in language patterns between providers or between hub types (content vs. service)?
• Are there gaps or differences, or is there alignment, in the language used in search terms versus collection metadata?
• Are there relationships between language in search terms, metadata terms, and item usage as measured by Google Analytics?
• Are there patterns to the language used in Twitter references to DPLA items?

The presentation will provide preliminary responses to these research questions and discuss the development of Metadata NLP and term / N-gram frequency visualization techniques. This will be followed by a participatory discussion of metadata language research; its relationship to metadata best practice, quality, and completeness; and possible next steps to expand exploration of these techniques beyond DPLA.

Presenter: Corey Harper (New York University)

Think Globally, Act Locally: How Working with DPLA has Improved Our Collections
This panel discussion brings together various people involved with resource aggregation for the Digital Public Library of America (DPLA). Each person will discuss how working as an aggregator for the DPLA has changed how they approach, work with and describe their own local collections. This can involve metadata processes, collection organization, tools used, local discovery practices, or other workflows related to digital collections. To keep the style of the panel informal and open to audience participation, each panelist will have only a short (~5 minutes) time to present an overview of their position at the start of the event. Then the group will be guided through a series of questions and discussion points, some agreed upon in advance, some taken from the audience, which will further elucidate how working with the DPLA has positively impacted their own work.

Presenters: Emily Gore (Digital Public Library of America), Marcia McIntosh (University of North Texas), Linda Ballinger (Penn State University), Chris Stanton (Metropolitan New York Library Council / Empire State Digital Network), Christina Harlow (University of Tennessee, Knoxville)


Linda Ballinger

Metadata Strategist, Penn State University Libraries
avatar for Emily Gore

Emily Gore

Director of Content, Digital Public Library of America
avatar for Christina Harlow

Christina Harlow

Data Operations Engineer, Stanford University

Corey Harper

Metadata Services Librarian, New York University
avatar for Marcia McIntosh

Marcia McIntosh

Digital Production Librarian, University of North Texas
University of North Texas
avatar for Chris Stanton

Chris Stanton

Metadata Specialist, Metropolitan New York Library Council

Wednesday October 28, 2015 9:00am - 10:30am PDT
Ballroom I

Attendees (0)