Skip Navigation


CIT can broadcast your seminar, conference or meeting live to a world-wide audience over the Internet as a real-time streaming video. The event can be recorded and made available for viewers to watch at their convenience as an on-demand video or a downloadable file. CIT can also broadcast NIH-only or HHS-only content.

The event ended, check back after it has been posted to the Past Events section.

 

Google Dataset Search: Facilitating data discovery in an open ecosystem.

   
Air date: Thursday, November 14, 2019, 2:00:00 PM
Time displayed is Eastern Time, Washington DC Local
Description: There are thousands of data repositories on the Web, providing access to millions of datasets. National and regional governments, scientific publishers and consortia, commercial data providers, and others publish data for fields ranging from social science to life science to high-energy physics to climate science and more. Access to this data is critical to facilitating reproducibility of research results, enabling scientists to build on others’ work, and providing data journalists easier access to information and its provenance. In this talk, I will discuss recently launched Google Dataset Search, which provides search capabilities over potentially all dataset repositories on the Web. I will talk about the open ecosystem for describing and citing datasets that we hope to encourage and the technical details on how we went about building Dataset Search. Finally, I will highlight research challenges in building a vibrant, heterogeneous, and open ecosystem where data becomes a first-class citizen.

Links: https://toolbox.google.com/datasetsearch https://www.blog.google/products/search/making-it-easier-discover-datasets/
Author: Chris Gorgolewski
Runtime: 1 hour