Ihab Ilyas, professor in the Cheriton School of Computer Science at the University of Waterloo and cofounder of Tamr, a startup focusing on large-scale data integration and cleaning, will be joining ODX for a talk on Tuesday May 23, 2017 from 12:00 - 1:00pm at the Communitech Data Hub. Can't make it? Watch it live from your desk.

*WARNING* This will be our most technical talk yet. We recommend it to data analysts, data scientists, and developers who want to use machine learning to understand open data and integrate it with their own.

Details


Date: Tue, 23 May 2017
Time: 12:00 PM – 1:00 PM EST
Location: Communitech Data Hub | 14 Erb Street West, Waterloo, ON (Bring Your Own Lunch)
Live stream: Online

Communitech Data Hub

Description


A recurring challenge in open data is finding efficient ways to map the entities one finds in open data - things like company names, locations, and so on - back to a company's internal representation of those entities. This area of work is known as data curation.

Machine learning tools promise to help solve data curation problems. While the principles are well understood, the engineering details in configuring and deploying ML techniques are the biggest hurdle. Ihab Ilyas explains why leveraging data semantics and domain-specific knowledge is key in delivering the optimizations necessary for truly scalable ML curation solutions.

About the speaker

Headshot of Ihab Ilyas


Ihab Ilyas's main research is in the area of database systems, with special interest in data quality and integration, managing uncertain data, rank-aware query processing, and information extraction. Ihab is also a co-founder of Tamr, a startup focusing on large-scale data integration and cleaning. He is a recipient of the Ontario Early Researcher Award (2009), a Cheriton Faculty Fellowship (2013), an NSERC Discovery Accelerator Award (2014), and a Google Faculty Award (2014), and he is an ACM Distinguished Scientist. Ihab is an elected member of the VLDB Endowment board of trustees and an associate editor of the ACM Transactions of Database Systems (TODS). He received his PhD in computer science from Purdue University, West Lafayette.