Ricardo Baeza in the DTIC Research Seminar

Data and Algorithmic Bias in the Web 

28th September

12:00 hours  

Room 52.221

Abstract. The Web is the largest public big data repository that humankind has created. In this overwhelming data ocean, we need to be aware of the quality of the data and, in particular, of the biases that exist in it. On the Web, biases also come from redundancy and spam, as well as from the algorithms that we design to improve the user experience. The problem is further exacerbated by the biases that these algorithms add, especially in the context of search and recommendation systems. These include selection and presentation bias in many forms, interaction bias, social bias, etc. We give several examples and discuss their relation to sparsity and privacy, stressing the importance of the user context in avoiding these biases.

